BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023180
(286 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 453 bits (1165), Expect = e-125, Method: Compositional matrix adjust.
Identities = 208/277 (75%), Positives = 242/277 (87%), Gaps = 2/277 (0%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
KLGL T ++ CS FL GFYGST LS++VP ++P+LR LE+VE +E+ +PHG TG+
Sbjct: 12 KLGLATVIVFCSLCFLVGFYGSTLLSQNVPRVKPRLRMLEMVENGEEDTGSMPHGVTGEE 71
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
S+ SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRT
Sbjct: 72 SVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSGTFISASED TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
GPQ SQR+ASFLLYLSDVEEGGETMFPFENG + +GYDYK+CIGLKVKPR+GDGLLFYS
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFYS 251
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
+FPNGTID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 252 VFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 452 bits (1164), Expect = e-125, Method: Compositional matrix adjust.
Identities = 206/273 (75%), Positives = 242/273 (88%)
Query: 13 GLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQS 72
GLP +L+CS FF+AGFY ST +S+DVP I+P+LR LEV ++E +P G TG++ I+S
Sbjct: 14 GLPAVILVCSVFFVAGFYASTLISQDVPVIKPRLRMLEVTDEEKHQAMPRGVTGESYIES 73
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
IPFQVLSW+PRA+YFP+FA+ EQC++II AK RLKPS LALR+GET ESTKGTRTSSGT
Sbjct: 74 IPFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTRTSSGT 133
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F+SASED TG L+ IEHKIARATM+P++HGEAFN+LRYEIGQKYDSHYD+FNPAEYGPQM
Sbjct: 134 FLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQM 193
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
SQR+ASFLLYLSDVE+GGETMFPFENG+ + S YDYKKC GLKVKPR+GDG+LFYSL PN
Sbjct: 194 SQRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCAGLKVKPRQGDGILFYSLLPN 253
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
GTID+TSLHGSCPVI+GEKWVATKWIRDQ Q +
Sbjct: 254 GTIDQTSLHGSCPVIEGEKWVATKWIRDQVQMD 286
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 449 bits (1155), Expect = e-124, Method: Compositional matrix adjust.
Identities = 210/277 (75%), Positives = 242/277 (87%), Gaps = 2/277 (0%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEK-ENESG-LPHGETGDA 68
KLGL T ++ CS FL GFYGST LS++VP ++P+LR LE+VE E E+G +PHG TG+
Sbjct: 12 KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLEMVENGEEEAGSMPHGVTGEE 71
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRT
Sbjct: 72 SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
GPQ SQR+ASFLLYLSDVEEGGETMFPFENG + GYDYK+CIGLKVKPR+GDGLLFYS
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYS 251
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
+FPNGTID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 252 VFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 449 bits (1154), Expect = e-124, Method: Compositional matrix adjust.
Identities = 208/277 (75%), Positives = 241/277 (87%), Gaps = 2/277 (0%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
KLGL T ++ CS FL GFYGST LS++VP ++P+LR L++VE +E S +PHG TG+
Sbjct: 12 KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLDMVENGEEEASSMPHGVTGEE 71
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRT
Sbjct: 72 SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
GPQ SQR+ASFLLYLSDVEEGGETMFPFENG + GYDYK+CIGLKVKPR+GDGLLFYS
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYS 251
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
+FPNGTID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 252 VFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/289 (72%), Positives = 240/289 (83%), Gaps = 9/289 (3%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR--------DVPSIRPKLRTLEVV 52
MK K KRS KLGLP A++LCS FFLAGFY S F DVP RP+LR LEV
Sbjct: 1 MKIKTKRSKRKLGLPAAIILCSLFFLAGFYSSIFFPHVSASLSACDVPVTRPRLRMLEVA 60
Query: 53 EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
E ++++ +PHG TG+AS++SIPFQVLSW+PRALYFP FA+ EQC+SII + +LKPS L
Sbjct: 61 EGDHDT-MPHGVTGEASVESIPFQVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTL 119
Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
ALR+GET ESTK TRTSSG+F+S SED+TG L+ IE KIA+ATM+PQ+HGEAFN+LRYEI
Sbjct: 120 ALRKGETAESTKDTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEI 179
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
GQKYDSHYDAFNP EYG Q SQR ASFLLYLS+VEEGGETMFPFENG + G+DYK+C+
Sbjct: 180 GQKYDSHYDAFNPDEYGQQSSQRTASFLLYLSNVEEGGETMFPFENGSAVIPGFDYKQCV 239
Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
GLKVKPR+GDGLLFYSLFPNGTID TSLHGSCPVIKG KWVATKWIRDQ
Sbjct: 240 GLKVKPRQGDGLLFYSLFPNGTIDPTSLHGSCPVIKGVKWVATKWIRDQ 288
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 436 bits (1122), Expect = e-120, Method: Compositional matrix adjust.
Identities = 211/286 (73%), Positives = 243/286 (84%), Gaps = 3/286 (1%)
Query: 1 MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
MKGK K KLGL + SFFFLAG +GS S+DV +R + R LE VE+ S
Sbjct: 1 MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQPRLLESVEEY--SP 58
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+PHGETG++S+ IPFQVLSW+PRALYFP FA+AEQCQSII AK L+PS LALRQGET
Sbjct: 59 MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 118
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 119 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 178
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE+ + + +GYDYKKCIGLKVKP+
Sbjct: 179 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQ 238
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
RGDGLLFYS+FPNGTIDRTSLHGSCPVI GEKWVATKWIRD++Q +
Sbjct: 239 RGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 284
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 206/291 (70%), Positives = 240/291 (82%), Gaps = 6/291 (2%)
Query: 1 MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
MKGK+ RS+ +KLGLP + +LC F FLAGF+GST LS+DV RP+ R L+
Sbjct: 1 MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
Query: 56 NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
E L GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+ AK +L+PS LAL
Sbjct: 61 TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
R+GET ESTKG RTSSG F SASED++G L +IE KIARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D Y+++ CIGL
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
KVKPR+GDGLLFYS+FPNGTID TSLHGSCPVIKG+KWVATKWIRDQ Q +
Sbjct: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/291 (70%), Positives = 239/291 (82%), Gaps = 6/291 (2%)
Query: 1 MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
MKGK+ RS+ +KLGLP + +LC F FLAGF+GST LS+DV RP+ R L+
Sbjct: 1 MKGKSGRSNWSLRSKLGLPALIFVLCLFXFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
Query: 56 NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
E L GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+ AK +L+PS LAL
Sbjct: 61 TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
R+GET ESTKG RTSSG F SASED++G L +IE K ARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQ 180
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D Y+++ CIGL
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
KVKPR+GDGLLFYS+FPNGTID TSLHGSCPVIKG+KWVATKWIRDQ Q +
Sbjct: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 422 bits (1086), Expect = e-116, Method: Compositional matrix adjust.
Identities = 208/286 (72%), Positives = 238/286 (83%), Gaps = 11/286 (3%)
Query: 1 MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
MKGK K KLGL + SFFFLAG +GS S+ R LE VE+ S
Sbjct: 1 MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQP--------RLLESVEEY--SP 50
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+PHGETG++S+ IPFQVLSW+PRALYFP FA+AEQCQSII AK L+PS LALRQGET
Sbjct: 51 MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 110
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 111 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 170
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE+ + + +GYDYKKCIGLKVKP+
Sbjct: 171 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQ 230
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
RGDGLLFYS+FPNGTIDRTSLHGSCPVI GEKWVATKWIRD++Q +
Sbjct: 231 RGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 276
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 202/291 (69%), Positives = 235/291 (80%), Gaps = 7/291 (2%)
Query: 1 MKGKAKRSS----TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE- 55
MK K + +KLGLP L C FFFLAG + S +S++V + + R L+ V++E
Sbjct: 1 MKAKGSKGKWSIKSKLGLPVVFLSCLFFFLAGLFASNLISQNVNGDKNR-RQLQWVKEEI 59
Query: 56 -NESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
LP G+TGD + IPFQVLSW+PRALYFPNFA+AEQCQS+I AK L PS LAL
Sbjct: 60 IEYDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLAL 119
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
R+GET E+TKG RTSSG F+SASEDKTG+L+ IE KIARATMLP+ +GEAFN+LRYEIGQ
Sbjct: 120 RKGETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQ 179
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
KY+SHYDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN + +D YD++KCIGL
Sbjct: 180 KYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIGL 239
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
+V+PRRGDGLLFYSLFPN TID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 240 QVRPRRGDGLLFYSLFPNNTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 290
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 197/285 (69%), Positives = 234/285 (82%), Gaps = 2/285 (0%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTL-EVVEKENESG 59
MKGK K S KLG+PT +LC+ FF GF+ S L +D+ + P+ R L E V+KE E
Sbjct: 2 MKGKVKSSKLKLGVPTLFILCALFFFVGFFVSPLLFQDLDDVGPRSRILQESVKKEYEP- 60
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
L HGE+G+ + SIP Q+LSWRPRA++FPNF S E CQ II AK +L+PS+LALR+GET
Sbjct: 61 LEHGESGEPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGET 120
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTK TRTSSGTFISASEDK+GIL+L+E KIA+ TM+P+THGE FN+L+YE+GQKYDSH
Sbjct: 121 AESTKDTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSH 180
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
YDAFNP EYG SQR+ASFLLYLS+VE GGETMFP+E G+ +D GYDY+KCIGLKVKPR
Sbjct: 181 YDAFNPDEYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDRGYDYQKCIGLKVKPR 240
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+GDGLLFYSL PNG ID+TSLHGSCPVIKGEKWVATKWI D+EQH
Sbjct: 241 QGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/282 (68%), Positives = 230/282 (81%), Gaps = 4/282 (1%)
Query: 9 STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
+ KL P L+C FFFLAGF+GST S D +RP+ R L+ ++ + + GE
Sbjct: 15 TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK LKPS LALR+GET E+TK
Sbjct: 75 FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
G RTSSG F+SAS DKT LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y+SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFN 194
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ C+GL+VKPR+GDGL
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDCVGLRVKPRQGDGL 254
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
LFYSL PNGTID+TSLHGSCPVIKGEKWVATKWIR+ +Q +D
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQEDD 296
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 194/252 (76%), Positives = 220/252 (87%)
Query: 34 FLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASA 93
F +DVPS+RP+ R LE V +E+ LP GETG+ S+ SIPFQVLSW PRALYFPNFAS
Sbjct: 35 FSHQDVPSVRPRPRFLESVYQEDFDPLPIGETGEHSLISIPFQVLSWFPRALYFPNFASI 94
Query: 94 EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIAR 153
EQCQSII AK ++PS LALR GET E+TKG RTSSGTFISASEDKTGIL+LIE KIA+
Sbjct: 95 EQCQSIIKMAKANMEPSSLALRTGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIAK 154
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
ATM+P+THGEAFNVLRYEIGQ+Y SHYDAF+PA+YGPQ SQR ASFLLYLSDVEEGGET+
Sbjct: 155 ATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQRAASFLLYLSDVEEGGETV 214
Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
FP+ENG +D+ YD+ KCIGLKVKPRRGDGLLFYSLFPNGTID TSLHGSCPVI+GEKWV
Sbjct: 215 FPYENGQNMDASYDFSKCIGLKVKPRRGDGLLFYSLFPNGTIDLTSLHGSCPVIRGEKWV 274
Query: 274 ATKWIRDQEQHE 285
ATKWIR+Q+Q +
Sbjct: 275 ATKWIRNQDQDD 286
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 406 bits (1043), Expect = e-111, Method: Compositional matrix adjust.
Identities = 199/285 (69%), Positives = 234/285 (82%), Gaps = 3/285 (1%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
MKGK S KL LP L S FFLAG +GS S+DV +R R LE VE+E S +
Sbjct: 1 MKGKGGWSP-KLELPLLFLSWSLFFLAGLFGSMLFSQDVNGVRSPPRLLESVEEEY-SSM 58
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
PHGETG++S+ IPFQVLSW+PRA YFP+FA+AEQCQSII AK L PS L LR+GET
Sbjct: 59 PHGETGESSVDLIPFQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRKGETE 118
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
ESTKG RTSSGTFISASEDKTGIL+ IE KIA+ATM+P+ HGE FN+LRYEIGQ+Y+SHY
Sbjct: 119 ESTKGIRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHY 178
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
DA +PAEYG Q SQR+ASFLLYLSDVEEGGETMFPFE+ + +++ ++ +KCIGLKVKPRR
Sbjct: 179 DAISPAEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNINT-FNSRKCIGLKVKPRR 237
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
GDGLLFYS+FPNGTID TS+HGSCPVI+GEKWVATKWIRD++Q +
Sbjct: 238 GDGLLFYSVFPNGTIDWTSMHGSCPVIEGEKWVATKWIRDEQQED 282
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 191/282 (67%), Positives = 228/282 (80%), Gaps = 4/282 (1%)
Query: 9 STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
+ KL P L+C FFFLAGF+GST S D +RP+ R L+ ++ + + GE
Sbjct: 15 TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK LKPS LALR+GET E+TK
Sbjct: 75 FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
G RTSSG F+SAS DKT LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFN 194
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ +GL+VKPR+GDGL
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDRVGLRVKPRQGDGL 254
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
LFYSL PNGTID+TSLHGSCPVIKGEKWVATKWIR+ +Q +D
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQEDD 296
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 399 bits (1025), Expect = e-109, Method: Compositional matrix adjust.
Identities = 194/285 (68%), Positives = 230/285 (80%), Gaps = 1/285 (0%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
MKGK +RS KLG+PT LC+ FF GF+ S L +D+ + P+ R L+ K+ L
Sbjct: 1 MKGKVRRSKLKLGVPTFFFLCALFFFVGFFVSPLLFQDLGDVGPRSRILQESVKKAYEPL 60
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
HGE+G+ + SIPFQ+LSWRPRA+YFPNF S E CQ II AK +L+PS+LALR+GET
Sbjct: 61 EHGESGEPFLNSIPFQILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETA 120
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
ESTK TRTSSGTFISASEDK+GIL+ +E KIA+ TM+P+THGE FN+L+YE+ QKYDSHY
Sbjct: 121 ESTKDTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHY 180
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG-YDYKKCIGLKVKPR 239
DAFNP EYG SQR+ASFLLYLS+VE GGETMFP+E G+ +D G YDYKKCIGLKVKPR
Sbjct: 181 DAFNPDEYGTVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDKGYYDYKKCIGLKVKPR 240
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+GDGLLFYSL PNG ID+TSLHGSCPVIKGEKWVATKWI D+EQH
Sbjct: 241 QGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 202/293 (68%), Positives = 227/293 (77%), Gaps = 12/293 (4%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIR----PKL-RTLE 50
++G K+ P+ LLC FFFLAGF+GST D +R P+ R LE
Sbjct: 6 VRGNWSWRMNKIAFPSVFLLCIFFFLAGFFGSTLFFQYHSQEDEHGLRQIQRPRTTRLLE 65
Query: 51 -VVEKENESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
+ EKE ES L P GETGD I +IPFQVLSW P ALYFPNFA+AEQC+SII TAK+ LK
Sbjct: 66 NLTEKETESHLLPAGETGDNFITTIPFQVLSWNPHALYFPNFATAEQCESIIETAKEGLK 125
Query: 109 PSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
PS L LR GET EST G RTSSG FISA EDKTG+L++IE KIARAT +P+THGEAFNVL
Sbjct: 126 PSTLVLRVGETDESTTGIRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVL 185
Query: 169 RYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDY 228
RY++GQKY SHYDA +P YGPQ SQR+ASFLLYLSDV EGGETMFPFENG+ +D Y Y
Sbjct: 186 RYKVGQKYSSHYDALHPDIYGPQKSQRMASFLLYLSDVPEGGETMFPFENGLNMDGSYYY 245
Query: 229 KKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+KCIGLKVKPR+GDGLLFYSLFPNGTID SLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 246 EKCIGLKVKPRKGDGLLFYSLFPNGTIDPMSLHGSCPVIKGEKWVATKWIRDQ 298
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 396 bits (1018), Expect = e-108, Method: Compositional matrix adjust.
Identities = 190/275 (69%), Positives = 223/275 (81%), Gaps = 7/275 (2%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P+ R LE E +P+GE+G
Sbjct: 22 TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLEAAWPE----MPYGESG 77
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+A+ IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG
Sbjct: 78 EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+SA+ED T L IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+YGPQ +QR+ASFLLYL+DVEEGGETMFP+EN +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLF 257
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
YSL NGTIDRTSLHGSCPVIKGEKWVATKWIRD
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 198/285 (69%), Positives = 229/285 (80%), Gaps = 3/285 (1%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
+KG + KL P L+C FFFLAGF+G T S D +RP+ R L+ +
Sbjct: 6 VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII AK LKPS LALRQG
Sbjct: 66 NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
ET ++TKG RTSSG F+SASEDKT L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ CIGLKVK
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVK 245
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
PR+GDGLLFYSL NGTID TSLHGSCPVIKGEKWVATKWIRDQE
Sbjct: 246 PRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 189/275 (68%), Positives = 223/275 (81%), Gaps = 7/275 (2%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P+ R LE E +P+GE+G
Sbjct: 22 TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLETAWPE----MPYGESG 77
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+A+ IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG
Sbjct: 78 EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+SA+ED T L IE KIARATM+P+ HGE FNVLRY IGQ+Y SHYDAF+P
Sbjct: 138 RTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPV 197
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+YGPQ SQR+ASFLLYL++VEEGGETMFP+ENG +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 198 QYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLF 257
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
YSL NGTIDRTSLHGSCPVIKGEKWVATKWIRD
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 189/275 (68%), Positives = 223/275 (81%), Gaps = 7/275 (2%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P+ R L+ E +P+GE+G
Sbjct: 22 TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLDAAWPE----MPYGESG 77
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+A+ IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG
Sbjct: 78 EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+SA+ED T L IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+YGPQ +QR+ASFLLYL+DVEEGGETMFP+EN +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLF 257
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
YSL NGTIDRTSLHGSCPVIKGEKWVATKWIRD
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 198/285 (69%), Positives = 229/285 (80%), Gaps = 3/285 (1%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
+KG + KL P L+C FFFLAGF+G T S D +RP+ R L+ +
Sbjct: 6 VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII AK LKPS LALRQG
Sbjct: 66 NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
ET ++TKG RTSSG F+SASEDKT L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ CIGLKVK
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEGCIGLKVK 245
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
PR+GDGLLFYSL NGTID TSLHGSCPVIKGEKWVATKWIRDQE
Sbjct: 246 PRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 392 bits (1007), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/288 (69%), Positives = 229/288 (79%), Gaps = 3/288 (1%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
+KG + KL P L+C FFFLAGF+G T S D +RP+ R LE +
Sbjct: 6 VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLEPGNEAEY 65
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ +P + GD SI SIPFQVLSWRPRALYFPNFA+AEQC++II AK LKPS LALRQG
Sbjct: 66 NLMPVRDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQG 125
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
ET E+TKG RTSSG F+SAS DKTG L +IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEENTKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYN 185
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ CIGLKVK
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVK 245
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
PR+GDGLLFYSL NGTID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 246 PRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 293
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 200/288 (69%), Positives = 232/288 (80%), Gaps = 6/288 (2%)
Query: 1 MKGKAK---RSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE 57
MK KAK R TKLGLP L C FFFLAG +GS L ++ S P+ + E+ +
Sbjct: 1 MKSKAKGKWRFGTKLGLPVVFLFCLFFFLAGLFGSGLLPQEFSSSEPRRL---IREETDY 57
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
L HGE+G+ S+ SIPFQVLSWRPRALYFPNFA++EQCQSII AK L PS +ALR G
Sbjct: 58 DPLAHGESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVG 117
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E +T+G RTSSG FISASEDKTG L+LIE KIAR M+P+THGEAFNVLRYEIGQ+Y+
Sbjct: 118 EIRGNTEGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYN 177
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETMFPFENG+ +D YD+++CIGLKVK
Sbjct: 178 SHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCIGLKVK 237
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
P +GDGLLFYS+FPNGTID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 238 PHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 285
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/277 (66%), Positives = 225/277 (81%), Gaps = 4/277 (1%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P +L T E+ E+ G+ +GE+G
Sbjct: 3 TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 61
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+ IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG
Sbjct: 62 EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 121
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+S+ ED TG L +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 122 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 181
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 182 QYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLF 241
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
YSL NGTID TSLHGSCPVIKGEKWVATKWIRD+ +
Sbjct: 242 YSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 278
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/277 (66%), Positives = 225/277 (81%), Gaps = 4/277 (1%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P +L T E+ E+ G+ +GE+G
Sbjct: 33 TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 91
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+ IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG
Sbjct: 92 EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 151
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+S+ ED TG L +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 152 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 211
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 212 QYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLF 271
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
YSL NGTID TSLHGSCPVIKGEKWVATKWIRD+ +
Sbjct: 272 YSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 308
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/278 (66%), Positives = 224/278 (80%), Gaps = 10/278 (3%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSI----RP--KLRTLEVVEKENESGLPHG 63
T+L LP LL CS FFLAGF GS ++D RP + R +E V E + +G
Sbjct: 72 TRLRLPVVLLSCSLFFLAGFLGSLLFTQDPQGEEDPERPLRRERVMEAVWPE----MAYG 127
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
E+GD + IP+Q+LSW+PRALYFP FA+AEQC++++ TAK RL+PS LALR+GE+ E+T
Sbjct: 128 ESGDPAPSLIPYQILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRKGESEETT 187
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
KG RTSSGTF+SA ED TG L IE KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF
Sbjct: 188 KGIRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAF 247
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
+PA+YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG ++ GYDY++CIGLKVKPR+GDG
Sbjct: 248 DPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGDG 307
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
LLFYSL NGTID TSLHGSCPV++GEKWVATKWIRD+
Sbjct: 308 LLFYSLMVNGTIDPTSLHGSCPVVRGEKWVATKWIRDK 345
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 382 bits (982), Expect = e-104, Method: Compositional matrix adjust.
Identities = 183/275 (66%), Positives = 223/275 (81%), Gaps = 4/275 (1%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLP---HGETG 66
T+L LP LL CS FFLAGF+GS ++D P +L E+ E+ P +G++G
Sbjct: 20 TRLRLPVVLLSCSLFFLAGFFGSLLFTQD-PQGEEELERPLSRERLMEAVWPEMAYGDSG 78
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
D + IP+Q+LSW+PRALYFP FA++EQC++++ TAK RL+PS LALR+GET E+TKG
Sbjct: 79 DPAPSLIPYQILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEETTKGI 138
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+SA ED T L +E KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF+PA
Sbjct: 139 RTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDPA 198
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG +D GYDY++CIGLKVKPR+GDGLLF
Sbjct: 199 QYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGDGLLF 258
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
YSL NGTID TSLHGSCPVIKGEKWVATKWIR++
Sbjct: 259 YSLMVNGTIDLTSLHGSCPVIKGEKWVATKWIRNK 293
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 375 bits (964), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/273 (65%), Positives = 215/273 (78%), Gaps = 6/273 (2%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASI 70
KLG P LL C+FFFL GFYGS LS++ + ++ V E + HG TGD+S+
Sbjct: 17 KLGFPLVLLCCTFFFLVGFYGSNSLSKEEKHV-----VIDPVTNE-KLVFEHGRTGDSSV 70
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
IPFQVLSW+PRAL +PNFAS EQC++II A+ RL PS LALR+GE+ +TK RTSS
Sbjct: 71 TDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKEIRTSS 130
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
GTF+ ASEDKT L +E K+ARATM+P+ +GEAFNVLRY GQKYD HYD F+PAEYGP
Sbjct: 131 GTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEYGP 190
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
Q SQR+ASFLLYLSDVEEGGETMFPFEN +++GY+YK CIGLKVKPR+GD LLFYS+
Sbjct: 191 QPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCIGLKVKPRQGDALLFYSMH 250
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
PNGT D+T+LHGSCPVIKGEKWVATKWIR+ ++
Sbjct: 251 PNGTFDKTALHGSCPVIKGEKWVATKWIRNTDK 283
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 188/297 (63%), Positives = 222/297 (74%), Gaps = 15/297 (5%)
Query: 1 MKGKAKRSSTK------LGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIRPKLRTL 49
M+GKA + + + LG P +L CSFFF+AG + S L S D +R + R L
Sbjct: 1 MRGKACKENWRFYKKPNLGFPALILSCSFFFIAGLFASNLLLSQGTSSDERWLRARARQL 60
Query: 50 EVVEKENESG---LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
+ VE+E S LP GE+GD I IPFQVLSWRPRALY+P F +AEQCQ II AK
Sbjct: 61 QSVEEEIISKYDLLPSGESGDDFITLIPFQVLSWRPRALYYPGFITAEQCQHIINMAKPS 120
Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
L+PS LALR+GET E+TKG RTSSG F+ +SED+ G+L++IE KIARATM+P THGEAFN
Sbjct: 121 LQPSTLALRKGETAETTKGIRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFN 180
Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
VLRYEIGQKYD+HYDAFNPAEYGPQ SQR+A+FLLYLS+ EEGGET FP EN + GY
Sbjct: 181 VLRYEIGQKYDAHYDAFNPAEYGPQTSQRVATFLLYLSNFEEGGETTFPIENDENFE-GY 239
Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D +KC GL+VKP +GD +LFYS+FPN TID SLH SC VIKGEKWVATKWIRDQ Q
Sbjct: 240 DAQKCNGLRVKPHQGDAILFYSIFPNNTIDPASLHASCHVIKGEKWVATKWIRDQVQ 296
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 372 bits (955), Expect = e-101, Method: Compositional matrix adjust.
Identities = 184/282 (65%), Positives = 215/282 (76%), Gaps = 13/282 (4%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPS--------IRPKLRTLE--VVEKENESG 59
KL LP+ L+C FFFLAGF S+ S PS + P+ R LE EK
Sbjct: 17 NKLDLPSVFLICIFFFLAGFSASSLFS---PSQEHEYDLRLIPRARLLEKSTHEKTEHHL 73
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
L G++GD I IPFQVLSW PRALYFPNF SAEQC++II A+ LKPS L LR+GET
Sbjct: 74 LKAGDSGDDYITLIPFQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRKGET 133
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTKG RTS G F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+GQKY H
Sbjct: 134 EESTKGIRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPH 193
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
YDAF+ AE+GP SQR ASFLLYL+DV EGGET+FP+ENG D YD++ CIGL+V+PR
Sbjct: 194 YDAFDEAEFGPLQSQRAASFLLYLTDVPEGGETLFPYENGFNRDGSYDFEDCIGLRVRPR 253
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GDGLLFYSL PNGTID+TS+HGSCPVIKGEKWVATKWIRDQ
Sbjct: 254 KGDGLLFYSLLPNGTIDQTSVHGSCPVIKGEKWVATKWIRDQ 295
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 367 bits (943), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 189/295 (64%), Positives = 221/295 (74%), Gaps = 12/295 (4%)
Query: 1 MKGKAKRS-----STKLGLPTALLLC---SFFFLAGFYGSTFLSRDVPSIRPKLRTLE-V 51
MK K ++ + K LP+ LLC GF D +R +R LE
Sbjct: 1 MKAKTVKANWSLRTKKFSLPSVFLLCIFFFLAGFFGFSFFHHSQEDEYGVR--VRLLEKS 58
Query: 52 VEKENESGLPH-GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS 110
V+ E E L H G++GD I SIPFQVLSW PRALYFPNFASAEQC II AK L PS
Sbjct: 59 VKDETEHRLLHAGKSGDNFITSIPFQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPS 118
Query: 111 QLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRY 170
+L LR+GET E TKG RTSSG FISASEDKTG+LE+I+ KIARA +P+THG A+N+LRY
Sbjct: 119 RLMLREGETEEGTKGIRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRY 178
Query: 171 EIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
++GQKY+SHYDAFNPAEYGPQ SQR+ASFLLYL+DV EGGETMFPFENG +DS Y+++
Sbjct: 179 KVGQKYNSHYDAFNPAEYGPQESQRVASFLLYLTDVPEGGETMFPFENGSNMDSSYNFED 238
Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
CIGLK+KP +GDGLLFYSLFPNGTID TSLHGSCPVIKGEKWVATKWIR+Q ++
Sbjct: 239 CIGLKIKPLKGDGLLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWIREQLHYD 293
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 367 bits (941), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 183/293 (62%), Positives = 217/293 (74%), Gaps = 13/293 (4%)
Query: 4 KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR------DVPSIRPKLRTLEVVEKENE 57
K R KL LP+ L+C FFFLAGF S+ S D+ ++EK +
Sbjct: 7 KGNRRMNKLDLPSVFLICIFFFLAGFSASSLFSHSQKHEYDLRLRPRPRPRARLLEKSTQ 66
Query: 58 SG-----LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
L G++GD + IPFQVLSW PRALYFPNFASAEQC+SII A+ LK S L
Sbjct: 67 EKTEYHLLKAGDSGDDYVTLIPFQVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTL 126
Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
ALR+GET ESTKG RTSSG F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+
Sbjct: 127 ALRKGETEESTKGIRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEV 186
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
GQKY+SHYDAF+ AEYGP SQR+ASFLLYL+DV EGGETMFP+ENG D + + CI
Sbjct: 187 GQKYNSHYDAFDEAEYGPLQSQRVASFLLYLTDVPEGGETMFPYENGFNRDG--NVEDCI 244
Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
GL+V+PR+GD LLFYSL PNGTID+TS HGSCPVIKGEKWVATKWIR+Q Q +
Sbjct: 245 GLRVRPRKGDALLFYSLLPNGTIDQTSAHGSCPVIKGEKWVATKWIRNQVQDD 297
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 173/290 (59%), Positives = 212/290 (73%), Gaps = 9/290 (3%)
Query: 1 MKGKAK--RSSTKLGLPTALLLCSFFFLAGFYGSTF------LSRDVPSIRPKLRTLEVV 52
MKG+++ ++ P L+LC FL G++ F +PS +
Sbjct: 1 MKGRSRGRVCGHQVSAPLVLVLCVLAFLVGYFIPEFQQVILVTKHSIPSFSSFANQRHEL 60
Query: 53 EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
E+ + HG TGD + IPFQVLSW PRAL FP FAS QC++II+ AK +L PS L
Sbjct: 61 -LEDVTVAEHGVTGDDQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSL 119
Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
ALR+GET T+ RTS G F+S+ +DKTG L +E K+A+ATM+P++HGEAFNVLRYEI
Sbjct: 120 ALRKGETATETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEI 179
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
GQKY+SHYD FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN ++ YDYK+CI
Sbjct: 180 GQKYNSHYDVFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECI 239
Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
GLKVKP++GD LLFYS+FPNGT D+T+LHGSCPVIKGEKWVATKWIRD+E
Sbjct: 240 GLKVKPKQGDALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKE 289
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 354 bits (909), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 172/282 (60%), Positives = 209/282 (74%), Gaps = 9/282 (3%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVP--SIRPKLRTLEVVEKENESG-------LP 61
KL LP LL C+ LAG+ S F + + +I R E E G L
Sbjct: 17 KLSLPQVLLWCTICLLAGYAASNFFPQKIEEEAIYQPYRKSAQQEGEFPFGEFSEKVVLD 76
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
HG TGD I IPFQVLSW+PRAL +P FAS EQC++I+ A+ RL PS LALR+GE+ +
Sbjct: 77 HGSTGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALALRKGESED 136
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
STK RTSSGTF+ A ED T LE +E K+A+ATM+P+ +GEAFNVL+Y +GQKYD HYD
Sbjct: 137 STKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYD 196
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
F+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN ++ G+DYKKCIG+KVKPR+G
Sbjct: 197 VFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCIGMKVKPRQG 256
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D LLFYS+ PNGT D+++LHGSCPVIKGEKWVATKWIR+ ++
Sbjct: 257 DALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKWIRNTDK 298
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 349 bits (895), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 158/221 (71%), Positives = 186/221 (84%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
HG TG+ + IPFQVLSW PRAL FP FAS QC++II+ AK +L PS LALR+GET
Sbjct: 8 HGVTGEDQLAFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETAT 67
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
T+ RTS G F+S+ +DKTG L +E K+A+ATM+P++HGEAFNVLRYEIGQKY+SHYD
Sbjct: 68 ETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYD 127
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN ++ YDYK+CIGLKVKP++G
Sbjct: 128 VFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQG 187
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
D LLFYS+FPNGT D+T+LHGSCPVIKGEKWVATKWIRD+E
Sbjct: 188 DALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKE 228
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 343 bits (880), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 158/206 (76%), Positives = 182/206 (88%)
Query: 54 KENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
+E S +PHG TG+ SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LA
Sbjct: 6 EEEASSMPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALA 65
Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
LR+GET E+TKGTRTSSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+G
Sbjct: 66 LRKGETAENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELG 125
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
QKYDSHYD FNP EYGPQ SQR+ASFLLYLSDVEEGGETMFPFENG + GYDYK+CIG
Sbjct: 126 QKYDSHYDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIG 185
Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTS 259
LKVKPR+GDGLLFYS+FPNGTID+ +
Sbjct: 186 LKVKPRKGDGLLFYSVFPNGTIDQVN 211
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 317 bits (812), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 157/272 (57%), Positives = 196/272 (72%), Gaps = 12/272 (4%)
Query: 11 KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
KLG P L C FF + GF LS+ + ++ T V +E ES L HG S
Sbjct: 9 KLGFPYVFLACCFFLAIFGFCFFNLLSQIISF--SEISTRRSVNEETES-LDHG-----S 60
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+ +IPF LSW PR Y PNFA+ +QC+++I AK +LKPS LALR+GET E+T+ RT
Sbjct: 61 VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRTR 120
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
ED++GIL IE KIA AT +P + E+FN+LRY++GQKYDSHYDAF+PAEYG
Sbjct: 121 ---LKKTDEDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYG 177
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
PQ+SQR+ +F+L+LS VEEGGETMFPFENG ++ YDY+ CIGL+VKPR+GD + FY+L
Sbjct: 178 PQISQRVVTFILFLSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNL 237
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
PN TID+TSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 238 LPNRTIDQTSLHGSCPVIKGEKWVATKWIRDQ 269
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 311 bits (798), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 154/272 (56%), Positives = 194/272 (71%), Gaps = 10/272 (3%)
Query: 11 KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
KL P + C FF + GF S+ + S T V E +S L HG +S
Sbjct: 9 KLVFPYVFIACCFFLAIFGFCFFNLFSQGI-SFSEIPTTRRSVNDETDS-LDHG----SS 62
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+ +IPF LSW PR Y PNFA+ +QC+++I AK +LKPS LALR+GET E+T+ R+
Sbjct: 63 VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSL 122
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
ED++G+L IE KIA AT P+ + E+FN+LRY++GQKYDSHYDAF+ AEYG
Sbjct: 123 HQ---HTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYG 179
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
P +SQR+ +FLL+LS VEEGGETMFPFENG ++ YDY+KC+GLKVKPR+GD + FY+L
Sbjct: 180 PLISQRVVTFLLFLSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNL 239
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
FPNGTID+TSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 240 FPNGTIDQTSLHGSCPVIKGEKWVATKWIRDQ 271
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 300 bits (767), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 142/179 (79%), Positives = 156/179 (87%), Gaps = 9/179 (5%)
Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
AK +LKPS LALR+GET EST FI SEDKTG L+ IE KIA+ATM+PQ+HG
Sbjct: 2 AKSKLKPSTLALRKGETTEST---------FIGGSEDKTGTLDFIERKIAKATMIPQSHG 52
Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL 222
EAFN+LRYEIGQKYDSHYDAFNP EYGPQ SQR+ASFLLYLS VEEGGETMFPFENG +
Sbjct: 53 EAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAV 112
Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
SG++YK+C+GLKVKPR+GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 113 SSGFEYKQCVGLKVKPRQGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 171
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 276 bits (707), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 129/222 (58%), Positives = 168/222 (75%), Gaps = 3/222 (1%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G GD + + FQ+LS PR++ + NFAS C +I+ A+ RL S LAL++GET+E+
Sbjct: 116 GALGDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALKRGETLET 175
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
TK RTSSGTF+++ +++G L+ +E K+ARAT +P THGEA+N+LRYEIGQKYDSHYD
Sbjct: 176 TKNIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDM 235
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC-IGLKVKPR 239
F+P++YGPQ SQR+ASFLLYL+ +EGGET+FP E NG++ G DY C GLKVKPR
Sbjct: 236 FDPSQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSCEAGLKVKPR 295
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GD LLF+S+ PN T DR+SLHG CPVI G K+VATKWI D
Sbjct: 296 KGDALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIHDN 337
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 126/192 (65%), Positives = 144/192 (75%), Gaps = 33/192 (17%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE----------------------- 163
R SG FISASEDKTG L+LIE KIAR M+P+THGE
Sbjct: 91 RLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVMK 150
Query: 164 ----------AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
AFN+LRYEIGQ+Y+SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETM
Sbjct: 151 RFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETM 210
Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
FPFENG+ +D YD+++CIGLKVKP +GDGLLFYS+FPNGTID TSLHGSCPVIKGEKWV
Sbjct: 211 FPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKWV 270
Query: 274 ATKWIRDQEQHE 285
ATKWIRDQEQ +
Sbjct: 271 ATKWIRDQEQDD 282
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 119/223 (53%), Positives = 151/223 (67%), Gaps = 5/223 (2%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
E+G+ P Q++SW PR + +P F E+C+ + AK RL PS LALR E + T
Sbjct: 98 ESGNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQET 157
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
+ RTS GTF+S +D G++ +E K A+ T LP +HGE FNVLRY+ GQ YDSHYD F
Sbjct: 158 ENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIF 217
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLD----SGYDYKKC-IGLKVKP 238
P YGPQ SQR+A+ L YL+DVEEGGET+FP E D +G++YK C G K KP
Sbjct: 218 EPESYGPQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKP 277
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
R GD L+FYS+ PNGT D+ +LHG CPV+ GEKWVATKWIRD+
Sbjct: 278 RMGDALMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIRDK 320
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/226 (51%), Positives = 154/226 (68%), Gaps = 5/226 (2%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP E+G ++ IPFQ+LSW PR + FP F + + +I A K + PS LA R GET
Sbjct: 2 LPAAESGSDNVYVIPFQILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYRPGET 61
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
V+ ++ TRTS+GTF++A+ D G+L +E +IA AT+LP +GEAFNVL YE Q YDSH
Sbjct: 62 VDPSQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSH 121
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLK 235
YD F+P E+GPQ SQR+A+ LLYLS+V EGGET+F E G+ ++ D++ C K
Sbjct: 122 YDTFDPKEFGPQPSQRIATVLLYLSEVLEGGETVFKRE-GVDGENRVIGDWRNCDDGSFK 180
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
PR GD +LF+ PNG ID +LHG CPV +GEKWVATKWIR +
Sbjct: 181 YMPRMGDAVLFWGTKPNGDIDPHALHGGCPVKRGEKWVATKWIRSR 226
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 118/225 (52%), Positives = 148/225 (65%), Gaps = 15/225 (6%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
L HGE+G + P Q+LS PRA P F S +QC +IA A++RL PS LA + G+T
Sbjct: 116 LEHGESGHSFYTVQPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDT 175
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
E+T+ ED G+L IE K+A TM+P HGE FNVLRYE Q YDSH
Sbjct: 176 AENTR------------DEDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSH 223
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC-IGLKV 236
YD+F+ EYGPQ SQR+A+ LLYL+DVEEGGET+F E G+ DYK C G+KV
Sbjct: 224 YDSFSEEEYGPQFSQRIATVLLYLADVEEGGETVFLLEGKGGLARLERIDYKACDTGIKV 283
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
KPR+GD LLF+S+ NGT+D+ SLHG CPV+ G KW TKWIR++
Sbjct: 284 KPRQGDALLFFSVSVNGTLDKHSLHGGCPVVAGTKWAMTKWIRNR 328
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/225 (52%), Positives = 147/225 (65%), Gaps = 3/225 (1%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP +G IPFQ+LSW PR + FP F + + I+ A K + PS LA R GE
Sbjct: 2 LPAAASGSDVTYIIPFQILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQ 61
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
VES++ TRTS+GTF+S+ D G+L +E +IA AT+LP +GEAFNVL YE Q YDSH
Sbjct: 62 VESSQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSH 121
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY-DYKKCI--GLKV 236
D+F+P ++GPQ SQR+A+ LLYLS+V EGGET+F E D D++ C K
Sbjct: 122 MDSFDPKDFGPQPSQRIATVLLYLSEVLEGGETVFKKEGVDGADRPIQDWRNCDDGSFKY 181
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
PR GD +LF+ PNG ID SLHG CPV KGEKWVATKWIR +
Sbjct: 182 APRMGDAVLFWGTRPNGEIDPHSLHGGCPVKKGEKWVATKWIRSR 226
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 100/120 (83%), Positives = 112/120 (93%), Gaps = 1/120 (0%)
Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLD 223
A+NVLRYE+GQKY+SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+EN +D
Sbjct: 1 AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYENDN-ID 59
Query: 224 SGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
S YDY +CIGLKVKPR+GDGLLFYSLF NGTID TS+HGSCPVIKGEKWVATKWIR++EQ
Sbjct: 60 SNYDYVQCIGLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWIRNEEQ 119
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/224 (47%), Positives = 145/224 (64%), Gaps = 7/224 (3%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G++G+A ++IPFQ+LS PR FPNF + + IIA A K + PS LA R GE VE+
Sbjct: 203 GDSGEAFYRTIPFQILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEA 262
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
+ RTS GTF+ D + L +E KIA T +P+ +GE +NVL Y+ Q YDSH D+
Sbjct: 263 EQQVRTSKGTFLGG--DSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDS 320
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFENGIFLDSGY-DYKKCI---GLKVK 237
F+P EYG Q SQR+A+ ++ LSD GGET+F E +D ++ C GL+ K
Sbjct: 321 FDPKEYGQQYSQRIATVIVVLSDEGLVGGETVFKREGKANIDKPITNWTDCDADGGLRYK 380
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
PR GD +LF+S FP+G +D+ +LHGSCPV+ G KWVA KWIR++
Sbjct: 381 PRAGDAVLFWSAFPDGRLDQHALHGSCPVVTGNKWVAVKWIRNK 424
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 190 bits (483), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 102/222 (45%), Positives = 140/222 (63%), Gaps = 14/222 (6%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
G+ GD ++ I SW PRA + NF S E+C +I+ AK ++ S + + G+ VE
Sbjct: 32 GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 86
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
+ RTSSG F++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD
Sbjct: 87 DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 142
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVK 237
F+ ++ QR+A+ L+YLSDVEEGGET+FP G F + + KC GL VK
Sbjct: 143 FFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVK 202
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ GD LLF+S+ P+ T+D TSLHG+CPVI+G KW TKWI
Sbjct: 203 PKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIH 244
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 138/212 (65%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
Q++SW PRA + NF + ++C+ +I TAK ++ S + + G++ +S+ RTSSGTF+
Sbjct: 68 QIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSS--VRTSSGTFL 125
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ I+ IE +IA T +P +GE+FNVLRYE+GQKYD H D F Q
Sbjct: 126 DRGGDE--IVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNTVNGGQ 183
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G + + C GL +KP+ GD LLF+S+
Sbjct: 184 RIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSIKPKMGDALLFWSMK 243
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+GT+D +SLHG+CPVIKG+KW TKW+R E
Sbjct: 244 PDGTLDPSSLHGACPVIKGDKWSCTKWMRINE 275
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 132/212 (62%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE KIA T +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YL+DVEEGGET+FP G F + + + C GL +KP+RGD LLF+S+
Sbjct: 195 RIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMK 254
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ T+D +SLHG CPVIKG KW +TKWIR E
Sbjct: 255 PDATLDASSLHGGCPVIKGNKWSSTKWIRVNE 286
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 104/219 (47%), Positives = 139/219 (63%), Gaps = 10/219 (4%)
Query: 67 DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
D S QSI P +V LSW+PRA + F SA +C ++ AK +L+ S +A + G++V
Sbjct: 30 DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 89
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG F+S +D+ ++ IE +IA T LP+ +GEA VLRYE G+KY+ HYD
Sbjct: 90 SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 145
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF-ENGIFLDSGYDYKKCIGLKVKPRR 240
F+ R+A+ L+YLSDV +GGET+FP E+ D + G+ VKPR+
Sbjct: 146 YFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRK 205
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
GD LLFYSL P+ T D +SLHG CPVI+GEKW ATKWI
Sbjct: 206 GDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIH 244
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 101/222 (45%), Positives = 138/222 (62%), Gaps = 14/222 (6%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
G+ GD ++ I SW PRA + NF S E+C +I+ AK ++ S + + G+ VE
Sbjct: 67 GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 121
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
+ RTSSG F++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD
Sbjct: 122 DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVK 237
F + QR+A+ L+YLSDVEEGGET+FP G F + + KC GL VK
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVK 237
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ GD LLF+S+ P+ T+D TSLHG+CPVI+G KW TKW+
Sbjct: 238 PKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 114/285 (40%), Positives = 163/285 (57%), Gaps = 19/285 (6%)
Query: 4 KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR-PKLRTLEVVEKENESGLPH 62
K K S+ L P +L+ C FL F TFL + ++R PKL L + + H
Sbjct: 3 KFKHSNVGLRKP-SLITCWTLFLTLFVTFTFLILIILTLRIPKLNHL--------NSITH 53
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVE 121
T Q++SW PRA + NF + E+C+ +I AK + S+ + + G+++
Sbjct: 54 SNTLRNDDNKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLN 113
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S+ RTSSGTF+ D+ I+ IE +IA T +P HGE+FNVL YE+GQKY+ HYD
Sbjct: 114 SS--IRTSSGTFLDREGDE--IVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYD 169
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVK 237
F QR+A+ L+YLSDVEEGGET+FP G F + + C GL +K
Sbjct: 170 YFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIK 229
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ G+ +LF+S+ P+ T+D +SLHG+CPVIKG+KW KW+ E
Sbjct: 230 PKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADE 274
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 101/222 (45%), Positives = 139/222 (62%), Gaps = 14/222 (6%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVE 121
G+ GD ++ I SW PRA + NF S E+C +I+ AK ++ S + + GE+V+
Sbjct: 67 GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVD 121
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG F++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD
Sbjct: 122 SR--VRTSSGMFLNRGQDK--IIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVK 237
F + QR+A+ L+YLSDVEEGGET+FP G F + + +C GL VK
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVK 237
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ GD LLF+S+ P+ T+D TSLHG+CPVI+G KW TKW+
Sbjct: 238 PKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
++LSW PRA + NF S E+C+ +I AK ++ K S + + G++ ES RTSSG F+
Sbjct: 80 EILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESR--VRTSSGMFL 137
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I++ IE +IA T +P+ +GE +L YE+GQKY+ HYD F Q
Sbjct: 138 KRGKDK--IVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYFLDEFNTKNGGQ 195
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP N F + D +C GL VKP+ GD LLF+S+
Sbjct: 196 RIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLFWSMR 255
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ T+D +SLHG CPVIKG KW +TKW+ +E
Sbjct: 256 PDATLDPSSLHGGCPVIKGNKWSSTKWMHLRE 287
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 103/220 (46%), Positives = 138/220 (62%), Gaps = 11/220 (5%)
Query: 67 DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
D S QSI P +V LSW+PRA + F SA +C ++ AK +L+ S +A + G++V
Sbjct: 16 DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 75
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG F+S +D+ ++ IE +IA T LP+ +GEA VLRYE G+KY+ HYD
Sbjct: 76 SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 131
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF--ENGIFLDSGYDYKKCIGLKVKPR 239
F+ R+A+ L+YLSD +GGET+FP E+ D + G+ VKPR
Sbjct: 132 YFHDKYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPR 191
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+GD LLFYSL P+ T D +SLHG CPVI+GEKW ATKWI
Sbjct: 192 KGDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIH 231
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 186 bits (472), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I++ IE +IA T +P HGE VL YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP N F + + +C GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D TSLHG CPVI+G KW +TKWI
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWI 280
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C +I+ AK +K S + + ++ RTSSG F+
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSR-VRTSSGMFLR 155
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I++ IE +IA T +P HGE VL YE+GQKY+ H+D F+ QR
Sbjct: 156 RGQDK--IIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQR 213
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVE+GGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 214 IATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKP 273
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 274 DGSMDSTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 99/217 (45%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C+ +I+ AK +K S + V+S G RTS
Sbjct: 96 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 148
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGTF+ +DK ++ IE +I+ T +P +GE VL YE+GQKY+ H+D F+
Sbjct: 149 SGTFLRRGQDK--VIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 206
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N + + +C G+ VKP+ GD LL
Sbjct: 207 KNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 266
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+GT+D TSLHG CPVIKG+KW +TKWIR E
Sbjct: 267 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 303
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/208 (48%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I++ IE +IA T +P HGE VL YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP N F + + +C GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D TSLHG CPVI+G KW +TKW+
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWM 280
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/212 (45%), Positives = 133/212 (62%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I+ AK ++ S + ET +S RTSSGTF+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DKT + IE +++ + +P HGE VL YE+GQKY+ H+D F Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+RGD LLF+S+
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ ++D +SLHG CPVIKG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWVRVEE 284
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 100/217 (46%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C+ +I+ AK +K S + V+S G RTS
Sbjct: 90 EVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 142
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGTF+ +DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+
Sbjct: 143 SGTFLRRGQDK--IVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 200
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N + + +C G+ VKP+ GD LL
Sbjct: 201 KNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 260
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+GT+D TSLHG CPVIKG+KW +TKWIR E
Sbjct: 261 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 297
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/206 (47%), Positives = 133/206 (64%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S +C ++ AK RL+ S +A G++V S RTSSGTF++
Sbjct: 34 LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQ--VRTSSGTFLNK 91
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VL YE+GQKYD+H+D F+ R+
Sbjct: 92 HEDE--IISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKLGGHRV 149
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YL+DV++GGET+FP G L + + +C GL VKPR+GD LLF+SL N
Sbjct: 150 ATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINA 209
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D +SLHGSCPVI+GEKW ATKWI
Sbjct: 210 TTDPSSLHGSCPVIEGEKWSATKWIH 235
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I++ IE +IA T +P HGE +L YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP N F + + +C GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D TSLHG CPVI+G KW +TKW+
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWM 280
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 132/212 (62%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 80 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE +IA + +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YL+DVEEGGET+FP G F + + +C GL +KP+RGD LLF+S+
Sbjct: 196 RIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMK 255
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ T+D +SLHG CPVIKG KW +TKW+R E
Sbjct: 256 PDATLDPSSLHGGCPVIKGNKWSSTKWMRVSE 287
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 96/212 (45%), Positives = 133/212 (62%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I+ AK ++ S + ET +S RTSSGTF+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DKT + IE +++ + +P HGE VL YE+GQKY+ H+D F Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+RGD LLF+S+
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ ++D +SLHG CPVIKG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWMRVEE 284
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ +SW+PRA F NF S+E+C +I A+ +K S + Q + ++ RTSSGTF+
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR-VRTSSGTFLR 105
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ I+ IE +IA+ T +P+ HGE VL YE+GQKYD+H+D F+ QR
Sbjct: 106 RGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQR 163
Query: 196 LASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVEEGGET+FP N + + +C G+ VKPR+GD LLF+S+ P
Sbjct: 164 VATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALLFWSMSP 223
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ +D SLHG CPVIKG KW ATKW+ +E
Sbjct: 224 DAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 129/212 (60%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK ++ S + ET S RTSSGTF+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVV--DSETGRSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S DK + IE +IA + +P HGE VL YE+GQKY+ H+D FN Q
Sbjct: 135 SRGRDKK--IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFNDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G F + + +C GL VKP GD LLF+S+
Sbjct: 193 RVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMK 252
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ T+D +SLHG CPVI G KW ATKW+R E
Sbjct: 253 PDATLDPSSLHGGCPVINGNKWSATKWMRVNE 284
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ +SW+PRA F NF S+E+C +I A+ +K S + Q + ++ RTSSGTF+
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR-VRTSSGTFLR 105
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ I+ IE +IA+ T +P+ HGE VL YE+GQKYD+H+D F+ QR
Sbjct: 106 RGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQR 163
Query: 196 LASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVEEGGET+FP N + + +C G+ VKPR+GD LLF+S+ P
Sbjct: 164 VATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALLFWSMSP 223
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ +D SLHG CPVIKG KW ATKW+ +E
Sbjct: 224 DAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVEEGGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 228 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 287
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 288 DGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 318
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 131/211 (62%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++SW PRA + NF S E+C+ +I+ AK +K S + + + ++ RTSSG F+
Sbjct: 78 EIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSR-VRTSSGMFLR 136
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
DK I+ IE +IA T +P HGE VL YE+GQKYD+HYD F QR
Sbjct: 137 RGRDK--IIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVEEGGET+FP F + + +C GL VKP+ GD LLF+S+ P
Sbjct: 195 IATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMRP 254
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ T+D +SLHG CPVIKG KW +TKW+ +E
Sbjct: 255 DATLDPSSLHGGCPVIKGNKWSSTKWMHVEE 285
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 133/217 (61%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK ++ S + V+S+ G RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGTF++ +DK I+ IE +++ T LP HGE +L YE+GQKY+ HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALL 248
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ ++D +SLHG CPVIKG KW +TKWIR E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 133/217 (61%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK ++ S + V+S+ G RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGTF++ +DK I+ IE +++ T LP HGE +L YE+GQKY+ HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALL 248
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ ++D +SLHG CPVIKG KW +TKWIR E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 131/205 (63%), Gaps = 8/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+SWRPRA + NF + E+C I AK +L+ S +A + G++VES RTSSG F
Sbjct: 65 ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESE--VRTSSGMFFRK 122
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ ++ +E +IA T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 123 AQDQ--VVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRV 180
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDS-GYDYKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLSDVE+GGET+FP + G D+ C G VKPR+GD LLF+SL P+
Sbjct: 181 ATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGDALLFFSLHPDA 240
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 241 TTDPLSLHGSCPVIEGEKWSATKWI 265
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 12 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 70
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 71 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 128
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVEEGGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 129 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 188
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 189 DGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 219
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 132/208 (63%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
Q++SW PRA + NF + E+C+ +I AK + S + + G V+S + RTSSG F+
Sbjct: 116 QIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRE--RTSSGAFL 173
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ I++ IE +IA T +P HGE FNVL YE+GQKY+ HYD F Q
Sbjct: 174 KRGSDR--IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQ 231
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G F + + C GL +KP+ G+ +LF+S+
Sbjct: 232 RIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMK 291
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D +SLHG+CPVIKG+KW+ KW+
Sbjct: 292 PDATLDPSSLHGACPVIKGDKWLCAKWM 319
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 131/217 (60%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVE+GGET+FP S + + +C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 267
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 268 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 132/212 (62%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE KI+ T +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFPFENG--IFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YL+DVEEGGET+FP G F+ + +C GL +KP+RGD LLF+S+
Sbjct: 195 RIATVLMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMK 254
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ ++D +SLHG CPVIKG KW +TKW+R E
Sbjct: 255 PDASLDPSSLHGGCPVIKGNKWSSTKWMRVSE 286
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S +C +I AK ++ S +A G+++ S RTSSG F++
Sbjct: 38 LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQ--VRTSSGAFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYEIGQKYD+H+D F+ QR
Sbjct: 96 HEDE--IVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKHGGQRF 153
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF+ L N
Sbjct: 154 ATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFGLHLNA 213
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D +SLHGSCPVI+GEKW ATKWI
Sbjct: 214 TTDTSSLHGSCPVIEGEKWSATKWIH 239
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 131/206 (63%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF+SL N
Sbjct: 156 ATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNA 215
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWIH 241
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 131/217 (60%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 154 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 206
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 207 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 264
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVE+GGET+FP S + + +C GL VKP+ GD LL
Sbjct: 265 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 324
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 325 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 361
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 131/205 (63%), Gaps = 8/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF+SL N
Sbjct: 156 ATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNA 215
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWI 240
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 180 bits (456), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 133/212 (62%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+++SW PRA + NF + E+C+ +I AK ++ S + + G++ +S RTSSGTF+
Sbjct: 79 EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F Q
Sbjct: 137 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G + + + +C GL VKP+ GD LLF+S+
Sbjct: 195 RIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMT 254
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ T+D +SLHG C VIKG KW +TKW+R E
Sbjct: 255 PDATLDPSSLHGGCAVIKGNKWSSTKWLRVHE 286
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 129/217 (59%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK R++ S + V+ST G RTS
Sbjct: 108 EVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV-------VDSTTGKSKDSRVRTS 160
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 161 SGMFLRRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 218
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 219 KNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALL 278
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 279 FWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGE 315
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 128/206 (62%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PRA + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T +P+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPV++GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 130/206 (63%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PRA + F S E+C +I A+ +L+ S +A + G+++ES RTSSG FI+
Sbjct: 52 LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESE--VRTSSGMFIAK 109
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ I+ IE +IA T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 110 AQDE--IVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRV 167
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 168 ATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDA 227
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 228 TTDSDSLHGSCPVIEGEKWSATKWIH 253
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 129/209 (61%), Gaps = 9/209 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFI 134
+++SW PR + NF + E+C+ +I AK ++ S + G++V S+ RTSSGTFI
Sbjct: 71 EIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSS--ARTSSGTFI 128
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK IL IE +IA T +P HGE N+L YE+GQKYD H D F +
Sbjct: 129 DRGYDK--ILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHTDYFEDEVNTKHGGE 186
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G F + + C GL +KP+ G+ +LF+ +
Sbjct: 187 RIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGLSIKPKMGNAILFWGMK 246
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ T+D S+HG+CPVIKG+KW TKW+R
Sbjct: 247 PDATVDPLSVHGACPVIKGDKWSCTKWMR 275
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 75 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 132
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 133 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 190
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 191 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 250
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
T D SLHGSCPV++GEKW AT+WI
Sbjct: 251 TTDSNSLHGSCPVVEGEKWSATRWI 275
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 177 bits (449), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPV++GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPV++GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 126/207 (60%), Gaps = 9/207 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA + F + E+C +I+ AK LK S +A + ++++ RTSSG FI +
Sbjct: 36 ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSE-VRTSSGMFIPKA 94
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRYE GQKY+ HYD F + RLA
Sbjct: 95 KDP--IVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDYFVDKVNIARGGHRLA 152
Query: 198 SFLLYLSDVEEGGETMFP------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
+ L+YL++VE+GGET+FP + D G+ VKPR+GD LLFYSL P
Sbjct: 153 TVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGDALLFYSLHP 212
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
N T D SLHG CPVI+GEKW ATKWI
Sbjct: 213 NATPDPLSLHGGCPVIQGEKWSATKWI 239
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 177 bits (448), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 131/207 (63%), Gaps = 10/207 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PRA + F S E+C ++ A+ +L+ S +A + G+++ES RTSSG FI
Sbjct: 49 LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESE--VRTSSGMFIGK 106
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
S+D+ I++ IE +IA T LPQ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 107 SQDE--IVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRV 164
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL----DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
+ L+YLS+V +GGET+FP G + DS D K G VKP++GD LLF+SL P+
Sbjct: 165 VTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAK-NGYAVKPQKGDALLFFSLHPD 223
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 224 ATTDTNSLHGSCPVIEGEKWSATKWIH 250
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 127/217 (58%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ T+D SLHG CPVIKG KW +TKW+ +E
Sbjct: 271 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVRE 307
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 129/207 (62%), Gaps = 9/207 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFIS 135
LSW PRA + F S E+C +I AK +L+ S + GE+++S + RTSSG F++
Sbjct: 35 LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEE--RTSSGVFLT 92
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D + E R
Sbjct: 93 KRQDD--IVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHR 150
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 252
+A+ L+YLS+V +GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 151 IATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECAKQGYAVKPRKGDALLFFNLHPN 210
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
T D TSLHGSCPVI+GEKW AT+WI
Sbjct: 211 ATTDPTSLHGSCPVIEGEKWSATRWIH 237
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 176 bits (447), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 130/205 (63%), Gaps = 8/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS RPRA + F S +C I++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF++L N
Sbjct: 154 ATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNA 213
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 214 TADTGSLHGSCPVIEGEKWSATKWI 238
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++SW PRA NF + ++C +I A ++ S + Q ++ RTSSG F++
Sbjct: 3 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 61
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE KIA+ T +P+ HGE VL YE GQKYD+H+D F QR
Sbjct: 62 RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 119
Query: 196 LASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YL+DVEEGGET+FP +N L +C G+ V+P+RGD LLF+S+ P
Sbjct: 120 IATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSP 179
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ +D +SLHG CPVIKG+KW ATKW+R E
Sbjct: 180 DAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 210
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++SW PRA NF + ++C +I A ++ S + Q ++ RTSSG F++
Sbjct: 15 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 73
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE KIA+ T +P+ HGE VL YE GQKYD+H+D F QR
Sbjct: 74 RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 131
Query: 196 LASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YL+DVEEGGET+FP +N L +C G+ V+P+RGD LLF+S+ P
Sbjct: 132 IATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSP 191
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ +D +SLHG CPVIKG+KW ATKW+R E
Sbjct: 192 DAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 222
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 97/222 (43%), Positives = 131/222 (59%), Gaps = 7/222 (3%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
+ + +++ + + +SW+PRA + F S E+C +I+ AK LK S +A
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
++ RTSSG FI +D I+ IE KIA T LP+ +GE VLRYE GQKYD+HYD
Sbjct: 84 LSE-VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYD 140
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVK 237
F + R+A+ L+YLSDV +GGET+FP + L + D +C G+ VK
Sbjct: 141 YFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECARKGIAVK 200
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
PR+GD LLF+SL P D SLHG CPVI+GEKW ATKWI
Sbjct: 201 PRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIH 242
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 126/208 (60%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF S E+C+ +I AK R+ S + ET +S RTSSG F+
Sbjct: 105 EVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVV--DSETGKSKDSRVRTSSGMFL 162
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F Q
Sbjct: 163 QRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQ 220
Query: 195 RLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSD+EEGGET+FP N L + +C GL VKP+ GD LLF+S+
Sbjct: 221 RMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMK 280
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D SLHG CPVIKG KW +TKW+
Sbjct: 281 PDATLDPLSLHGGCPVIKGNKWSSTKWL 308
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 128/217 (58%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ +IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 209 KNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALL 268
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 269 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 305
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+SW+PRA + NF + +C+ +I AK R++ S + + +K RTSSGTF+
Sbjct: 81 EVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSK-VRTSSGTFLP 139
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
DK I+ IE +IA + +P HGE +L YE+GQ+Y+ H+D F QR
Sbjct: 140 RGRDK--IVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYFMDEYNTKNGGQR 197
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVEEGGET+FP G + + +C GL VKP+ GD LLF+S+ P
Sbjct: 198 IATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVKPKMGDALLFWSMNP 257
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+G+ D +SLHG CPVI+G KW +TKW+R E
Sbjct: 258 DGSPDPSSLHGGCPVIRGNKWSSTKWMRVNE 288
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 131/212 (61%), Gaps = 9/212 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
++LSW PRA + NF S E+C+ +I AK L K S + + G++ ES RTSSG F+
Sbjct: 70 EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESR--VRTSSGMFL 127
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I++ IE +IA T +P +GE VL Y +G+KY+ HYD F Q
Sbjct: 128 KRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 185
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP F + D +C GL +KP+ GD LLF+S+
Sbjct: 186 RVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMR 245
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ T+D +SLHG CPVI G KW +TKW+ +E
Sbjct: 246 PDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 277
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+++SW PRA + NF S E+C+ +IA AK + K + + + G + +S RTSSG F+
Sbjct: 79 EIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSR--VRTSSGMFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I+ IE +IA + +P HGE VL YE+GQKY++HYD F Q
Sbjct: 137 RRGRDK--IIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
R A+ L+YLSDVEEGGET+FP N + S + +C GL VKP+ G+ LLF+S
Sbjct: 195 RTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTR 254
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D SLHGSCPVI+G KW ATKW+
Sbjct: 255 PDATLDPASLHGSCPVIRGNKWSATKWM 282
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 127/217 (58%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 208 KNGGQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALL 267
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 304
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS RPRA + F S +C +++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YL+DV +GGET+FP G L D + GL VKP++GD LLF++L N
Sbjct: 154 ATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNA 213
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 214 TADTGSLHGSCPVIEGEKWSATKWIH 239
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 96/208 (46%), Positives = 127/208 (61%), Gaps = 9/208 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW PRA + F + +C +I+ AK LK S +A + ++ RTSSG FI +
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE-VRTSSGAFIHKA 102
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRYE GQKYD+H+D F + R+A
Sbjct: 103 KDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMA 160
Query: 198 SFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
+ L+YLSDVE+GGET+FP + ++ D C G+ VKPR+GD LLF+SL P
Sbjct: 161 TVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHP 220
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
N D +SLHG CPVI+GEKW ATKWIR
Sbjct: 221 NAIPDTSSLHGGCPVIEGEKWSATKWIR 248
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 132/220 (60%), Gaps = 9/220 (4%)
Query: 65 TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKL 87
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG FI ++D I+ +E KI+ T+LP+ +GE VLRYE GQKYD HYD
Sbjct: 88 SE--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYD 143
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPR 239
F + R+A+ L+YL+DV +GGET+FP ++ D +C G+ VKPR
Sbjct: 144 YFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPR 203
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
RGD LLF+SL+PN D SLH CPVI+GEKW ATKWI
Sbjct: 204 RGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 243
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 102/253 (40%), Positives = 145/253 (57%), Gaps = 10/253 (3%)
Query: 33 TFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNF 90
FL ++ +L L+ K + G +S++ P +V LSW PRA + F
Sbjct: 12 CFLCFEISVSSIRLPGLDQDAKATHGSVLRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGF 71
Query: 91 ASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEH 149
S E+C +I AK +L+ S +A + G+++ S RTSSG F++ ++D+ I+ IE
Sbjct: 72 LSDEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLNKAQDE--IVAGIEA 127
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
+IA T LP +GE+ +L YE GQKY+ H+D F+ R+A+ L+YLSDVE+G
Sbjct: 128 RIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKG 187
Query: 210 GETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPV 266
GET+FP L D + G VKPR+GD LLF+SL + + D SLHGSCPV
Sbjct: 188 GETIFPNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPV 247
Query: 267 IKGEKWVATKWIR 279
I+GEKW ATKWI
Sbjct: 248 IEGEKWSATKWIH 260
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 134/208 (64%), Gaps = 11/208 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
+SWRPRA + F + +C ++A A++ L+ S + RQ G++V S RTSSGTF++
Sbjct: 41 VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSE--VRTSSGTFLA 98
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG--PQMS 193
+D+ ++ IE +IA T+LPQ +GE+ VLRYE GQKY+ H D A G +
Sbjct: 99 KKQDQ--VVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHHSRGG 156
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-KCI--GLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDV+ GGET+FP + L D + +C G VKP +GD +LF+SL
Sbjct: 157 HRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLFFSLH 216
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
PNGT DR SLHG CPVI+GEKW ATKWI
Sbjct: 217 PNGTTDRDSLHGGCPVIEGEKWSATKWI 244
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 130/208 (62%), Gaps = 10/208 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA + F + E+C +I+ AK LK S +A + + ++ RTSSG FIS +
Sbjct: 40 ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSE-VRTSSGAFISKA 98
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I++ IE K+A T LP +GE VLRYE GQKY++H+D F+ + R A
Sbjct: 99 KD--AIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIARGGHRYA 156
Query: 198 SFLLYLSDVEEGGETMFPF-----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
+ L+YLS+VE+GG+T+FP + + D +C G+ VKPR+GD LLF+SL
Sbjct: 157 TVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECAKRGISVKPRKGDALLFFSLT 216
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P T D+ SLHG CPVI+GEKW ATKWI
Sbjct: 217 PTATPDQLSLHGGCPVIEGEKWSATKWI 244
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 127/205 (61%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA F S E+C+ IIA AK R+ K S + G++V+S RTS+G +++
Sbjct: 57 LSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSE--IRTSTGAWLAK 114
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
ED+ I+ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P P+ Q
Sbjct: 115 GEDE--IISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+ VEEGGET+ P + G+ GL VKP +GD L+FYSL P+G+
Sbjct: 173 RVVTVLMYLTTVEEGGETVLPHADQKVSGEGWSECAKRGLAVKPVKGDALMFYSLKPDGS 232
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHGSCP +KG+KW ATKWI
Sbjct: 233 NDPASLHGSCPTLKGDKWSATKWIH 257
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 104/276 (37%), Positives = 159/276 (57%), Gaps = 18/276 (6%)
Query: 16 TALLLCSF--FFLAGFYGSTFLSRDVPSIRPKLRT--LEVVEKENESGLPHGETGDASIQ 71
T L+L F L G + + + S RP T ++ +E+ G GD ++
Sbjct: 22 TVLILGLFVILILVGLGILSLPNTNKSSSRPMDLTTIVQTIEERESYGDEEDGNGDRWLE 81
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSS 130
V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S RTSS
Sbjct: 82 -----VISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSR--VRTSS 134
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
GTF+ D+ I+E IE++I+ T +P +GE VL YE+GQKY+ H+D F
Sbjct: 135 GTFLKRGHDE--IVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVR 192
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLF 246
+ QR+A+ L+YLSDV+EGGET+FP G D + + +C GL V P++ D LLF
Sbjct: 193 KGGQRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSVLPKKRDALLF 252
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+S+ P+ ++D +SLHG CPVIKG KW +TKW E
Sbjct: 253 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 126/217 (58%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F + E+C +I AK +L+ S +A + G+++ S RTSSG F+
Sbjct: 58 LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSE--VRTSSGMFLQK 115
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D ++ IE +IA T LP +GEA +L YE GQKY+ H+D F+ R+
Sbjct: 116 AQDD--VVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQLGGHRI 173
Query: 197 ASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VEEGGET+FP E + L + C G VKP++GD LLF+SL P+
Sbjct: 174 ATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGGYSVKPKKGDALLFFSLHPDA 233
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
+ D SLHGSCPVI+GEKW ATKWI
Sbjct: 234 STDSLSLHGSCPVIEGEKWSATKWIH 259
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 99/224 (44%), Positives = 132/224 (58%), Gaps = 11/224 (4%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETV 120
+ + +++ + + +SW+PRA + F S E+C +I+ AK LK S +A G++
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
S RTSSG FI +D I+ IE KIA T LP+ +GE VLRYE GQKYD+HY
Sbjct: 84 LSE--VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHY 139
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF----LDSGYDYKKCI--GL 234
D F + R+A+ L+YLSDV +GGET+FP L + D +C G+
Sbjct: 140 DYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSECARKGI 199
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
VKPR+GD LLF+SL P D SLHG CPVI+GEKW ATKWI
Sbjct: 200 AVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWI 243
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 89/213 (41%), Positives = 137/213 (64%), Gaps = 9/213 (4%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 133
+V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S RTSSGTF
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSR--VRTSSGTF 137
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
++ D+ I+E IE++I+ T +P +GE VL YE+GQ+Y+ H+D F +
Sbjct: 138 LNRGHDE--IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGG 195
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 249
QR+A+ L+YLSDV+EGGET+FP G D + + +C GL V P++ D LLF+S+
Sbjct: 196 QRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSM 255
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ ++D +SLHG CPVIKG KW +TKW E
Sbjct: 256 KPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 173 bits (439), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 133/209 (63%), Gaps = 9/209 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VLSW PRA + NF + ++C+ +I AK + S + + G +++S RTSSG F+
Sbjct: 92 EVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSN--VRTSSGWFL 149
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ +DK I+ IE +IA + +P HGE +VL YE+ QKYD+HYD F+ Q
Sbjct: 150 NRGQDK--IIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQ 207
Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLF 250
R A+ L+YLSDVE+GGET+FP N + + +C GL V+P+ GD LLF+S+
Sbjct: 208 RGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVK 267
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ ++D +SLHGSCPVI+G KW ATKW+R
Sbjct: 268 PDASLDPSSLHGSCPVIQGNKWSATKWMR 296
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +I AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 43 LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 100
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 101 KQDE--VVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 158
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 159 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDA 218
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 219 TTDSESLHGSCPVIEGQKWSATKWIH 244
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 141 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 200
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHGSCP +KG+KW ATKWI
Sbjct: 201 NDPASLHGSCPTLKGDKWSATKWIH 225
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 125/217 (57%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 268 FWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 129/205 (62%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + NF S +C +I+ AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 9 LSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSE--IRTSSGMFLMK 66
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE +IA T LP+ +GEA VLRY+ G+KY+ H+D F+ R+
Sbjct: 67 GQDD--IISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRI 124
Query: 197 ASFLLYLSDVEEGGETMFPF--ENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
A+ L+YLSDV +GGET+FP + G D + G+ VKPR+GD LLF+SL P+
Sbjct: 125 ATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFSLHPSAV 184
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D +SLH CPVI+GEKW ATKWI
Sbjct: 185 PDESSLHTGCPVIEGEKWSATKWIH 209
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 125/204 (61%), Gaps = 5/204 (2%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW PRA +P+F S ++ +++ A+ LK S +A + ++ RTSSGTFIS
Sbjct: 54 ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSE-VRTSSGTFISKG 112
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R+A
Sbjct: 113 KDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHRVA 170
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 255
+ LLYL+DV EGGET+FP G +C G+ VKPR+GD LLF++L P+
Sbjct: 171 TVLLYLTDVAEGGETVFPLAKGRKGSHHKGLSECAQKGIAVKPRKGDALLFFNLRPDAAT 230
Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
D TSLHG C VIKGEKW ATKWIR
Sbjct: 231 DPTSLHGGCEVIKGEKWSATKWIR 254
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 16 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 73
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 74 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 132 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 191
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHGSCP +KG+KW ATKWI
Sbjct: 192 NDPASLHGSCPTLKGDKWSATKWIH 216
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 99/260 (38%), Positives = 146/260 (56%), Gaps = 12/260 (4%)
Query: 32 STFLSRDVPSIRPKLRTLEVVEKENESGLPHG----ETGDASIQSIPF-QVLSWRPRALY 86
S +R PS ++L V + S + H E+GD ++ + + +SW PRA
Sbjct: 7 SRISTRKSPSSTLIRKSLNVHFPNDLSSIAHNSKIHESGDDEGKAEQWVEAISWEPRAFI 66
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ NF + +C +I AK ++ S + + ++ RTSSGTF+ DK I+
Sbjct: 67 YHNFLTKAECDYLINLAKPHMQKSMVVDSSSGKSKDSR-VRTSSGTFLPRGRDK--IIRD 123
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDV 206
IE +IA + +P HGE +L YE+GQKY+ H+D F QR+A+ L+YLSDV
Sbjct: 124 IEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFDYFMDDYNTENGGQRIATVLMYLSDV 183
Query: 207 EEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
EEGGET+FP G + + +C GL VKP+ GD LLF+S+ P+ ++D +SLHG
Sbjct: 184 EEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDASLDPSSLHG 243
Query: 263 SCPVIKGEKWVATKWIRDQE 282
CPVI+G KW +TKW+R E
Sbjct: 244 GCPVIRGNKWSSTKWMRVNE 263
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 17 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 74
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 75 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 133 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 192
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHGSCP +KG+KW ATKWI
Sbjct: 193 NDPASLHGSCPTLKGDKWSATKWIH 217
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 137/220 (62%), Gaps = 17/220 (7%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK-GTRTS 129
+S P + + RP+A F NF + E+C+ +IA AK +L PS + G+ +STK G RTS
Sbjct: 225 RSPPLVLSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGK--KSTKSGIRTS 282
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NP 185
+G F++ + +T + ++E ++A A LP+ +GE +LRYE GQKYD HYD F NP
Sbjct: 283 AGMFLT--KGQTPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINP 340
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI--FLDSGYD--YKKCI--GLKVKPR 239
+ + QR+A+ L+YL D EEGGET+FP F D D + C GL VK +
Sbjct: 341 SPN--RGGQRMATMLIYLKDTEEGGETIFPNAKKPEGFHDGEKDGAFSDCAKRGLPVKSK 398
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
RGD +LF+SL + +D SLHG+CPV++GEKW A KWIR
Sbjct: 399 RGDAVLFWSLTSDYKLDEGSLHGACPVLRGEKWTAVKWIR 438
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/272 (38%), Positives = 150/272 (55%), Gaps = 22/272 (8%)
Query: 14 LPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI 73
L ++L LC FL ++ +L L+ K + G +S++
Sbjct: 6 LASSLFLC------------FLCFEISVSSLRLPGLDQDAKATHGSVLRLNRGGSSVKFD 53
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSS 130
P +V LSW PRA + F S E+C +I AK +L+ S +A G+++ S RTSS
Sbjct: 54 PTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSD--IRTSS 111
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
G F++ ++D+ I+ IE +IA T LP +GE+ +L YE GQKY+ H+D F+
Sbjct: 112 GMFLNKAQDE--IVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYFHDKANQV 169
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFY 247
R+A+ L+YLSDVE+GGET+FP L D + G VKP++GD LLF+
Sbjct: 170 MGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKPQKGDALLFF 229
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
SL + + D SLHGSCPVI+GEKW ATKWI
Sbjct: 230 SLHLDASTDTKSLHGSCPVIEGEKWSATKWIH 261
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 45 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 102
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 103 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 161 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 220
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHGSCP +KG+KW ATKWI
Sbjct: 221 NDPASLHGSCPTLKGDKWSATKWIH 245
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 132/220 (60%), Gaps = 9/220 (4%)
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
T + I + +SW+PRA + F + +C +I+ AK LK S +A + + ++
Sbjct: 29 TSSSIINPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE 88
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTSSG FI+ ++D I+ IE KIA T LP+ +GE VLRYE GQKYD HYD F+
Sbjct: 89 -VRTSSGMFITKAKDP--IVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFS 145
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKP 238
+ R+A+ L+YL+DVE+GGET+FP + S D +C G+ VKP
Sbjct: 146 DKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKP 205
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
RRGD LLF+SL+P D +S+H CPVI+GEKW ATKWI
Sbjct: 206 RRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWI 245
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 128/206 (62%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S E+C +I AK +L+ S +A + G+++ S RTSSG F+
Sbjct: 57 LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLLK 114
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ I+ IE +IA T LP +GE+ +L YE G+KY+ H+D F+ R+
Sbjct: 115 AQDE--IVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLLGGHRI 172
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YL+ VEEGGET+FP G F D + C G V P++GD LLF+SL P+
Sbjct: 173 ATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDALLFFSLHPDA 232
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D +SLHGSCPVI GEKW ATKWI
Sbjct: 233 TTDPSSLHGSCPVIAGEKWSATKWIH 258
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
LSWRPRA F +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPVIEGQKWSATKWIH 239
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 141/222 (63%), Gaps = 11/222 (4%)
Query: 64 ETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
+TG +++ P +V LS +PRA + F SAE+CQ +I +AK +L S +A G++V
Sbjct: 47 KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVT 106
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S + RTS+G F+ ++D+ I+ IE +IA T LP +GE +LRYE GQKY+ H+D
Sbjct: 107 SKE--RTSTGMFLHKAQDE--IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFD 162
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL--DSGYDYKKC--IGLKVK 237
F R+A+ L+YLS+VE+GGET+FP + + L + D +C +G V+
Sbjct: 163 FFQDPGNIAIGGHRIATILMYLSNVEKGGETVFP-NSPVKLSEEEKADLSECGKVGYGVR 221
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ GD LLF+S+ PN T D TS HGSCPVI+GEKW ATKWI
Sbjct: 222 PKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIH 263
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G L D+ + G VKP +GD LLF+SL P+
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 214
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 215 TTDSDSLHGSCPVIEGQKWSATKWIH 240
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 98/230 (42%), Positives = 135/230 (58%), Gaps = 11/230 (4%)
Query: 57 ESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ 116
ES + + A++ + +SW+PRA + F + +C +++ A+ LK S++A
Sbjct: 21 ESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADND 80
Query: 117 -GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
G++ ST RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQK
Sbjct: 81 SGKSKLST--VRTSSGMFISKNKDP--IVSGIEDKISAWTFLPKENGEDIQVLRYEHGQK 136
Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY----DYKKC 231
Y+SHYD F RLA+ L+YLS+V +GGET+FP Y D +C
Sbjct: 137 YESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSEC 196
Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
G+ VKP++GD LLF+SL PN D SLHG CPV++GEKW ATKWI
Sbjct: 197 AKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 132/226 (58%), Gaps = 9/226 (3%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETV 120
H E + ++ +V+SW+PRA + NF + E+C+ +I A ++ S +A Q G++V
Sbjct: 61 HIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSV 120
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
R S+G F+ +D+ I+ IE +IA T +P +GE V+ YE+GQ YD HY
Sbjct: 121 --VHDVRKSTGAFLDRGQDE--IVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHY 176
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKV 236
D F QR+A+ L+YLS+VEEGGETMFP F + + C +GL +
Sbjct: 177 DYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCGKMGLSI 236
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
KP+ GD LLF+S+ PN T+D +LH +CPVIKG KW TKW+ E
Sbjct: 237 KPKMGDALLFWSMKPNATLDALTLHSACPVIKGNKWSCTKWMHPTE 282
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F S E+C +I AK +L S +A + GE++ES + RTSSG FI
Sbjct: 21 LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQE--RTSSGMFIFK 78
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+ED+ I+ IE +IA T LP+ +GE +LRYE GQKY++H D F + R
Sbjct: 79 TEDE--IVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRA 136
Query: 197 ASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLSDV++GGET+FP E D + G VKP +GD LLF+SL P+
Sbjct: 137 ATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCAKKGYAVKPNKGDALLFFSLHPDA 196
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLH SCPVI+GEKW ATKWI
Sbjct: 197 TPDPGSLHASCPVIEGEKWSATKWIH 222
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 126/217 (58%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +K ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRNK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
F+S+ P+ T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 144/253 (56%), Gaps = 10/253 (3%)
Query: 33 TFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNF 90
FL ++ +L L+ K + G +S++ P +V LSW PRA + F
Sbjct: 12 CFLCFEISVSSIRLPGLDQDAKATHGSVLRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGF 71
Query: 91 ASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEH 149
S E+C +I AK +L+ S +A + G+++ S RTSSG F++ ++D+ I+ IE
Sbjct: 72 LSDEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLNKAQDE--IVAGIEA 127
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
+IA T LP +GE+ +L YE GQKY+ H+D F+ R+A+ L+YLSDVE+G
Sbjct: 128 RIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKG 187
Query: 210 GETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPV 266
GET+F L D + G VKPR+GD LLF+SL + + D SLHGSCPV
Sbjct: 188 GETIFSNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPV 247
Query: 267 IKGEKWVATKWIR 279
I+GEKW ATKWI
Sbjct: 248 IEGEKWSATKWIH 260
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 95/209 (45%), Positives = 125/209 (59%), Gaps = 10/209 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW PRA + F + +C +I+ AK LK S +A + ++ RTSSG FI +
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE-VRTSSGAFIHKA 102
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRYE GQKYD+H+D F + R+A
Sbjct: 103 KDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMA 160
Query: 198 SFLLYLSDVEEGGETMFPFENG-----IFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
+ L+YLSDVE+GGET+F ++ D C G+ VKPR+GD LLF+SL
Sbjct: 161 TVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLH 220
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
PN D +SLHG CPVI+GEKW ATKWIR
Sbjct: 221 PNAIPDTSSLHGGCPVIEGEKWSATKWIR 249
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLSDV +GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDA 223
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 224 TTDSDSLHGSCPVIEGQKWSATKWIH 249
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLSDV +GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDA 223
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 224 TTDSDSLHGSCPVIEGQKWSATKWIH 249
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 127/207 (61%), Gaps = 9/207 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS 135
LSW PRA + F S E+C +I AK +L+ S + + GE+ +S RTSSG F++
Sbjct: 35 LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLT 92
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D F + R
Sbjct: 93 KRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHR 150
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 252
+A+ L+YLS+V +GGET+FP G D + KC G VKPR+GD LLF++L N
Sbjct: 151 IATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLN 210
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
GT D SLHGSCPVI+GEKW AT+WI
Sbjct: 211 GTTDPNSLHGSCPVIEGEKWSATRWIH 237
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 131/205 (63%), Gaps = 9/205 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
+SWRPRA + NF S E+C+ + A+KRL K + + + G++++ST RTSSGTF++
Sbjct: 41 VSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDST--VRTSSGTFLAR 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
ED+ ++ IE +I+ TM+P+ +GEA +L+Y GQKY+ H D F+ +Y +
Sbjct: 99 GEDE--VVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH-DKYNSRTENGG 155
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
QR+A+ L+YLS EEGGET+FP+ G+ GL VK +G LLFYSL PNG
Sbjct: 156 QRVATILMYLSTPEEGGETVFPYAEKKVEGEGWSECARKGLAVKAVKGSALLFYSLKPNG 215
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
D+ S HGSCP + GEKW AT+WI
Sbjct: 216 EEDQASTHGSCPTLAGEKWSATRWI 240
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
LSWRPRA F +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCP I+G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPAIEGQKWSATKWIH 239
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 131/224 (58%), Gaps = 13/224 (5%)
Query: 65 TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKL 87
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG FI ++D I+ +E KI+ T+LP+ +GE VLRYE GQKYD HYD
Sbjct: 88 SE--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYD 143
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY----DYKKCI--GLK 235
F + R+A+ L+YL+DV +GGET+FP G D +C G+
Sbjct: 144 YFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIA 203
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
VKPRRGD LLF+SL+PN D SLH CPVI+GEKW ATKWI
Sbjct: 204 VKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 247
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 126/205 (61%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+SW+PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSD--VRTSSGTFLRK 111
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIRGGHRY 169
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGT 254
A+ LLYL+DV EGGET+FP + + +C G+ VKPR+GD LLF++L P+GT
Sbjct: 170 ATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQKGIAVKPRKGDALLFFNLKPDGT 229
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHG C VI+GEKW ATKWIR
Sbjct: 230 TDPVSLHGGCAVIRGEKWSATKWIR 254
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 128/208 (61%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI 134
++LSW PRA + NF S E+C+ +I AK +K + + + G + +S RTSSG F+
Sbjct: 78 EILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSR--VRTSSGMFL 135
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ ++ IE +IA + +P HGE VL YE+GQKY++H+D F Q
Sbjct: 136 RRGRDR--VIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQ 193
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 250
R A+ L+YLSDVEEGGET+FP N + + +C GL +KP+ G+ LLF+S
Sbjct: 194 RTATLLMYLSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTR 253
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D +SLHGSCPVI+G KW ATKW+
Sbjct: 254 PDATLDPSSLHGSCPVIRGNKWSATKWM 281
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 99/240 (41%), Positives = 138/240 (57%), Gaps = 13/240 (5%)
Query: 49 LEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
+ ++ + + +G + + A I + +SW+PRA + F + +C +I+ AK LK
Sbjct: 15 VSLICESHGAGSSYAGSASAIIDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELK 74
Query: 109 PSQLALR-QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
S +A G++ S RTSSG FIS ++D I+ IE KI+ T LP+ +GE V
Sbjct: 75 RSAVADNLSGDSKLSD--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQV 130
Query: 168 LRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP------FENGIF 221
LRYE GQKYD HYD F + R+A+ L+YL++V GGET+FP F
Sbjct: 131 LRYEHGQKYDPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRG 190
Query: 222 LDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
++ D +C G+ VKPRRGD LLF+SL+PN D SLH CPVI+GEKW ATKWI
Sbjct: 191 SETIDDLSECAKKGIAVKPRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWIH 250
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 126/206 (61%), Gaps = 9/206 (4%)
Query: 79 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFISA 136
SW PRA + F S E+C +I AK +L+ S + + GE+ +S RTSSG F++
Sbjct: 1 SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLTK 58
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D F + R+
Sbjct: 59 RQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 116
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+V +GGET+FP G D + KC G VKPR+GD LLF++L NG
Sbjct: 117 ATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNG 176
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+GEKW AT+WI
Sbjct: 177 TTDPNSLHGSCPVIEGEKWSATRWIH 202
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 91/213 (42%), Positives = 129/213 (60%), Gaps = 9/213 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+++SW PR + NF + E+C+ +I AK ++ S + + G ++ES RTSSGTF+
Sbjct: 86 EIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESR--VRTSSGTFL 143
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE++IA T +P +GE VL Y++G+KY H+D F
Sbjct: 144 ARGRDK--IVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGGD 201
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G F + + C GL +KP+ + LLF+S+
Sbjct: 202 RIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCGKKGLSIKPKMRNALLFWSIK 261
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
P+ T D SLHGSCPVIKG KW +TKWIR E
Sbjct: 262 PDATYDPLSLHGSCPVIKGNKWSSTKWIRIGEH 294
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/210 (44%), Positives = 126/210 (60%), Gaps = 19/210 (9%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+SW PRA F +C+ +I+ AK + S + G++++ST RTS+GTF
Sbjct: 86 VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDST--VRTSTGTFFGR 143
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN------PAEYGP 190
ED+ +++ IE +I+ T LP+ +GE +L YE GQKY++H+D F+ P G
Sbjct: 144 EEDE--VIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGG- 200
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYS 248
QR+A+ L+YL+ EEGGET+FP +G + +C G VK RRGD LLFYS
Sbjct: 201 ---QRIATVLMYLTTAEEGGETVFPMAANKV--TGPQWSECARGGAAVKSRRGDALLFYS 255
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
L PNG D TSLHGSCP KGEKW ATKWI
Sbjct: 256 LLPNGETDPTSLHGSCPTTKGEKWSATKWI 285
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 98/219 (44%), Positives = 128/219 (58%), Gaps = 11/219 (5%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
A I + +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 33 AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F
Sbjct: 91 RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFEN----GIFLDSGYDYKKC--IGLKVKPRR 240
+ R+A+ L+YL++V +GGET+FP ++ D +C G+ VKPRR
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGKKGVAVKPRR 208
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
GD LLF+SL PN D SLH CPVI+GEKW ATKWI
Sbjct: 209 GDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 247
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/231 (41%), Positives = 137/231 (59%), Gaps = 10/231 (4%)
Query: 55 ENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
E + + G A ++ P +V LSW PRA + NF + E+C +I +K +L+ S +
Sbjct: 29 EEGNKITRGSVFGAKVKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMV 88
Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
A + G++++S RTSSG F++ +D+ I+ IE +IA T LP +GE+ VL Y
Sbjct: 89 ADNESGKSIQSE--VRTSSGMFLNKQQDE--IVSGIEARIAAWTFLPVENGESMQVLHYM 144
Query: 172 IGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF---LDSGYDY 228
G+KY+ H+D F+ R+A+ L+YLS+VE+GGET+FP G D +
Sbjct: 145 NGEKYEPHFDFFHDKANQRLGGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSE 204
Query: 229 KKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
G VKPR+GD LLF+SL + T D SLHGSCPVI+GEKW ATKWI
Sbjct: 205 CAHKGYAVKPRKGDALLFFSLHLDATTDSKSLHGSCPVIEGEKWSATKWIH 255
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 135/216 (62%), Gaps = 14/216 (6%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P + + +P+A F NF SAE+C ++ AK L PS + G +V ST RTS+G F
Sbjct: 278 PLVLSAVKPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAGGTSVPST--IRTSAGMF 335
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM 192
+ + DKT LE IE++IA A+ P+ +GE +LRY++GQKYD H+D F+ A P+
Sbjct: 336 LRKAADKT--LENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKR 393
Query: 193 S-QRLASFLLYLSDVEEGGETMFP----FENGIFLDSG--YDYKKCI--GLKVKPRRGDG 243
QR+A+ L+YL + +EGGET+FP E + G +++ +C GL VK +GD
Sbjct: 394 GGQRMATMLIYLENTKEGGETIFPRGTRAETFDLTEEGNPHEWSECTKHGLPVKSVKGDA 453
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
LLF+SL + +D SLHG+CPV+KG+KW A KWIR
Sbjct: 454 LLFWSLTDDYKLDMGSLHGACPVVKGQKWTAVKWIR 489
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 130/222 (58%), Gaps = 13/222 (5%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 567 GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 626
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FI ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 627 E--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 682
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI------GLKV 236
F + R+A+ L+YL+DV +GGET+FP G + + + G+ V
Sbjct: 683 FADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAV 742
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
KPRRGD LLF+SL+PN D SLH CPVI+GEKW ATKWI
Sbjct: 743 KPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATKWI 784
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/207 (44%), Positives = 128/207 (61%), Gaps = 9/207 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 197 ASFLLYLSDVEEGGETMFP-FENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 252
A+ L+YLSDV +GGET+FP E G L D + C G VKP +GD LLF+SL P+
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPD 223
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 224 ATTDSDSLHGSCPVIEGQKWSATKWIH 250
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +I AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 RQDE--VVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP G + +C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLTQHKDETASECAKNGYAVKPMKGDALLFFSLHPDA 213
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 214 TTDPDSLHGSCPVIEGQKWSATKWIH 239
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 128/221 (57%), Gaps = 13/221 (5%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
A I + +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 33 AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F
Sbjct: 91 RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFEN------GIFLDSGYDYKKC--IGLKVKP 238
+ R+A+ L+YL++V +GGET+FP ++ D +C G+ VKP
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKP 208
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
RRGD LLF+SL PN D SLH CPVI+GEKW ATKWI
Sbjct: 209 RRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 249
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 129/209 (61%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PR + +F S ++ +++ A+ LK S +A E + RTSSGTFI S
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F+ + R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLF 250
+ L+YL+DV EGGET+FP F +SG + + +C G+ VKPR+GD LLF++L
Sbjct: 178 TVLMYLTDVAEGGETVFPLAEE-FTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLS 236
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ + D SLH CPVIKGEKW ATKWIR
Sbjct: 237 PDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 129/209 (61%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PR + +F S ++ +++ A+ LK S +A E + RTSSGTFI S
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F+ + R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLF 250
+ L+YL+DV EGGET+FP F +SG + + +C G+ VKPR+GD LLF++L
Sbjct: 178 TVLMYLTDVAEGGETVFPLAEE-FTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLS 236
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ + D SLH CPVIKGEKW ATKWIR
Sbjct: 237 PDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 101/223 (45%), Positives = 131/223 (58%), Gaps = 13/223 (5%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 25 GSASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 84
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FIS ++D I+ IE KI+ T LP+ +GE V RYE GQKYD HYD
Sbjct: 85 D--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDY 140
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENG----IFLDSGYDYKKCI--GLKV 236
F + R+A+ L+YL+DV +GGET+FP ++ D +C G+ V
Sbjct: 141 FTDKVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAV 200
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
KPRRGD LLF+SL N T D +SLH CPVI+GEKW ATKWI
Sbjct: 201 KPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 243
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 91/217 (41%), Positives = 130/217 (59%), Gaps = 16/217 (7%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVES---------TKG 125
+VLSW PRA+ + NF + E+C+ +I AK + K + + G++ +S
Sbjct: 82 EVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSR 141
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F++ +DKT + IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 142 VRTSSGMFLNRGQDKT--IRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLD 199
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRG 241
QR+A+ L+YLSDVE+GGET+FP N + + +C G+ V+PR G
Sbjct: 200 EFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMG 259
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
D LLF+S+ P+ +D +SLH CPVI+G+KW ATKWI
Sbjct: 260 DALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWI 296
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/208 (43%), Positives = 128/208 (61%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+VLSW PRA + +F + E+C +I A+ L K + + G++ +S RTSSGTF+
Sbjct: 4 EVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSR--LRTSSGTFL 61
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D +++ IE +IA T +P GE VL+Y+ +KY+ HYD F+ A Q
Sbjct: 62 MRGQDP--VIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQ 119
Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
R+A+ L+YLS+VEEGGET+FP N + +C GL V+PR GD LLF+S+
Sbjct: 120 RIATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSMK 179
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ T+D TSLHG CPVIKG KW ATKW+
Sbjct: 180 PDATLDSTSLHGGCPVIKGTKWSATKWL 207
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/225 (44%), Positives = 132/225 (58%), Gaps = 17/225 (7%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 26 GSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 85
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 86 D--VRTSSGMFISKNKDP--IISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 141
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF------ENGIFLDSGYDYKKCI--GL 234
F + R+A+ L+YL++V +GGET+FP G ++ D +C G+
Sbjct: 142 FTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGT--ETSSDLSECAKKGI 199
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
VKP RGD LLF+SL N T D +SLH CPVI+GEKW ATKWI
Sbjct: 200 AVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 244
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 123/208 (59%), Gaps = 9/208 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA + F + +C +I+ AK LK S +A + + ++ RTSSG FI+
Sbjct: 39 VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE-VRTSSGMFIAKG 97
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F + R+A
Sbjct: 98 KDP--IIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIARGGHRMA 155
Query: 198 SFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
+ L+YLSDV +GGET+FP +S D +C G+ VKPRRGD LLF+SL P
Sbjct: 156 TVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDALLFFSLHP 215
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
D SLH CPVI+GEKW ATKWI
Sbjct: 216 TAIPDPNSLHAGCPVIEGEKWSATKWIH 243
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 122/205 (59%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
ED ++ IE ++A+ T +P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L YL+ VEEGGET+ P G+ GL VKP +GD L FYSL P+G+
Sbjct: 141 RVVTXLXYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALXFYSLKPDGS 200
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHGSCP +KG+KW ATKWI
Sbjct: 201 NDPASLHGSCPTLKGDKWSATKWIH 225
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 91/193 (47%), Positives = 120/193 (62%), Gaps = 8/193 (4%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIE 148
F S E+C +IA K +L+ S +A + G++V S RTSSG F+ +D+T + IE
Sbjct: 3 FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSE--IRTSSGMFLERRQDET--ITRIE 58
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
+IA T LP+ +GE +L YE GQKYD+HYD F+ R+A+ L+YLSDV++
Sbjct: 59 KRIAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKK 118
Query: 209 GGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 265
GGET+FP G L D + C G VKPR+GD LLF+S PN T D SLH SCP
Sbjct: 119 GGETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCP 178
Query: 266 VIKGEKWVATKWI 278
VI+GEKW AT+WI
Sbjct: 179 VIEGEKWSATRWI 191
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 99/224 (44%), Positives = 130/224 (58%), Gaps = 13/224 (5%)
Query: 65 TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKL 87
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG FI ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 88 SE--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYD 143
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI------GLK 235
F + R+A+ L+YL+DV +GGET+FP G + + + G+
Sbjct: 144 YFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIA 203
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
VKPRRGD LLF+SL+PN D SLH CPVI+GEKW AT+WI
Sbjct: 204 VKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIH 247
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 129/211 (61%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+SW PRA+ + NF S E+C+ +I AK + S + + + ++ RTSSGTF+
Sbjct: 81 EVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDV++GGET+FP G + + KC GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ ++D +SLHG CPV+KG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 133/225 (59%), Gaps = 22/225 (9%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
++ P + + +P+A NF SAE+C ++ AK+ L PS + G++V S RTS+
Sbjct: 38 ETPPLVLSATQPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSD--IRTSA 95
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPA 186
G F+ +DK I++ IE +IAR + P +GE +LRY++GQKYD H+D F NPA
Sbjct: 96 GMFLRKGQDK--IVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPA 153
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPF----------ENGIFLDSGYDYKKCI--GL 234
+ QRLA+ L+YL D ++GGET FP E S ++ C G+
Sbjct: 154 PK--RGGQRLATMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGI 211
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
VK RGD +LF+S+ +G +DR SLHG+CPVI+G+KW A KWIR
Sbjct: 212 PVKSVRGDAILFFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIR 256
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+SW PRA+ + NF + E+C+ +I+ AK + S + + + ++ RTSSGTF+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDV++GGET+FP G + + KC GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ ++D +SLHG CPV+KG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 128/208 (61%), Gaps = 10/208 (4%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
++SW+PR ++ F S ++C ++ K++LK S +A + G++V S RTSSG F+
Sbjct: 50 IISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSE--VRTSSGMFLD 107
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D ++ IE +IA T+LPQ + E +LRYE GQKYD H+D F Q R
Sbjct: 108 KQQDP--VVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQGGHR 165
Query: 196 LASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
A+ L YLS VE+GGET+FP +E+ DS D K GL VK +GD +LF++L P
Sbjct: 166 YATVLTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCAK-KGLAVKAVKGDSVLFFNLQP 224
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+GT D SLHGSCPVI+GEKW A KWI
Sbjct: 225 DGTPDPLSLHGSCPVIEGEKWSAPKWIH 252
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 124/205 (60%), Gaps = 7/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGT 254
A+ LLYL+DV EGGET+FP +C G+ V+PR+GD LLF++L P+GT
Sbjct: 170 ATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGT 229
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLHG CPVIKGEKW ATKWIR
Sbjct: 230 TDSVSLHGGCPVIKGEKWSATKWIR 254
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 127/210 (60%), Gaps = 12/210 (5%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
+VL+W PR + F SAE+C +IA A RL S + R G +ES RTS
Sbjct: 61 EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 116
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
+G F+S + + +++ IE +IA +M+P +GE VLRYE Q Y H+D F+
Sbjct: 117 TGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 176
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
+ QR+A+ L+YLSDVEEGGET+FP + G + +K GL VKPR+GD +LF+S
Sbjct: 177 KRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILFWSA 234
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+G +D SLHG C V++GEKW ATKW+R
Sbjct: 235 ALDGNVDSNSLHGGCSVLRGEKWSATKWLR 264
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 126/210 (60%), Gaps = 12/210 (5%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
+VL+W PR F SAE+C +IA A RL S + R G +ES RTS
Sbjct: 60 EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 115
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
+G F+S + + ++E IE +IA +M+P +GE VLRYE Q Y H+D F+
Sbjct: 116 TGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 175
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
+ QR+A+ L+YLSDVEEGGET+FP + G + +K GL VKPR+GD +LF+S
Sbjct: 176 KRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILFWSA 233
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+G +D SLHG C V++GEKW ATKW+R
Sbjct: 234 ALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/184 (46%), Positives = 120/184 (65%), Gaps = 7/184 (3%)
Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
+ PS LA R GE E+ + RTS GTF+ D + L +E KIA T+LP+T+GE +N
Sbjct: 1 MYPSGLAYRPGEKAEAEQQVRTSKGTFLGG--DSSPALRWLEDKIAAVTLLPRTNGEFWN 58
Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFENGIFLDSG 225
VL Y+ Q YDSH D+F+P EYGPQ SQR+A+ ++ LSD GGET+F E ++
Sbjct: 59 VLNYKHSQHYDSHMDSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKP 118
Query: 226 Y-DYKKCI---GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
++ C GLK KPR GD +LF+S P+G +D +LHGSCPV+ G KWVA KW+R++
Sbjct: 119 ISNWTDCDADGGLKYKPRAGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLRNK 178
Query: 282 EQHE 285
+++
Sbjct: 179 GEYD 182
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 129/211 (61%), Gaps = 7/211 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+SW PRA+ + NF + E+C+ +I+ AK + S + + + ++ RTSSGTF+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDV++GGET+FP G + + KC GL V P+ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMRP 257
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ ++D +SLHG CPV+KG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 96 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
GL VKP +GD +LF+SL +G D SLHGSCPVI+GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 96 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
GL VKP +GD +LF+SL +G D SLHGSCPVI+GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 96 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
GL VKP +GD +LF+SL +G D SLHGSCPVI+GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 30 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 89
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 90 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 145
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP G D + +C
Sbjct: 146 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 205
Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
GL VKP +GD +LF+SL +G D SLHGSCPVI+GEKW A KWIR
Sbjct: 206 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 255
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 124/207 (59%), Gaps = 8/207 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+ +SW PR + F S +C ++ AKK+++ S +A + G++V+S RTSSG F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D ++ IE +IA T LPQ + E VLRYE GQKY+ H+D F+ +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIF---LDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R A+ L+YLS V EGGET+FP G D+ + GL VKP +GD +LF+SL
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLFFSLHA 221
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
+GT D SLHGSCPVI+GEKW A KWI
Sbjct: 222 DGTPDPLSLHGSCPVIRGEKWSAPKWI 248
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C + + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S+ E K +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YL+D EGGET FP G K GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVK--GLCVKPNKGDAVLFWSMGLDGE 237
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQE 282
D S+HG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C + + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S+ E K +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YL+D EGGET FP G K GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVK--GLCVKPNKGDAVLFWSMGLDGE 237
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQE 282
D S+HG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 122/208 (58%), Gaps = 11/208 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW+PRA + F + +C +I+ AK LK S +A G++ S RTSSG FIS
Sbjct: 39 VSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSE--VRTSSGMFISK 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F + R+
Sbjct: 97 KKDP--IVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVRGGHRM 154
Query: 197 ASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ LLYL++V GGET+FP L++ D +C G+ VKPRRGD LLF+SL
Sbjct: 155 ATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGDALLFFSLH 214
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
D SLH CPVI+GEKW ATKWI
Sbjct: 215 TTAIPDTDSLHAGCPVIEGEKWSATKWI 242
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 127/208 (61%), Gaps = 11/208 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTFIS
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 98 GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155
Query: 197 ASFLLYLSDVEEGGETMFPF----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ L+YLS+V +GGET+FP + ++ D C G+ VKPR+GD LLF++L
Sbjct: 156 ATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLH 215
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ D SLHG CPVI+GEKW ATKWI
Sbjct: 216 PDAIPDPLSLHGGCPVIEGEKWSATKWI 243
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 132/225 (58%), Gaps = 13/225 (5%)
Query: 64 ETGDASIQSIPF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-G 117
+ GD + PF + +SW+PR + F S +C ++ AKK+++ S +A Q G
Sbjct: 28 DGGDVVAPAPPFNSSRVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSG 87
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
++V S RTSSG F++ +D ++ IE +IA T LPQ + E +LRYE GQKY+
Sbjct: 88 KSVMSE--VRTSSGMFLNKRQDP--VVSRIEERIAAWTFLPQENAENMQILRYEHGQKYE 143
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GL 234
H+D F+ + R A+ L+YLS V++GGET+FP G D + +C GL
Sbjct: 144 PHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFSECAHQGL 203
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
VKP +GD +LF+SL +G D SLHGSCPVI+GEKW A KWI
Sbjct: 204 AVKPVKGDAVLFFSLHVDGVPDPLSLHGSCPVIQGEKWSAPKWIH 248
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 134/222 (60%), Gaps = 23/222 (10%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG----ETVESTKGTRTSS 130
+++SW+PRAL F + +C +I+ A+ RL+PS++ R G ++V + +G +SS
Sbjct: 15 IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGL-SSS 73
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-- 188
GTF++ +D ++ +E +I AT LP +H E VL+YE+GQKY +HYD E
Sbjct: 74 GTFLTKRQDS--VVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQ 131
Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKC--IGLKVK 237
G Q R A+ L+YLSDVEEGGET FP +G ++D G Y +C G+ VK
Sbjct: 132 LAIRRGEQGGSRYATMLMYLSDVEEGGETSFP--HGRWIDEGAQAQPPYSECGSRGVAVK 189
Query: 238 PRRGDGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWI 278
PR+GD +LFYSL +G + D SLH CPV KG K+ AT WI
Sbjct: 190 PRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWI 231
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 127/208 (61%), Gaps = 11/208 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTFIS
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 98 GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155
Query: 197 ASFLLYLSDVEEGGETMFPF----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ L+YLS+V +GGET+FP + ++ D C G+ VKPR+GD LLF++L
Sbjct: 156 ATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALLFFNLH 215
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ D SLHG CPVI+GEKW ATKWI
Sbjct: 216 PDAIPDPLSLHGGCPVIEGEKWSATKWI 243
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 125/205 (60%), Gaps = 8/205 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F S +C +I AK +L+ S +A G++V S RTSSG F+
Sbjct: 56 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 113
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ ++ +E +IA T+LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 114 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 171
Query: 197 ASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ L+YLS+VE+GGET+FP F+ D + G VK ++GD LLF+SL +
Sbjct: 172 ATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDA 231
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
T D SLHGSCPVI GEKW ATKWI
Sbjct: 232 TTDERSLHGSCPVIAGEKWSATKWI 256
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 126/208 (60%), Gaps = 9/208 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+VLSW PRA + +F + +C +I AK L K + + G++ +S RTSSGTF+
Sbjct: 3 EVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSR--VRTSSGTFL 60
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D I++ IE +IA T +P GE VL+Y +KY+ HYD F+ A Q
Sbjct: 61 VRGQDH--IIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQ 118
Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVE+GGET+FP N + +C GL V+PR GD LLF+S+
Sbjct: 119 RIATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSMK 178
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+ +D TSLHG+CPVI+G KW ATKW+
Sbjct: 179 PDAKLDPTSLHGACPVIQGTKWSATKWL 206
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 125/210 (59%), Gaps = 8/210 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES--TKGTRTSSGTFI 134
LSW PR + F S E+C I AK +L+ S +A GE+VES + S +FI
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFI 118
Query: 135 SA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
+ S + I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 178
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSL 249
R+A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L
Sbjct: 179 GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNL 238
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
PN T D SLHGSCPV++GEKW AT+WI
Sbjct: 239 HPNATTDSNSLHGSCPVVEGEKWSATRWIH 268
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/207 (40%), Positives = 130/207 (62%), Gaps = 5/207 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+V+SW+PR + NF SA++C +I A+ RL K + + G+ +ES RTS+G F+
Sbjct: 79 EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESK--VRTSTGMFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ ++ + ++ IE +IA +M+P +GE VLRYE Q Y +H+D F+ + Q
Sbjct: 137 NGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNLKRGGQ 196
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YL++ EGGET+FP G + K IG+ VKP+RGD +LF+S+ +G
Sbjct: 197 RVATMLMYLTEGVEGGETIFPQAGDKECSCGGEMK--IGVCVKPKRGDAVLFWSIKLDGQ 254
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+D TSLHG C V+ GEKW +TKW+R +
Sbjct: 255 VDPTSLHGGCKVLSGEKWSSTKWMRQR 281
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/219 (39%), Positives = 133/219 (60%), Gaps = 5/219 (2%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVES 122
E G ++ + +V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S
Sbjct: 46 EAGLLRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKS 105
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D
Sbjct: 106 D--VRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDY 163
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
F+ + QR+A+ L+YL+D EGGET F G + K GL VKP +GD
Sbjct: 164 FSDTFNLKRGGQRVATMLMYLTDGVEGGETHFLQAGDGECSCGGNVVK--GLCVKPNKGD 221
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+LF+S+ +G D S+H CPV+KGEKW ATKW+R +
Sbjct: 222 AVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWMRQK 260
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 85/207 (41%), Positives = 128/207 (61%), Gaps = 5/207 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD--VRTSSGMFV 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YL+D GGET FP G + K GL VKP +GD +LF+S+ +G
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQAGDGECSCGGNVVK--GLCVKPNKGDAVLFWSMGLDGN 233
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D S+H CPV+KGEKW ATKW+R +
Sbjct: 234 TDPNSIHSGCPVLKGEKWSATKWMRQK 260
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 123/208 (59%), Gaps = 7/208 (3%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 133
+VLSW PRA + NF + + ++ K ++ S++ ET +S RTSSG F
Sbjct: 1 MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVV--DNETGKSAPSKVRTSSGMF 58
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
++ ED ++E IE +IA+ T +P+ +GE +L Y+ ++Y H+D F+
Sbjct: 59 LNRGEDD--VIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGG 116
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
QR+A+ L+YLSDVE+GGET+FP + + +C G KP++GD L FYSL P
Sbjct: 117 QRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYSLTP 176
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+G +D SLH CPV+KG+KW ATKW+R
Sbjct: 177 DGRMDEKSLHAGCPVMKGDKWSATKWLR 204
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/223 (44%), Positives = 128/223 (57%), Gaps = 13/223 (5%)
Query: 65 TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
G AS P +V +SW PRA + F + +C +I+ AK LK S +A G++
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQL 84
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG FIS ++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 85 SD--VRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYD 140
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLK 235
F Q RLA+ L+YL++V +GGET+FP G D +C G+
Sbjct: 141 YFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIA 200
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
VKPRRGD LLF+SL N D SLH CPV++GEKW ATKWI
Sbjct: 201 VKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/223 (44%), Positives = 128/223 (57%), Gaps = 13/223 (5%)
Query: 65 TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
G AS P +V +SW PRA + F + +C +I+ AK LK S +A G++
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQL 84
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG FIS ++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 85 SD--VRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYD 140
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLK 235
F Q RLA+ L+YL++V +GGET+FP G D +C G+
Sbjct: 141 YFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIA 200
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
VKPRRGD LLF+SL N D SLH CPV++GEKW ATKWI
Sbjct: 201 VKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 92/221 (41%), Positives = 130/221 (58%), Gaps = 19/221 (8%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
S P + + P+A F NF + +C+ ++ AKK+L PS + +G +K RTS+G
Sbjct: 167 SPPLVLSNHEPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSK-IRTSAG 225
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
F+ +D T + IE +IA A+ LP+ +GE +LRYE GQKYD H+D F + P
Sbjct: 226 MFLGRGQDPT--VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSP 283
Query: 191 QMS-QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD---------YKKCI--GLKVKP 238
+ QR+A+ L+YL D EGGET+FP NG+ + +D + C G+ VK
Sbjct: 284 RRGGQRMATMLIYLEDTTEGGETIFP--NGVRPED-WDADEPGNHNSWSDCAKKGIPVKS 340
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
RGD +LF+SL + T+D SLHG+CPVI GEKW A KWIR
Sbjct: 341 HRGDAVLFWSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIR 381
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 127/219 (57%), Gaps = 21/219 (9%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTF 133
+ +SW PRA + NF + E+C ++ AK + L++ ++ T GT SG F
Sbjct: 2 IEQISWEPRAFVYHNFLTPEECAHLVNLAKA----TDGGLKRATVADARTGGTFPGSGAF 57
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM- 192
+ + D I+ IE +I+ M+P HGE +LRY G+KYD H+D F+ + +
Sbjct: 58 LLRNHDP--IVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFY 115
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLD----------SGYDYKKCI--GLKVKPRR 240
QR+A+ L+YLSDVE GGET+FP ++G +++ S D KC L VKPRR
Sbjct: 116 GQRVATVLMYLSDVESGGETVFP-KHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRR 174
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
GD LLF++ NG D TSLH CPV++GEKW ATKW+R
Sbjct: 175 GDALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWMR 213
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 9/214 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
+SW PR + F S +C+ +IA AK+ R++ S + + GE+V S TRTSSG F+
Sbjct: 40 VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE +IA TM P +GE+ +LRY G+KY+ H+D + + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 196 LASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 252
+A+ L+YLS+V+ GGET+FP E + + C G VKP +G +LF+SL+PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
T D SLHGSCPVI+GEKW ATKWI + E+
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDEN 249
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 126/209 (60%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTFI
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFIPK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 98 GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRI 155
Query: 197 ASFLLYLSDVEEGGETMFPFEN----GIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ L+YLS+V +GGET+FP + ++ D C G+ VKPR+GD LLF++L
Sbjct: 156 ATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLH 215
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ D SLHG CPVI+GEKW ATKWI
Sbjct: 216 PDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 127/204 (62%), Gaps = 12/204 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 140
P + NF + +C +I A+ +L+ S +A + G++V S RTSSG F++ ++D+
Sbjct: 28 PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSE--IRTSSGMFLNKAQDE 85
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
I+ +E +IA T LP +GEA VL YE+GQKY+ H+D F+ R+A+ L
Sbjct: 86 --IVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVL 143
Query: 201 LYLSDVEEGGETMFPFENGIFLDS---GYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 255
+YLSDV +GGET+FP N DS + +C G VKP +GD LLF+SL P+ T
Sbjct: 144 MYLSDVVKGGETVFP--NAETKDSQPKDDSWSECAKGGYSVKPNKGDALLFFSLRPDATT 201
Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
D++SLHGSCPVI+GEKW ATKWI
Sbjct: 202 DQSSLHGSCPVIEGEKWSATKWIH 225
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 9/214 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
+SW PR + F S +C+ +IA AK+ R++ S + + GE+V S TRTSSG F+
Sbjct: 40 VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE +IA TM P +GE+ +LRY G+KY+ H+D + + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 196 LASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 252
+A+ L+YLS+V+ GGET+FP E + + C G VKP +G +LF+SL+PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
T D SLHGSCPVI+GEKW ATKWI + E+
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDEN 249
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 127/210 (60%), Gaps = 11/210 (5%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFI 134
++LS PRA + NF S E+C+ +I AK + S + GE ES+ +RTSSG F+
Sbjct: 113 EILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESS--SRTSSGMFL 170
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I++ IE +IA T +P +GE +V+ Y +GQK + HYD +
Sbjct: 171 DRGKDK--IVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGP 228
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG--LKVKPRRGDGLLFYSLFPN 252
R+A+ L+YLSDVEEGGET+FP F KC G L VKP+ GD LLF+S+ P+
Sbjct: 229 RVATVLMYLSDVEEGGETVFPDAQPNFTS----VSKCSGDGLSVKPKMGDALLFWSMKPD 284
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
GT+D +SLHG PVI+G KW +TKW+ +E
Sbjct: 285 GTLDTSSLHGGSPVIRGNKWASTKWLHLRE 314
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 108/195 (55%), Gaps = 24/195 (12%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
F S E+C+ +I AK + S + G+ ES+ RTSSG F+ +DK I++ IE
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESS--ARTSSGRFLERGKDK--IVQNIE 427
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
+IA T +P+ A + + + G GP R+A+ L+YLSDVEE
Sbjct: 428 QRIADITSIPRM---ARDFMLFTAG--------GVVTKNGGP----RVATVLMYLSDVEE 472
Query: 209 GGETMFPFEN-GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVI 267
GGET+FP I S Y K GL VKP+ GD LLF S+ P+GT+D +SLHG PVI
Sbjct: 473 GGETVFPNAKPNINSVSKYPEK---GLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVI 529
Query: 268 KGEKWVATKWIRDQE 282
+G KW +TKW+ E
Sbjct: 530 RGNKWASTKWLHLTE 544
>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 162
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/161 (57%), Positives = 118/161 (73%), Gaps = 9/161 (5%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE---------SGLP 61
KLGLP+ LC FFFL GF+GS+ +S+D + R + R LE E + LP
Sbjct: 2 KLGLPSVFFLCFFFFLLGFFGSSLISQDYANARLRARILESSNGEEDITGGSSFGFDLLP 61
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
G+TGD SI SIPFQVLSWRPR LYFP FA+A+ C++II+ A+ +LKPS+LALR+GET++
Sbjct: 62 SGDTGDDSISSIPFQVLSWRPRVLYFPKFATADHCETIISIARSQLKPSRLALRKGETLD 121
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
ST+ RTSSG FISA EDKTGIL+ I+ KIARATM+P+ +G
Sbjct: 122 STREIRTSSGMFISADEDKTGILDFIDEKIARATMIPRANG 162
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 124/209 (59%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F S +C +I AK +L+ S +A G++V S RTSSG F+
Sbjct: 35 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 92
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ ++ +E +IA T+LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 93 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 150
Query: 197 ASFLLYLSDVEEGGETMFPFENGIF------LDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
A+ L+YLS+VE+GGET+FP + D + G VK ++GD LLF+SL
Sbjct: 151 ATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKGDALLFFSLN 210
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ T D SLHGSCPVI GEKW ATKWI
Sbjct: 211 LDATTDERSLHGSCPVIAGEKWSATKWIH 239
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/223 (44%), Positives = 127/223 (56%), Gaps = 13/223 (5%)
Query: 65 TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
G AS P +V +SW PRA + F + +C +I+ AK LK S +A G++
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQL 84
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG IS ++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 85 SD--VRTSSGMLISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYD 140
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLK 235
F Q RLA+ L+YL++V +GGET+FP G D +C G+
Sbjct: 141 YFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIA 200
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
VKPRRGD LLF+SL N D SLH CPV++GEKW ATKWI
Sbjct: 201 VKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 124/218 (56%), Gaps = 10/218 (4%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
D +I+ + +SWRP A + F + E+C + A A L S + G +V S
Sbjct: 48 DRAIEFDRIERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSD-- 105
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F+ ED ++ IE +IA T +P++HGE F VLRYE GQ+Y H+D F
Sbjct: 106 IRTSSGMFLLRGEDD--VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQD 163
Query: 186 A--EYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIG--LKVKPRR 240
+ + QR+A+ L+YL+DVEEGGET+FP E G G D C L VKPR+
Sbjct: 164 EFNQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRK 223
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
GD L F SL NGT D S H CPV+KG K+ ATKW+
Sbjct: 224 GDALFFRSLHHNGTSDAMSSHAGCPVVKGVKFSATKWM 261
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 90/209 (43%), Positives = 126/209 (60%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTFIS
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ LLYLS+V +GGET+FP F ++ D C G+ VKP++G+ LLF++L
Sbjct: 157 ATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 216
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ D SLHG CPVI+GEKW ATKWI
Sbjct: 217 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 125/210 (59%), Gaps = 13/210 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW+PR + +F S ++ +++ A+ LK S +A G++ S RTS GTFIS
Sbjct: 55 ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSE--VRTSYGTFISK 112
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KIA T LP+ +GE VLRY+ G+K + +D F + R+
Sbjct: 113 GKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVRGGHRV 170
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSL 249
A+ LLYL+DV EGGET+FP F D+G K +C G+ VKPR+GD LLF++L
Sbjct: 171 ATVLLYLTDVAEGGETVFPLAKD-FTDTGLHDKDTTLSECAQKGIAVKPRKGDALLFFNL 229
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
P+ D SLHG C VIKGEKW ATKWIR
Sbjct: 230 RPDAATDPLSLHGGCTVIKGEKWTATKWIR 259
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 90/209 (43%), Positives = 126/209 (60%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTFIS
Sbjct: 39 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 97 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 154
Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ LLYLS+V +GGET+FP F ++ D C G+ VKP++G+ LLF++L
Sbjct: 155 ATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 214
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ D SLHG CPVI+GEKW ATKWI
Sbjct: 215 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 243
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 122/206 (59%), Gaps = 9/206 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I TA LK + GE +E++ RTS+ +
Sbjct: 84 EVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETS--YRTSTEYVV 141
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I+ IE +IA T +P HGE +V+RY +GQ Y+ H D F Q
Sbjct: 142 ERGKDK--IVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQ 199
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLS+VE GGET+FP N F + + +C GL +KP+ GD LLF+S+
Sbjct: 200 RIATMLMYLSNVEGGGETVFPIANANFSSVPWWNELSECGQTGLSIKPKMGDALLFWSMK 259
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATK 276
P+ T+D +LH +CPVIKG KW TK
Sbjct: 260 PDATLDPLTLHRACPVIKGNKWSCTK 285
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 127/209 (60%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ S RTSSGTFIS
Sbjct: 41 VSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ LLYLS+V +GGET+FP F ++ D C G+ VKP++G+ LLF++L
Sbjct: 157 ATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 216
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ D SLHG CPVI+GEKW ATKWI
Sbjct: 217 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 125/218 (57%), Gaps = 20/218 (9%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+V++W+PRAL F S +C II A L+ S + +G ++ RTSSG FI
Sbjct: 42 VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLDE--IRTSSGMFI 99
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D ++ +E ++A T LP +H E VLRYE+GQKY +H+D + E QM
Sbjct: 100 LKGHD--AVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRA 157
Query: 195 -------RLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKCI--GLKVKPRRG 241
R A+ L+YLSDVEEGGET FP +G +LD G Y +C G+ VKPR+G
Sbjct: 158 KGVLGGLRTATLLMYLSDVEEGGETAFP--HGRWLDEGVQAAPPYTECASKGVVVKPRKG 215
Query: 242 DGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWI 278
D +LF+SL NG D SLH CPV++G K+ ATKW+
Sbjct: 216 DAILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWV 253
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 127/218 (58%), Gaps = 15/218 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW+PRA + S E+C+ I+ AK +K S + GE T RTS TF++
Sbjct: 83 ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEI--KTDPIRTSKQTFLAR 140
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQMS-- 193
K ++ +E +++R TMLP +GE +L Y +G+KY +H+D + G Q+S
Sbjct: 141 G--KYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSAD 198
Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK---CI--GLKVKPRRGDGLLF 246
QR+A+ LLYL D EEGGET FP I +S Y +K C G+ KP+RGDGLLF
Sbjct: 199 GGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGLLF 258
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+S+ P G ID+ S+H CPV+KG KW ATKWI + H
Sbjct: 259 FSITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPFH 296
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 89/209 (42%), Positives = 126/209 (60%), Gaps = 11/209 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTFIS
Sbjct: 41 VSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ LLYLS+V +GGET+FP + ++ D C G+ VKP++G+ LLF++L
Sbjct: 157 ATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 216
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ D SLHG CPVI+GEKW ATKWI
Sbjct: 217 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 123/205 (60%), Gaps = 11/205 (5%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
+A+ L+YLSDVEEGGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 228 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 287
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATK 276
+G++D TSLHG P++ W+ T
Sbjct: 288 DGSLDATSLHGEIPIL----WLLTN 308
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 118/202 (58%), Gaps = 11/202 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL--ALRQGETVESTKGTRTSSGTFISASED 139
PRA + F + E+C I+A +K L S + A G T T RTS+GTFIS + D
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGST---TSDIRTSTGTFISRAHD 57
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
T + IE +I + +P HGEA VLRYE GQ+Y +H+D F G + + R+A+
Sbjct: 58 PT--ITAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYF--FHKGGKRNNRIATV 113
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDR 257
LLYLSDVEEGGET+FP + Y +C G VK R+GD LLF+S+ P G +D
Sbjct: 114 LLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDP 173
Query: 258 TSLHGSCPVIKGEKWVATKWIR 279
S H CPVIKG KW ATKW+
Sbjct: 174 GSSHAGCPVIKGVKWTATKWMH 195
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 122/221 (55%), Gaps = 14/221 (6%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
+++ + LS P L F S ++C I TA+ ++ S++ L + RTS
Sbjct: 3 RAVTLETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQ 62
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
FI A +D IL I+++ A +P+ H E VLRY++ +KYDSH D F+PA Y
Sbjct: 63 SAFIRAHDD--AILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTK 120
Query: 191 Q----------MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
R+A+ YLSDVE+GGET+FP NG S D K GLKVKP +
Sbjct: 121 DKRTLALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMKDCK--TGLKVKPEK 178
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
G ++FYS+ P+G +D SLHG+CPV KG KW A KW+ ++
Sbjct: 179 GKVIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWVWNE 219
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 131/232 (56%), Gaps = 27/232 (11%)
Query: 67 DASIQSIPFQVLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
+A S P VLS + P+A NF S ++C ++ AK+ L PS + G +V S
Sbjct: 74 EAGFPSTPPLVLSSKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGDGGSSVASE-- 131
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-- 183
RTS+G F+ S+D T + IE +IAR + +P +GE +LRY+ GQKYD H+D F
Sbjct: 132 IRTSAGMFLRKSQDDT--VREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHD 189
Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLD------------SGYDYK 229
NPA + QR+A+ L+YL D EEGGET FP NG + + +
Sbjct: 190 KVNPAPK--RGGQRVATVLIYLVDTEEGGETTFP--NGRLPENFEEDEPDNPFAAHIKHT 245
Query: 230 KCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
C G+ VK RGD +LF+S+ +G +D SLHG+CPVI G+KW A KW+R
Sbjct: 246 DCAKNGIPVKSVRGDAILFFSMTKDGELDHGSLHGACPVIAGQKWTAVKWLR 297
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 7/199 (3%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA + NF S ++C+ ++A +K +L S + Q S RTS+GTFIS D
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQ-TGGSSLSEVRTSTGTFISRKYDD- 322
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ +E +I + +PQ+H EAF +LRYE GQ+Y +H+D F + R+A+ LL
Sbjct: 323 -IIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYF--FHKSGMRNNRIATVLL 379
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
YLSDVEEGGET+FP + + Y +C G +K R+GD LLF+S+ P G +D S
Sbjct: 380 YLSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKPGGELDAGS 439
Query: 260 LHGSCPVIKGEKWVATKWI 278
H CPVIKGEKW ATKW+
Sbjct: 440 SHAGCPVIKGEKWTATKWM 458
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/207 (43%), Positives = 123/207 (59%), Gaps = 11/207 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
LS +P+A + F +C I AK +L+ S + + G++V S RTS G F
Sbjct: 8 LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSN--IRTSDGMFFDR 65
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
ED I+E IE +IA T +P +GE VLRYE+GQKY+ H DAF+ ++ + S
Sbjct: 66 HEDD--IIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSD-KFNTEESKGG 122
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
QR+A+ L+YLSDVEEGGET+FP + +C G+ VK R+GD LLF+SL
Sbjct: 123 QRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECAQRGVAVKARKGDALLFWSLDI 182
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
+ +D SLHG CPVIKG KW ATKW+
Sbjct: 183 DSNVDELSLHGGCPVIKGTKWSATKWM 209
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 122/219 (55%), Gaps = 27/219 (12%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
+SW PRA + NF S E+C I+ A+ R+ R+ ++S G RTS
Sbjct: 1 VSWYPRAFHLHNFMSHEECDRILEIARPRV-------RRSTVIDSVTGQSKVDPIRTSEQ 53
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
TF++ I+ +E ++A T LP HGE +L+Y +GQKYD+H+D + G
Sbjct: 54 TFLNRG--TWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASGK 111
Query: 191 QMS----QRLASFLLYLSDVEEGGETMFPFENGIFLD-----SGYDYKKCI--GLKVKPR 239
Q++ R+A+ LLYLSDVEEGGET FP + + G + C + VKPR
Sbjct: 112 QLAAEGGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPR 171
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+GDGLLF+S+ ID S+H CPVI+GEKW ATKWI
Sbjct: 172 KGDGLLFWSVNNENAIDPHSMHAGCPVIRGEKWTATKWI 210
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 127/227 (55%), Gaps = 20/227 (8%)
Query: 63 GET---GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
GET G+ Q VLSW PR + + E+C ++ ++ RL+ S ++
Sbjct: 52 GETFNAGEHRAQDSRMVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVS------ 105
Query: 120 VESTKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
++T G RTSSG F E T +++ IE+++A TMLP +GE VLRYE
Sbjct: 106 -DATTGAGAVSDIRTSSGMFYERGE--TELVKRIENRLAMWTMLPVENGEGIQVLRYEKT 162
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENG--IFLDSGYDYKKC 231
QKYD H+D F+ R+A+ L+YL+ EEGGET+FP G + L +
Sbjct: 163 QKYDPHHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFPKVVGWVVQLTTTASAPCR 222
Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
GL VKP +GD +LF+S+ P+G D SLHGSCPVIKG KW ATKWI
Sbjct: 223 QGLAVKPAKGDAVLFWSIRPDGRFDPGSLHGSCPVIKGVKWSATKWI 269
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 132/231 (57%), Gaps = 21/231 (9%)
Query: 61 PHGETGDA-----SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR 115
P GDA +I + + LSW P A + F + +C+ I A LKPS + +
Sbjct: 7 PTTTAGDARGAGGTIDAGAIERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTV-VD 65
Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
+++ RTSSG F+ +ED ++E IE +IA T +P++HGE F VLRYE Q+
Sbjct: 66 ASTGGDASSEIRTSSGMFLGRAEDD--VIEAIEARIAAWTHVPESHGEGFQVLRYEKHQE 123
Query: 176 YDSHYDAFNPA-----EYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYK 229
Y +HYD F+ E G QR+ + L+YLSDVEEGGET+FP FE+G +G +
Sbjct: 124 YRAHYDYFHDKFNVKREKG---GQRMGTVLMYLSDVEEGGETVFPKFEDGT--PAGSEAS 178
Query: 230 KCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+C L V+PR+GD L F SL +G D S H CPVI+G K+ ATKW+
Sbjct: 179 ECARNKLAVRPRKGDALFFRSLRHDGVPDTFSEHAGCPVIRGVKFSATKWM 229
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 129/218 (59%), Gaps = 7/218 (3%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTK 124
D + + +V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S
Sbjct: 51 ADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD- 109
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTSSG F+++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 110 -VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFS 168
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE-NGIFLDSGYDYKKCIGLKVKPRRGDG 243
+ QR+A+ L+YL+D EGGET FP +G + G + GL VKP +GD
Sbjct: 169 DTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECICGG---RLVRGLCVKPNKGDA 225
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+LF+S+ +G D SLH C V+KGEKW ATKW+R +
Sbjct: 226 VLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKWMRQK 263
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 115/189 (60%), Gaps = 9/189 (4%)
Query: 99 IIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATML 157
+I AK + S + Q G++V S RTSSG F+ +DK +++ IE +IA +
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSR--VRTSSGMFLKRGKDK--VIQTIEKRIADFAFI 56
Query: 158 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
P +GE VL YE+GQKY+ HYD F QR+A+ L+YLSDVEEGGET+FP
Sbjct: 57 PVENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAA 116
Query: 218 NGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
F + D C GL VKP+RGD LLF+S+ P+ T+D +SLHG CPVI+G KW
Sbjct: 117 KANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWS 176
Query: 274 ATKWIRDQE 282
+TKW+ +E
Sbjct: 177 STKWMHLEE 185
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 97/286 (33%), Positives = 150/286 (52%), Gaps = 29/286 (10%)
Query: 12 LGLPTALLLCSFFFLAGF-------YGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHG- 63
L TA ++ FF F YG+ F P + + ++ + LP G
Sbjct: 12 LAFVTAGMIVGAFFQLAFILKLEDSYGTKF---------PSFKRVRKLQSDAYLQLPRGI 62
Query: 64 ----ETGDASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LR 115
+A++ I + +++SW PR + +F S+E+C + A AK RL+ S + ++
Sbjct: 63 SHWDNDTEAAVLRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVK 122
Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
G+ +ES RTSSG F+S+ E +++ IE +I+ + +P +GE VLRYE Q
Sbjct: 123 TGKGIESK--VRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQY 180
Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
Y H+D F+ + QR+A+ L+YLSD EGGET FP G K GL
Sbjct: 181 YKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCG--GKVVDGLS 238
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
VKP +G+ +LF+S+ +G D +S+HG C V+ G KW ATKW+R +
Sbjct: 239 VKPIKGNAVLFWSMGLDGQSDPSSIHGGCEVLSGVKWSATKWMRQR 284
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 126/222 (56%), Gaps = 24/222 (10%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
+SWRPRA + F S +C +I+ AK+ K + + GE+ ES T RTSSG F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103
Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
+D+ ++ IE +IA TMLP +GE+ +LRY G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKV 236
+D + + + R+A+ L+YLS+V+ GGET+FP E + + C G V
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAV 221
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
KP +G +LF+SL PN T+D SLHGSCPVI+GEKW ATKWI
Sbjct: 222 KPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWI 263
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 126/208 (60%), Gaps = 14/208 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSSGTFI 134
+VLS PRA + NF S E+C+ +I AK ++ S + G T + RTSSGTF+
Sbjct: 86 EVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVV--DGVTGQGILNSVRTSSGTFL 143
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFN---PAEYGP 190
+DK I++ +E +IA T +P +GE ++ YE+GQK++ HYD FN GP
Sbjct: 144 ERGKDK--IVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRITNNGGP 201
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP F +S Y GL VKP+ GD LLF+S+
Sbjct: 202 ----RVATVLMYLSDVEEGGETVFPNAKPNF-NSVSKYHPGKGLVVKPKMGDALLFWSVK 256
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+G++D SLHG PVI+G KW + K +
Sbjct: 257 PDGSLDTASLHGGSPVIRGSKWASNKLL 284
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 117/204 (57%), Gaps = 18/204 (8%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+S +PR + +F S ++ +I+ A+ LK S +A ++ G T S
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKST-------LS 99
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R A
Sbjct: 100 EDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYA 157
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 255
+ LLYL+DV EGGET+FP +C G+ V+PR+GD LLF++L P+GT
Sbjct: 158 TVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGTT 217
Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
D SLHG CPVIKGEKW ATKWIR
Sbjct: 218 DSVSLHGGCPVIKGEKWSATKWIR 241
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 127/222 (57%), Gaps = 10/222 (4%)
Query: 63 GETGDASIQSIP---FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE 118
GET + +P VLSW+PR + + E+C +I A+ RL+ S ++ GE
Sbjct: 35 GETYRSGRTDVPDSRMVVLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGE 94
Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
RTSSG F + E+ +++ IE ++A TMLP +GE VLRYE QKYD
Sbjct: 95 G--GVSDIRTSSGMFYTRGEND--VVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDP 150
Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC--IGLKV 236
H+D F+ R+A+ L+YL+ EEGGET+FP + ++ +C GL V
Sbjct: 151 HHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVFPKIPVPAGQTRANFSECGMKGLAV 210
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
KP +GD +LF+S+ P+G + SLHGSCPVI+G KW ATKWI
Sbjct: 211 KPVKGDAVLFWSIRPDGRFEPGSLHGSCPVIRGVKWSATKWI 252
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/267 (38%), Positives = 147/267 (55%), Gaps = 20/267 (7%)
Query: 5 AKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRP---KLRTL-EVVEKENESGL 60
+ R S++ L A+L+ S F + LS VPS K L +V K + G+
Sbjct: 10 SARKSSRSTLVFAVLIMSTFVILILLAFGILS--VPSNNAGSSKANDLTSIVRKTLQRGV 67
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGET 119
D S +++SW PRA + NF + E+C+ +I AK ++ S + + G++
Sbjct: 68 E-----DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKS 122
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
+S RTSSGTF++ DKT + IE +I+ T +P HGE VL YEIGQKY+ H
Sbjct: 123 TDSR--VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPH 178
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLK 235
YD F QR+A+ L+YLSDVEEGGET+FP G + + + +C GL
Sbjct: 179 YDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLS 238
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHG 262
VKP+ GD LLF+S+ P+ T+D +SLHG
Sbjct: 239 VKPKMGDALLFWSMTPDATLDPSSLHG 265
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 122/203 (60%), Gaps = 20/203 (9%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTFIS
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ LLYLS+V +GGET+FP D + C+ KP++G+ LLF++L + D
Sbjct: 157 ATVLLYLSNVTKGGETVFP-----------DAQVCL----KPKKGNALLFFNLQQDAIPD 201
Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
SLHG CPVI+GEKW ATKWI
Sbjct: 202 PFSLHGGCPVIEGEKWSATKWIH 224
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/222 (38%), Positives = 126/222 (56%), Gaps = 16/222 (7%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
S + + + LS RP + F S E+C I A ++K S ++L+ + + + RT
Sbjct: 255 SQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSSEWRT 314
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
S F+SA +D+ +L I+H++A T +P+ H E VLRY G+KYDSH+D F+P+ Y
Sbjct: 315 SQSAFLSARDDE--VLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFDPSAY 372
Query: 189 GPQMS----------QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-IGLKVK 237
S R A+ YL+DV +GGET+FP G + +K C IGLKVK
Sbjct: 373 RSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGA--PAPRSHKDCSIGLKVK 430
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE-KWVATKWI 278
P++G ++FYSL +G +D SLHG+CPV + KW A KWI
Sbjct: 431 PQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 123/207 (59%), Gaps = 5/207 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSD--VRTSSGMFV 118
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 119 NSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQ 178
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YL+D EGGET FP G + GL VKP +GD +LF+S+ +G
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGRIVR--GLCVKPNKGDAVLFWSMGLDGN 236
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D S+H C V+KGEKW ATKW+R +
Sbjct: 237 TDSNSIHSGCAVLKGEKWSATKWMRQK 263
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/192 (43%), Positives = 119/192 (61%), Gaps = 9/192 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+++SW PRA + NF + E+C+ +I AK ++ S + + G++ +S RTSSGTF+
Sbjct: 79 EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F Q
Sbjct: 137 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
R+A+ L+YLSDVEEGGET+FP G + + + +C GL VKP+ GD LLF+S+
Sbjct: 195 RIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMT 254
Query: 251 PNGTIDRTSLHG 262
P+ T+D +SLHG
Sbjct: 255 PDATLDPSSLHG 266
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 84/213 (39%), Positives = 124/213 (58%), Gaps = 5/213 (2%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
I ++ +V+SW PR + +F S E+C+ + A A+ RL+ S + ++ G+ V+S RT
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSG F++ E I++ IE +IA + +P +GE VLRYE Q Y H+D F
Sbjct: 130 SSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFN 189
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
+ QR+A+ L+YL+D EGGET FP G K G+ VKP +GD +LF+S
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMK--GISVKPTKGDAVLFWS 247
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ +G D S+HG C V+ GEKW ATKW+R +
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 116/208 (55%), Gaps = 5/208 (2%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I +VLSW PR + + E+C ++ A RL S + ES RTS G
Sbjct: 14 ILLKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGM 73
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F ED+ ++ +E +++ +++P HGE VLRYE G++Y H+D F
Sbjct: 74 FFDRGEDE--VVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNG 131
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGI---FLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
RLA+ L+YL++ E GGET+FP L++GY GL VKPR+GD +LF+SL
Sbjct: 132 GNRLATILMYLAEPEFGGETVFPNVKAPPEQTLEAGYSECATQGLAVKPRKGDAVLFFSL 191
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKW 277
GT+D+ SLHGSCP +KG K+ ATKW
Sbjct: 192 RTEGTLDKGSLHGSCPTLKGFKFAATKW 219
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 119/214 (55%), Gaps = 16/214 (7%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
Q + PRA YF NF + + ++ A +LK S + GE V RTS G FI
Sbjct: 2 QQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGV--VDNIRTSYGMFIR 59
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQ 194
+D ++ IE +I+ T LP H E VLRY GQ Y +HYD+ + + E GP+
Sbjct: 60 RLQDP--VVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW-- 115
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDS------GYDYKKCI--GLKVKPRRGDGLLF 246
RLA+FL+YLSDVEEGGET FP N ++ D G + C + KP+ GD +LF
Sbjct: 116 RLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVLF 174
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
YS +PN T+D ++H CPVIKG KW A W+ D
Sbjct: 175 YSFYPNMTMDPAAMHTGCPVIKGVKWAAPVWMHD 208
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 149 bits (377), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 81/174 (46%), Positives = 105/174 (60%), Gaps = 12/174 (6%)
Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
V+ST G RTSSG F+ DK ++ +IE +IA T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEV 63
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKK 230
GQKY+ H+D F QR+A+ L+YLSDVEEGGET+FP N L + +
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSE 123
Query: 231 CI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
C GL VKP+ GD LLF+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 124 CAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 84/213 (39%), Positives = 124/213 (58%), Gaps = 5/213 (2%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
I ++ +V+SW PR + +F S E+C+ + A A+ RL+ S + ++ G+ V+S RT
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSG F++ E I++ IE +IA + +P +GE VLRYE Q Y H+D F
Sbjct: 130 SSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFN 189
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
+ QR+A+ L+YL+D EGGET FP G K G+ VKP +GD +LF+S
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMK--GISVKPTKGDAVLFWS 247
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ +G D S+HG C V+ GEKW ATKW+R +
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 82/174 (47%), Positives = 103/174 (59%), Gaps = 12/174 (6%)
Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
V+ST G RTSSG F+ DK ++ IE +IA T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEV 63
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKK 230
GQKY+ H+D F QR+A+ L+YLSDVEEGGET+FP N L +
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSD 123
Query: 231 CI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
C GL VKP+ GD LLF+S+ P+ T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 124 CAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 177
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 118/205 (57%), Gaps = 5/205 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YLSD EGGET FP G K GL VKP +G+ +LF+S+ +G
Sbjct: 200 RIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVK--GLSVKPIKGNAVLFWSMGLDGQ 257
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D S+HG C VI GEKW ATKW+R
Sbjct: 258 SDPNSVHGGCEVISGEKWSATKWMR 282
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 116/205 (56%), Gaps = 5/205 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VLSW PR + NF S E+C + A RLK S + G+ ++S RTSSG F+
Sbjct: 76 EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSD--VRTSSGMFL 133
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S E K ++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YL D EGGET FP G K GL VKP +G+ +LF+S+ +G
Sbjct: 194 RIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTK--GLCVKPVKGNAVLFWSMGLDGQ 251
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D S+HG CPV+ GEKW ATKW+R
Sbjct: 252 SDPDSVHGGCPVLAGEKWSATKWMR 276
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA + NF S ++C +I AK R++ S ++ + RTSSG F++ +++
Sbjct: 83 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 141
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
++ IE +IA T +P +GE ++L YE+GQK++ H+D +P + + QR A+ +
Sbjct: 142 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 200
Query: 201 LYLSDVEEGGETMFPFENGI------FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
+YLS V+EGG T+FP + +Y K GL VKP+ GD LLF+S+ P+GT
Sbjct: 201 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 260
Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
+D TSLH S PV+KG+KWV K
Sbjct: 261 LDPTSLHASSPVVKGDKWVGVK 282
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/73 (47%), Positives = 49/73 (67%), Gaps = 6/73 (8%)
Query: 205 DVEEGGETMFPFENGIFLDSGYDYKKCI-----GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++EEGGET+FP N + +KK GL +KP+ GD L F+S+ P+GT+D TS
Sbjct: 11 NIEEGGETVFPAANQCVSSVPW-WKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTS 69
Query: 260 LHGSCPVIKGEKW 272
LHGS PVI+G++W
Sbjct: 70 LHGSYPVIRGDEW 82
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA + NF S ++C +I AK R++ S ++ + RTSSG F++ +++
Sbjct: 74 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 132
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
++ IE +IA T +P +GE ++L YE+GQK++ H+D +P + + QR A+ +
Sbjct: 133 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 191
Query: 201 LYLSDVEEGGETMFPFENGI------FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
+YLS V+EGG T+FP + +Y K GL VKP+ GD LLF+S+ P+GT
Sbjct: 192 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 251
Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
+D TSLH S PV+KG+KWV K
Sbjct: 252 LDPTSLHASSPVVKGDKWVGVK 273
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 6/63 (9%)
Query: 205 DVEEGGETMFPFENGIFLDSGYDYKKC-----IGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++EEGGET+FP N + +KK GL +KP+ GD L F+S+ P+GT+D TS
Sbjct: 11 NIEEGGETVFPAANKCVSSVPW-WKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTS 69
Query: 260 LHG 262
LH
Sbjct: 70 LHA 72
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 85/222 (38%), Positives = 126/222 (56%), Gaps = 8/222 (3%)
Query: 67 DASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
DA I I + ++L+W PR + +F S+E+C + A A+ L+ S + Q G+ ++S
Sbjct: 70 DADILRIGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQS 129
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+S + I+ IE +I+ + +P +GE VLRY+ Q Y H+D
Sbjct: 130 D--VRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDY 187
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
F+ + + QR+A+ L+YLSD EGGET FP F G K GL V P +G+
Sbjct: 188 FSDSFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSGFCRCG--GKSVRGLSVAPVKGN 245
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+LF+S+ +G D S+HG C V+ GEKW ATKW+R + H
Sbjct: 246 AVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRSTH 287
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 80/176 (45%), Positives = 105/176 (59%), Gaps = 16/176 (9%)
Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
V+ST G RTSSG F+ DK ++ +IE +I T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRITDYTFIPVDHGEGLQVLHYEV 63
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDY---- 228
GQKY+ H+D F QR+A+ L++LSDVEEGGET+FP N DS +
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDAN--VNDSSLPWYNEL 121
Query: 229 KKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+C GL VKP+ GD LLF+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 122 SECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 85/201 (42%), Positives = 113/201 (56%), Gaps = 19/201 (9%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 246 FYSLFPNGTIDRTSLHGSCPV 266
F+S+ P+ T+D SLH + V
Sbjct: 271 FWSMKPDATLDPLSLHDTLRV 291
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 118/211 (55%), Gaps = 21/211 (9%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTSS 130
+ LSW+PR + F S ++C ++ AKK G V + TRTSS
Sbjct: 49 KALSWQPRIFVYKGFLSDDECDHLVTLAKK-----------GTMVAHNRSSYYRQTRTSS 97
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
G F+ +D ++ IE +IA T+LP+ + E + RY+ GQKYD H+D F+ +
Sbjct: 98 GMFLRKRQDP--VVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHT 155
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFY 247
+ R A+ L+YLS V++GGET+FP G D + +C GL VKP +GD +LF+
Sbjct: 156 RGGPRYATVLMYLSTVDKGGETVFPKAKGWESQPKDDTFSECAHKGLAVKPVKGDAVLFF 215
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
SL +G D +LHGSCPVI+GEKW A WI
Sbjct: 216 SLHVDGGPDPLTLHGSCPVIQGEKWSAPNWI 246
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 84/196 (42%), Positives = 111/196 (56%), Gaps = 19/196 (9%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 246 FYSLFPNGTIDRTSLH 261
F+S+ P+ T+D SLH
Sbjct: 271 FWSMKPDATLDPLSLH 286
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 115/206 (55%), Gaps = 5/206 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
QVL R + NF + E+C II A+ + S + + RTS GTF++
Sbjct: 64 QVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGV-VETDSGKSKIDNVRTSKGTFLN 122
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
D ++ IE +IA+ T++P +GE VL+YE GQ+Y+ HYD F R
Sbjct: 123 RGHDS--VIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANGGNR 180
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG--LKVKPRRGDGLLFYSLFPNG 253
+ L+YL+DVEEGGET FP D+G ++ +C L KP++G+ +LF+S+ P G
Sbjct: 181 YLTVLMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKPTG 240
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
++R SLH +CPVIKG KW A KW+
Sbjct: 241 ELERRSLHTACPVIKGVKWSAPKWVH 266
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 115/204 (56%), Gaps = 26/204 (12%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA F NF + E+ I+A AK +K S + G +VE RTS GTF+
Sbjct: 35 VSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVEDQ--IRTSYGTFLKRL 92
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ +E ++A T L +H E +LRY IGQKY +HYD+ + S R+
Sbjct: 93 QDP--IVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYDSLD------NDSPRVC 144
Query: 198 SFLLYLSDV--EEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+ LLYLSDV + GGET FP G+ + Y P++GD LLFYSL P+GT
Sbjct: 145 TVLLYLSDVPADGGGETAFP---GVRRQALY-----------PKKGDALLFYSLKPDGTS 190
Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
D SLH CP+I G KW ATKWI
Sbjct: 191 DAYSLHTGCPIISGVKWTATKWIH 214
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 124/230 (53%), Gaps = 18/230 (7%)
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
P TG++ +++ + LS +P L F + E+C I+ A +K S ++L+ +
Sbjct: 255 PSTITGNS--KNVTIETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKG 312
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
RTS TF++A D IL IE + A T +P TH E VLRY + +KYD+H+
Sbjct: 313 RPASDWRTSQSTFVAAMGDP--ILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHH 370
Query: 181 DAFNPAEY--GPQMSQ--------RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
D F+P+ Y P Q R A+ YL+DV GGET FP G D+
Sbjct: 371 DFFDPSSYRSDPGTLQLIENGKKNRYATVFWYLTDVARGGETCFPRHGGA--PPPRDFSM 428
Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE--KWVATKWI 278
C GLKVKP++G ++FYSL +G +D SLHG+CPV+ E KW A KW+
Sbjct: 429 CTGLKVKPQKGKVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 114/205 (55%), Gaps = 8/205 (3%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL R + F + E+C I A+KRL+ S + + G RTS G F
Sbjct: 38 VLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGV-VDTGSGGSVVSDIRTSDGMFFER 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+E +E ++A TM P GE+ VLRY QKYDSH+D F + R
Sbjct: 97 GED--AIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRW 154
Query: 197 ASFLLYLSDVEEGGETMF---PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
A+ LLYL++ EEGGET+F P NGI + G+ L VKP +GD LLF+S+ P G
Sbjct: 155 ATVLLYLTETEEGGETVFPKIPAPNGI--NVGFSECAKYNLAVKPHKGDALLFHSMKPTG 212
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
++ S+HG+CPVI+GEK+ TKWI
Sbjct: 213 ELEERSMHGACPVIRGEKFSMTKWI 237
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/166 (48%), Positives = 105/166 (63%), Gaps = 7/166 (4%)
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE+ +S RTSSG F++ +D I+ +E K+A T LP+ +GEA +L YE GQKY
Sbjct: 9 GESEDSE--VRTSSGMFLTKRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKY 64
Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--G 233
D H+D F + R+A+ L+YLS+V +GGET+FP G D + KC G
Sbjct: 65 DPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQG 124
Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
VKPR+GD LLF++L NGT D SLHGSCPVI+GEKW AT+WI
Sbjct: 125 YAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIH 170
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 98/157 (62%), Gaps = 5/157 (3%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F+ +D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+
Sbjct: 15 VRTSSGMFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHD 72
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGD 242
R+A+ L+YLSDV +GGET+FP G L D + C G VKP +GD
Sbjct: 73 KNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGD 132
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
LLF+SL P+ T D SLHGSCPVI+G+KW ATKWI
Sbjct: 133 ALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIH 169
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 116/194 (59%), Gaps = 21/194 (10%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
+ E+C+ +I AK + S + G++V+++ RTSSGTFI+ DK IL IE
Sbjct: 12 ITTKEECEHLINIAKPSMHKSTVDDETGKSVDNS--ARTSSGTFINRGHDK--ILRNIEQ 67
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
+IA T +P +GE+ N+L YE+GQKY+ H D F +++ + E+G
Sbjct: 68 RIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTD-----EINTKNGG--------EQG 114
Query: 210 GETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 265
GET+FPF G F + + C GL +KP+ GD LLF+S+ P+GT+D S+HG+CP
Sbjct: 115 GETVFPFAEGNFSSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACP 174
Query: 266 VIKGEKWVATKWIR 279
VIKG+KW TKW+R
Sbjct: 175 VIKGDKWSCTKWMR 188
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 81/216 (37%), Positives = 116/216 (53%), Gaps = 25/216 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
LSW PRA + + QC++++ + R+ R+ V+S G RTS
Sbjct: 6 LSWYPRAFALRDALTEAQCEAVLRATRARV-------RRSTVVDSVTGESKVDPIRTSKQ 58
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AFNPA 186
TF++ E+ ++ I ++ TMLP TH E VL Y +G+KYD+H D + +
Sbjct: 59 TFLNRDEE---VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGR 115
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKCIGLKV--KPRRGD 242
E +R+A+ LLYL + E GGET FP I + G + KC +V KPRRGD
Sbjct: 116 ELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTSWSKCAEHRVAMKPRRGD 175
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
GL+F+S+ PNG ID +LH CPV+ G KW AT W+
Sbjct: 176 GLIFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWV 211
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 112/200 (56%), Gaps = 10/200 (5%)
Query: 83 RALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKT 141
R + F + E+C I A+KRL+ S + G +V S RTS G F ED
Sbjct: 44 RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSD--IRTSDGMFFERGED-- 99
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
ILE +E ++A TM P GEA VLRY QKYDSH + F E R A+ L
Sbjct: 100 AILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLT 159
Query: 202 YLSDVEEGGETMF---PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
YL+D EEGGET+F P G+ + G+ L VKPR+GD +LF+S+ NG ++
Sbjct: 160 YLTDTEEGGETVFPKIPAPGGV--NVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLEER 217
Query: 259 SLHGSCPVIKGEKWVATKWI 278
SLHG+CPVIKGEK+ TKWI
Sbjct: 218 SLHGACPVIKGEKFSMTKWI 237
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/216 (37%), Positives = 115/216 (53%), Gaps = 12/216 (5%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +SW PRA + NF ++ +C ++ +R+ S L + RTS G
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRS-LVVDSQTGQSKLDDIRTSYGAAF 551
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM- 192
ED ++ IE +IA T LP HGE +LRY GQKYD+H+D F +P + +
Sbjct: 552 GRGEDP--VIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLV 609
Query: 193 -SQRLASFLLYLSDVEEGGETMFPFENGI-----FLDSGYDYKKCIGLKVKPRRGDGLLF 246
R A+ LLYLS+VE GGET P + I +++ +GL ++PR+GD LLF
Sbjct: 610 DGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENPSPCAAKMGLSIRPRKGDALLF 669
Query: 247 YSLFPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQ 281
Y + G DR +LH SCP +KG KW ATKWI +
Sbjct: 670 YDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSK 705
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 86/207 (41%), Positives = 116/207 (56%), Gaps = 15/207 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA F NF + + ++ A +LK S + +GE V RTS G FI D
Sbjct: 55 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSKGEGV--VDNIRTSFGMFIRRLSDP- 111
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQRLASFL 200
I+ IE +I+ T LP H E VLRY GQ Y +HYD+ +++ GP+ RLA+FL
Sbjct: 112 -IIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVGPKW--RLATFL 168
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCIG--LKVKPRRGDGLLFYSLFPNG 253
+YLSDVEEGGET FP +N ++ D + +C + KP+ GD +LFYS PN
Sbjct: 169 MYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFLPNN 227
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRD 280
T+D ++H CPVIKG KW A W+ D
Sbjct: 228 TMDPAAMHTGCPVIKGIKWAAPVWMHD 254
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 116/214 (54%), Gaps = 16/214 (7%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
Q + PRA YF NF + + ++ A +LK S + + RTS G FI
Sbjct: 20 QQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGK--GEGVVDDIRTSYGMFIR 77
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQ 194
D ++ IE +I+ T LP H E +LRY GQ Y +HYD+ +++ GP+
Sbjct: 78 RLSDP--VVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVGPKW-- 133
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDS------GYDYKKCI--GLKVKPRRGDGLLF 246
RLA+FL+YLSDVEEGGET FP N ++ D G + C + KP+ GD +LF
Sbjct: 134 RLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLF 192
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
YS +PN T+D S+H CPVIKG KW A W+ D
Sbjct: 193 YSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMHD 226
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/150 (48%), Positives = 95/150 (63%), Gaps = 3/150 (2%)
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
+S + K I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+
Sbjct: 10 MLSPPQPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLG 69
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSL 249
QR+A+ L+YL+DV +GGET+FP G L D + GL VKP++GD LLF++L
Sbjct: 70 GQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNL 129
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
N T D SLHGSCPVI+GEKW ATKWI
Sbjct: 130 HVNATADTGSLHGSCPVIEGEKWSATKWIH 159
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 118/207 (57%), Gaps = 5/207 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + NF S ++C + A RL+ S + + G+ V+S RTSSG F+
Sbjct: 83 EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSD--FRTSSGMFL 140
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S E +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 141 SHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 200
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ L+YLS+ EGGET FP G K GL VKP +GD +LF+S+ +G
Sbjct: 201 RIATMLMYLSENIEGGETYFPKAGSGECSCG--GKTVPGLSVKPAKGDAVLFWSMGLDGQ 258
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D S+HG C V+ GEKW ATKW+R +
Sbjct: 259 SDPKSIHGGCEVLSGEKWSATKWMRQK 285
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 87/213 (40%), Positives = 117/213 (54%), Gaps = 21/213 (9%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTS 129
QVL R F NF + E+C I+A AK L+ R G +T G+ RTS
Sbjct: 34 MQVLDAEAR--IFINFLTEEECDHIVALAKPHLE------RSGVVDTATGGSEISDIRTS 85
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY-DAFNPAEY 188
G F+ D T + IE +IAR T+LP +GE VL Y G+KYD ++ D N
Sbjct: 86 KGMFLERGHDDT--VAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYDDYFFDKVNGESN 143
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLF 246
G R A+ L+YL+ VEEGGET+FP D+G + +C L KP +G +LF
Sbjct: 144 G---GNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLF 200
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+S+ P+G ++R SLH +CPV+KGEKW A KWI
Sbjct: 201 HSIKPSGDLERRSLHTACPVVKGEKWSAPKWIH 233
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/201 (37%), Positives = 116/201 (57%), Gaps = 6/201 (2%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
L F S ++C +I A RL+ S + + + ++ RTS G F+ D I+
Sbjct: 1 LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSR-NRTSWGAFLRRDHDN--IV 57
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
IE +I+ T +P+ +GE+ V+RY+ GQK++ H D + E R+ + LLYL+
Sbjct: 58 SGIEDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLT 117
Query: 205 DVEEGGETMFPFE-NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
+VE GGET+FP + D + +C G+ ++PRRGDGLLF+ P+G ID S H
Sbjct: 118 NVENGGETVFPRALANVINDYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSFH 177
Query: 262 GSCPVIKGEKWVATKWIRDQE 282
G CPV+KGEKW+ATK++ + E
Sbjct: 178 GGCPVVKGEKWLATKFLHEHE 198
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 120/214 (56%), Gaps = 14/214 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY----DSHYDAFNPAEYGP 190
++ E K +++ IE +I+ + +P +GE VLRYE Q Y D +D FN G
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGG- 198
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
Q +A+ L+YLSD EGGET FP G K GL VKP +G+ +LF+S+
Sbjct: 199 ---QGIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVK--GLSVKPIKGNAVLFWSMG 253
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+G D S+HG C VI GEKW ATKW+R Q H
Sbjct: 254 LDGQSDPNSVHGGCEVISGEKWSATKWLR-QTTH 286
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 80/217 (36%), Positives = 117/217 (53%), Gaps = 8/217 (3%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
G ++ SI + LS P NF E+C+ I A +KPS ++L + +
Sbjct: 184 GLETLGSIDMKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPDTN 243
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTS+ F+ ++ D +L+ I+ ++ T +P++H E VL+Y+ GQ+Y +H+D +
Sbjct: 244 WRTSTTYFMPSTRDP--LLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDE 301
Query: 186 A---EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRG 241
R+ + YLSDVEEGGET+FP G D+ C GLKVKP G
Sbjct: 302 RTMRNMDGGRKNRMITVFWYLSDVEEGGETIFPRYGG--RTGRVDFSDCTTGLKVKPVEG 359
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+FYSL P+G D SLHG+CPVI G+KW A KW+
Sbjct: 360 KVAMFYSLKPDGQFDDFSLHGACPVITGQKWAANKWV 396
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/210 (38%), Positives = 115/210 (54%), Gaps = 7/210 (3%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSS 130
S+ + +SW PRA + S ++C II A+ + K + L + + V + R +
Sbjct: 50 SLMVERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNK--LRNNK 107
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
+I S D +++ IE +IAR T LP HGE F++++Y GQ Y H D + +
Sbjct: 108 EAYIDGSADD--VIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPR 165
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYS 248
++R+A+ ++YLSDV EGGET+FP Y KC G+ VKP +GD LL Y+
Sbjct: 166 LGNERIATMIIYLSDVVEGGETVFPNSTMQPHVGDAAYSKCAQQGIAVKPVKGDALLLYN 225
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
L NG D SLH CPVI+G KW ATK I
Sbjct: 226 LLENGRNDGESLHQGCPVIRGVKWTATKRI 255
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 122/218 (55%), Gaps = 20/218 (9%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +S PRA + NF + E+ ++ IA A++ ++ S++ + + + T RTSSG ++
Sbjct: 78 IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEV-VNEADGTSKTSDERTSSGGWV 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S + + ++ IE ++A TMLP+ GE V+RYE GQ+Y +H D F+ Q
Sbjct: 137 SGEDSE--VMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG---------------LKVKPR 239
R A+ L+YLSDVEEGGET+FP G L K + L VKPR
Sbjct: 195 RAATVLMYLSDVEEGGETVFP--RGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPR 252
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
RGD LLF+++ NG +D + H CPV++G KW AT+W
Sbjct: 253 RGDALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRW 290
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 124/228 (54%), Gaps = 18/228 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
++GD I + +SW PRA + F S +C +I A +L+ S + + + V+
Sbjct: 29 DSGDVGAPWI--ETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDPI 86
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS I +E T ++ IE +IAR T LP++H E VLRY GQKYD+H+D F
Sbjct: 87 ---RTSYSASIGYNE--TDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWF 141
Query: 184 N-PAEYGPQMSQRLASFLLYLSDVE--EGGETMFPFENGIFLD----SGYDYKKC---IG 233
+ G R+A+ L+YLSD+E GGET P + + G Y +C +G
Sbjct: 142 DETETGGTGGGNRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGRGYSECASKMG 201
Query: 234 LKVKPRRGDGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWIRD 280
+ V+P++GD LLF+ + P G DR +LH SCP G KW ATKWI +
Sbjct: 202 ISVRPKKGDVLLFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIHN 249
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 122/223 (54%), Gaps = 32/223 (14%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
+SW PRA + N + +C ++ A+ R+ R+ V+ST G RTS
Sbjct: 4 ISWHPRAFHLHNIMTDAECDEVLELARTRV-------RRSTVVDSTTGESKVDPIRTSEQ 56
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFN-----VLRYEIGQKYDSHYDAFN-P 185
F++ I+ +IE ++ R TMLP +GE VL+Y GQKYD+H+D
Sbjct: 57 CFLNRGH--FPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELD 114
Query: 186 AEYGPQMS----QRLASFLLYLSDVEE--GGETMFPFENGI--FLDSGYDYKKCI--GLK 235
G Q++ R+A+ LLYLSDV++ GGET FP I D G + +C +
Sbjct: 115 TASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVA 174
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
VKP++GDGLLF+S+ P G ID+ S+H CPV+ G+ W ATKWI
Sbjct: 175 VKPKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWI 216
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 116/219 (52%), Gaps = 18/219 (8%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +SW PR + NF S +C+ I TA +K S + G +V T RTS GTFI
Sbjct: 2 IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTNGSSVLDT--IRTSYGTFI 59
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D ++E + ++A T P + E VLRY GQKY +H D+ S
Sbjct: 60 RRRHDP--VVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLI------DDSP 111
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLD-----SGYDYKKCIGLKV--KPRRGDGLLFY 247
R+A+ LLYL D E GGET FP ++G +LD S + +C V +P++GD L+F+
Sbjct: 112 RMATVLLYLHDTEYGGETAFP-DSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFW 170
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
S+ P+GT D SLH CPV+ G KW AT W+ + D
Sbjct: 171 SIKPDGTHDPLSLHTGCPVVTGVKWTATSWVHSMPYNYD 209
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 88/248 (35%), Positives = 126/248 (50%), Gaps = 29/248 (11%)
Query: 43 RPKLRTLEVVEKENESGLPHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSI 99
RPK V +G P E+ +A S + IP PR + F +F S E+C +
Sbjct: 56 RPKPEPAATVPSPAPAG-PEPESSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDQL 114
Query: 100 IATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP 158
IA + RLK S + + GE E+ RTS G E ++ IE +IA+AT +P
Sbjct: 115 IALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP--LVARIEARIAQATGVP 170
Query: 159 QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETM 213
HGE F VL Y+ G +Y H+D FNP G QR+A+ ++YL+ V GG T
Sbjct: 171 VEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATG 230
Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
FP +GL+V P +G+ + F P+GT+D +LH PV +GEKW+
Sbjct: 231 FP---------------KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWI 275
Query: 274 ATKWIRDQ 281
ATKW+R++
Sbjct: 276 ATKWLRER 283
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 25/198 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C+++I +K ++K S++ + + T RTSSG F+ SE
Sbjct: 42 PLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR-----KTNDIRTSSGAFLEESE--- 93
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I IE +IA +P HGE +L+Y +GQ+Y +HYD F + R+++ ++
Sbjct: 94 -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+ VEEGGET FP + L V P++G + F + + +I++ +LH
Sbjct: 152 YLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQDESINKLTLH 196
Query: 262 GSCPVIKGEKWVATKWIR 279
G PVIKGEKWVAT+W+R
Sbjct: 197 GGAPVIKGEKWVATQWMR 214
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 136 bits (343), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 80/219 (36%), Positives = 116/219 (52%), Gaps = 25/219 (11%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
G QR+A+ ++YL+ V+ GG T FP +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 244
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 136 bits (343), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 80/219 (36%), Positives = 116/219 (52%), Gaps = 25/219 (11%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
G QR+A+ ++YL+ V+ GG T FP +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 244
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 136 bits (343), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 80/219 (36%), Positives = 116/219 (52%), Gaps = 25/219 (11%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
G QR+A+ ++YL+ V+ GG T FP +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 244
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 25/198 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C+++I +K ++K S++ + + T RTSSG F+ SE
Sbjct: 42 PLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR-----KTNDIRTSSGAFLEESE--- 93
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I IE +IA +P HGE +L+Y +GQ+Y +HYD F + R+++ ++
Sbjct: 94 -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+ VEEGGET FP + L V P++G + F + + +I++ +LH
Sbjct: 152 YLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQDESINKLTLH 196
Query: 262 GSCPVIKGEKWVATKWIR 279
G PVIKGEKWVAT+W+R
Sbjct: 197 GGAPVIKGEKWVATQWMR 214
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 73 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188
Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 233
Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GL+V P +G+ + F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 234 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 76 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 135
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 136 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 191
Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 192 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 236
Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GL+V P +G+ + F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 237 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 286
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 64 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 123
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 124 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 179
Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 180 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 224
Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GL+V P +G+ + F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 225 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 274
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 73 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188
Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 233
Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GL+V P +G+ + F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 234 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 117/206 (56%), Gaps = 11/206 (5%)
Query: 83 RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
R +F + E+ I+ +++RL+ S + G + ES RTS G F+ ED
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQ--IRTSFGVFLERGEDP-- 56
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
+++ +E +I+ T++P +GE VLRY+ QKYD+H+D F + R A+ L+Y
Sbjct: 57 VVKGVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMY 116
Query: 203 LSDVEEGGETMFPFENGIFLDSGYD--YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRT 258
L D EEGGET+FP I G + + +C L KP++G +LF+S+ P G ++R
Sbjct: 117 LVDTEEGGETVFP---NIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQH 284
SLH +CPVIKG KW A KWI + Q+
Sbjct: 174 SLHTACPVIKGIKWSAAKWIHVKPQN 199
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 114/209 (54%), Gaps = 8/209 (3%)
Query: 75 FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
QV+S PRA F S +C ++ A+ + S + + S RTS+G+F
Sbjct: 158 IQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGV-VDASNGGSSFSNIRTSTGSF 216
Query: 134 ISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
+ ++ IE +IA T +P HGE VLRY+IGQ+Y SH+D F G
Sbjct: 217 VPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYF--FHEGGM 274
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSL 249
+ R+A+ L+YLSDV++GGET+FP + + + C G+ V P++GD +LF+++
Sbjct: 275 KNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNM 334
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
G +D S H CPV+ GEKW ATKW+
Sbjct: 335 KVGGDLDGGSTHAGCPVVLGEKWTATKWL 363
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 113/207 (54%), Gaps = 23/207 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P + F NF + +C +++ ++ L PS++ Q E K +RTS GT + E
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFE-LKPSRTSGGTHFARGE-- 158
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
T ++ IE +IA +P+ HGE +L Y + +Y HYD F+P + G Q QR
Sbjct: 159 TPLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQR 218
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+ + ++YLSDVE GG T+FP +GL+V+P++G L F + +G +
Sbjct: 219 VGTLIMYLSDVESGGATVFP---------------RVGLEVQPQKGAALFFSYVGEHGKL 263
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
D SLHG PV+ GEKW+ATKW+R E
Sbjct: 264 DLQSLHGGSPVLAGEKWIATKWLRAAE 290
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/219 (36%), Positives = 115/219 (52%), Gaps = 25/219 (11%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 83 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISAR 140
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP
Sbjct: 141 TSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGR 198
Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
G QR+A+ ++YL+ V+ GG T FP +GL+V P +G+
Sbjct: 199 SGEARQLDVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 243
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 244 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 282
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 114/222 (51%), Gaps = 24/222 (10%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 133
+ +S PRA F F + +C +I A ++ S++ GE R+S G +
Sbjct: 68 IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEA--RPDDARSSIGGW 125
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+S +D+ ++ IE + + MLP GE VLRYE GQKYD+H D F+
Sbjct: 126 VSGDDDE--VIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVKNGG 183
Query: 194 QRLASFLLYLSDVEEGGETMFPF----------------ENGIFLDSGYDYKKCIGLKVK 237
QR+A+ L+YLSDVEEGGET+FP +N L S D + L VK
Sbjct: 184 QRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRV---LAVK 240
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
PRRGD LLF++ +G +D + H CPV +G KW T+W R
Sbjct: 241 PRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHR 282
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 125/222 (56%), Gaps = 36/222 (16%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGT------- 126
+V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S T
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVF 139
Query: 127 ----------------------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
RTSSGTF++ D+ I+E IE++I+ T +P +GE
Sbjct: 140 TFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDE--IVEEIENRISDFTFIPPENGEG 197
Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDS 224
VL YE+GQ+Y+ H+D F + QR+A+ L+YLSDV+EGGET+FP G D
Sbjct: 198 LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDV 257
Query: 225 GY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
+ + +C GL V P++ D LLF+S+ P+ ++D +SLHG
Sbjct: 258 PWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHG 299
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 111/205 (54%), Gaps = 24/205 (11%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + F S ++C+ +I TA RLK S+L + RTS G F
Sbjct: 25 VLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNK------VVSDIRTSRGMFFE- 77
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
E+++ + IE +IA+ +P H E VL Y GQ+Y +H+D F P + + R+
Sbjct: 78 -EEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPGSPAAR-NNRI 135
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
++ ++YL+DVEEGGET+FP +G+ +KP+RG L F + N ++
Sbjct: 136 STLIVYLNDVEEGGETVFPL---------------LGIAMKPKRGAALYFEYFYRNQALN 180
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
+LH S PV++GEKWVAT+W+R Q
Sbjct: 181 DLTLHSSVPVVRGEKWVATQWMRRQ 205
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 70/154 (45%), Positives = 96/154 (62%), Gaps = 6/154 (3%)
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F+ +DK I++ IE +IA T +P +GE VL Y +G+KY+ HYD F
Sbjct: 2 FLKRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNG 59
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYS 248
QR+A+ L+YLSDVEEGGET+FP F + D +C GL +KP+ GD LLF+S
Sbjct: 60 GQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWS 119
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ P+ T+D +SLHG CPVI G KW +TKW+ +E
Sbjct: 120 MRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 153
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 87/207 (42%), Positives = 112/207 (54%), Gaps = 21/207 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
PRA + F + +C IA A +L S + GE V S RTS G F ED
Sbjct: 83 PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSA--IRTSDGMFFDRGEDD 140
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEYGPQMSQ 194
+++ +E +I+ T LP +GE VLRY GQKYD+H DAF + A G Q
Sbjct: 141 --VVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGG----Q 194
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 252
R+A+ L+YL+DV++GGET+FP Y C G+ VKPRRGD LLF+S+ +
Sbjct: 195 RVATVLMYLNDVDDGGETVFPETTAKPHVGDERYSACARRGVAVKPRRGDALLFWSM--D 252
Query: 253 GTIDRTSLHGSCPV-IKGEKWVATKWI 278
T R SLHG CPV G KW TKWI
Sbjct: 253 ETFTR-SLHGGCPVGAGGVKWSMTKWI 278
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 66/140 (47%), Positives = 88/140 (62%), Gaps = 3/140 (2%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+A+ L+Y
Sbjct: 12 VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 71
Query: 203 LSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
LS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+ T D S
Sbjct: 72 LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 131
Query: 260 LHGSCPVIKGEKWVATKWIR 279
LHGSCP I+G+KW ATKWI
Sbjct: 132 LHGSCPAIEGQKWSATKWIH 151
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 82/237 (34%), Positives = 118/237 (49%), Gaps = 31/237 (13%)
Query: 50 EVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKP 109
E VE N H GD IP PR + F +F S ++C +IA + RLK
Sbjct: 72 EAVENSNAV---HTADGD-----IPILFAIETPRIVLFQHFLSDQECDELIAIGRNRLKR 123
Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
S + + E+ RTS G E ++ IE +IA+A +P HGE F VL
Sbjct: 124 SPV-VNPDTGEENLISARTSQGGMFQVGEHP--LIAKIEARIAQAVGVPVEHGEGFQVLN 180
Query: 170 YEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDS 224
Y+ G +Y H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 181 YQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP--------- 231
Query: 225 GYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GL+V P +G+ + F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 232 ------KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 66/140 (47%), Positives = 88/140 (62%), Gaps = 3/140 (2%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+A+ L+Y
Sbjct: 6 VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 65
Query: 203 LSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
LS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+ T D S
Sbjct: 66 LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 125
Query: 260 LHGSCPVIKGEKWVATKWIR 279
LHGSCP I+G+KW ATKWI
Sbjct: 126 LHGSCPAIEGQKWSATKWIH 145
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 95/146 (65%), Gaps = 5/146 (3%)
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
A++D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
+A+ L+YLS+VE+GGET+FP G L D+ + G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 95/146 (65%), Gaps = 5/146 (3%)
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
A++D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
+A+ L+YLS+VE+GGET+FP G L D+ + G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 110/212 (51%), Gaps = 33/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTSSGTFIS 135
+PRA+ F N S ++C +IA +K +L LR G T T RTSSGTF
Sbjct: 99 KPRAILFGNVLSHDECDQLIALSKTKL------LRSGVVDHQTGNTKLHEHRTSSGTFFH 152
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGP 190
T + +I+ ++A +P++HGE +L Y++G +Y HYD F P A++
Sbjct: 153 --RGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLA 210
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ QR A+ ++YL+DV+ GGET+FP GL + P +G + F
Sbjct: 211 RGGQRTATLIIYLNDVDGGGETIFPRN---------------GLSIVPAKGSAIYFSYTN 255
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+D S HG PVI+GEKW+ATKW+R E
Sbjct: 256 AENQLDSLSFHGGSPVIEGEKWIATKWVRQNE 287
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 114/219 (52%), Gaps = 25/219 (11%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +I + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
G QR+A+ ++YL+ V GG T FP +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGN 244
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDKTLHAGLPVERGEKWIATKWLRER 283
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 116/217 (53%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C+ +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECEELIELSKNKMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/213 (36%), Positives = 121/213 (56%), Gaps = 16/213 (7%)
Query: 75 FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGT 132
Q++S PRA + F + E+C +I +K R+ S+ + ET + K RTS+G+
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRM--SKSGVVDAETGGTAKSDIRTSTGS 234
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F+ + +++ +E ++A +MLP H EA VLRYE+ Q+Y +HYD F G
Sbjct: 235 FVGIGAND--LMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYF--FHKGGMA 290
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDS-----GYDYKKC--IGLKVKPRRGDGLL 245
+ R+ + L+YL + E GGET+FP + L+ G ++ +C G R+GD L+
Sbjct: 291 NNRIVTILMYLHEPEFGGETVFP-NTEVPLERAEKGWGKNFSECGNRGRAAVVRKGDALI 349
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
F+S+ P G +D S H CPV++GEKW ATKWI
Sbjct: 350 FWSMKPGGELDPGSSHAGCPVVRGEKWTATKWI 382
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 81/213 (38%), Positives = 120/213 (56%), Gaps = 20/213 (9%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
F+ +SW PRA NFAS E+ +I A+ +L+ S + +GE+V RTS G FI
Sbjct: 32 FEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSRGESV--VDNYRTSYGMFI 89
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ ++ +E ++A T TH E VLRY Q+Y +H+D+ + S
Sbjct: 90 RRHHDE--VVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLD------DDSP 141
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY-----DYKKCI--GLKVKPRRGDGLLFY 247
R A+ L+YLSDVE GGET FP N ++D + +C + +KP+RGD ++F+
Sbjct: 142 RTATVLIYLSDVESGGETTFP--NSEWIDPALPKALGPFSECAQGHVAMKPKRGDAIVFH 199
Query: 248 SLFPNG-TIDRTSLHGSCPVIKGEKWVATKWIR 279
SL P+G + D+ +LH +CPVI G K+VA WI
Sbjct: 200 SLNPDGRSHDQHALHTACPVIVGVKYVAIFWIH 232
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 118/230 (51%), Gaps = 28/230 (12%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 67 PGAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 126
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 127 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 182
Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 183 QPHFDYFNPGRGGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 227
Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GL+V P +G+ + F P+G +D +LH PV +GEKW+ATKW+R++
Sbjct: 228 LGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEKWIATKWLRER 277
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 85/248 (34%), Positives = 128/248 (51%), Gaps = 39/248 (15%)
Query: 50 EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
EV + +ESG+P + GD + + +S R PR + F N S E+C +I
Sbjct: 74 EVAVQVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 129
Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
IA A+ R++ S L + E+ RTS+G F E++ ++ L+E +IAR P
Sbjct: 130 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGENE--LISLVEQRIARLLNWPL 186
Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
+GE VL Y G +Y HYD F P E G P + QR+ + ++YL++ GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246
Query: 215 PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVA 274
P +GL+V PRRG+ + F P+ +LHG PV++GEKW+A
Sbjct: 247 P---------------DVGLQVVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIA 289
Query: 275 TKWIRDQE 282
TKW+R++E
Sbjct: 290 TKWLRERE 297
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKNKMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 117/226 (51%), Gaps = 28/226 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + V PR + F N S E+C +IIA A+ R+ S L + E
Sbjct: 92 DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS+G F E TGI+ +E +IAR P HGE VL Y G +Y H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205
Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
P E G P + QR+ + ++YL++ E GG T+FP + L+V P
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVP 250
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
RRG+ + F P+ + +LHG PV+ GEKW+ATKW+R++E H
Sbjct: 251 RRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREFH 294
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 117/226 (51%), Gaps = 28/226 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + V PR + F N S E+C +IIA A+ R+ S L + E
Sbjct: 92 DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS+G F E TGI+ +E +IAR P HGE VL Y G +Y H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205
Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
P E G P + QR+ + ++YL++ E GG T+FP + L+V P
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVP 250
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
RRG+ + F P+ + +LHG PV+ GEKW+ATKW+R++E H
Sbjct: 251 RRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREFH 294
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S + +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 113/199 (56%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
P + N S E+C+S+I +K +K S++ A R+ + + RTSSGTF+ +E
Sbjct: 43 PLIVVLENVLSDEECESLIELSKDSMKRSKIGASREVDNI------RTSSGTFLEENE-- 94
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
+ +IE +++ +P HGE ++L+Y GQ+Y +HYD F + + R+++ +
Sbjct: 95 --TVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEHSRAAE-NNRISTLV 151
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L + P++G + F + + +++ +L
Sbjct: 152 MYLNDVEEGGETFFP---------------KLNLSIAPKKGSAVYFEYFYNDKSLNELTL 196
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PVIKGEKWVAT+W++
Sbjct: 197 HGGAPVIKGEKWVATQWMK 215
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 109/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ ++ R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 82/223 (36%), Positives = 122/223 (54%), Gaps = 17/223 (7%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTK 124
GDA + I + LSW PRA N + ++I+A A+ R+ + + + G++V
Sbjct: 2 GDARV--IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSV--VN 57
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS TF+S ++ ++ + +++ T LP H E VL Y G+KYD+H D
Sbjct: 58 PIRTSKQTFLSRNDP---VVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGE 114
Query: 185 PA-EYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYD--YKKCIGLKV- 236
+ G Q+S+ R+A+ LLYL + EEGGET FP I + + KC +V
Sbjct: 115 EGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVA 174
Query: 237 -KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
KP RGDGL+F+S+ P+GTID +LH CP +G KW AT W+
Sbjct: 175 MKPTRGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWV 217
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 80/232 (34%), Positives = 117/232 (50%), Gaps = 29/232 (12%)
Query: 55 ENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
EN + + H GD IP PR + F +F S +C +IA + RLK S + +
Sbjct: 75 ENSNAV-HTADGD-----IPILFAIETPRIVLFQHFLSDAECDELIAIGRNRLKRSPV-V 127
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
E+ RTS G E ++ IE +IA+A +P HGE F VL Y+ G
Sbjct: 128 NPDTGEENLISARTSQGGMFQVGEHP--LIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGG 185
Query: 175 KYDSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK 229
+Y H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP-------------- 231
Query: 230 KCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GL+V P +G+ + F P+GT+D +LH PV +GEKW+ATKW+R++
Sbjct: 232 -KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 109/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ ++ R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 117/232 (50%), Gaps = 25/232 (10%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
S +++ + LS PR NF SAE+C+ II TA L PS + L+QG+ + +
Sbjct: 14 SSRTVEVKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTV-LKQGDQSNGEEKVKD 72
Query: 129 SSGTFISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
T +A + K I+ I ++ +P ++ E VL+Y Q Y HYD F+P
Sbjct: 73 EVRTSETAWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPK 132
Query: 187 EYGPQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-----------I 232
Y + S RL + YL+ VE+GGET+FPF N S ++ K
Sbjct: 133 MYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPFGN----TSAEEHHKIQSWGPCENAVES 188
Query: 233 GLKVKPRRGDGLLFYSLFPNG----TIDRTSLHGSCPVIKGEKWVATKWIRD 280
+KVKP RG ++FY + P+G +D TSLHG C I GEKW A WIR+
Sbjct: 189 SIKVKPVRGSAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWIRN 240
>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
Length = 1036
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 92/150 (61%), Gaps = 11/150 (7%)
Query: 47 RTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
RTL ++ + GL H + Q LSW PR Y PNFA+ +QC+++I AK +
Sbjct: 776 RTLITLDSSDSQGLVHVLMWND-------QGLSWNPRVFYLPNFATKQQCEAVIDMAKPK 828
Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
LKPS LALR+ E+ ED++G+L IE KIA AT P+ + E+FN
Sbjct: 829 LKPSTLALRK----ETKHFQMQYRSLHQHTDEDESGVLAAIEEKIALATRFPKDYYESFN 884
Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+LRY++GQKYDSHYDAF+ AEYGP +SQR+
Sbjct: 885 ILRYQLGQKYDSHYDAFHSAEYGPLISQRV 914
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S + +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDKLIELSKNNMKRSKVG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 116/217 (53%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 74/194 (38%), Positives = 112/194 (57%), Gaps = 11/194 (5%)
Query: 93 AEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
AE+ I+ +++RL+ + + G+ T RTS G F+ ED+ I++ +E +IA
Sbjct: 7 AEEADHIVKVSERRLE--RSGVVGGDGGSETSNIRTSYGVFLDRGEDE--IVKRVEERIA 62
Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGET 212
T++P +GE VLRY+ QKYD+H+D F + R A+ L+YL D EEGGET
Sbjct: 63 AWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGITNGGNRYATVLMYLVDTEEGGET 122
Query: 213 MFPFENGIFLDSGYD--YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
+FP + G + + +C L KP++G +LF+S+ P G ++R SLH +CPVI+
Sbjct: 123 VFP---NVAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSLHTACPVIR 179
Query: 269 GEKWVATKWIRDQE 282
G KW A KWI E
Sbjct: 180 GIKWSAAKWIHHAE 193
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 109/200 (54%), Gaps = 29/200 (14%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ SE
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 90
Query: 142 GILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASF 199
L L IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++
Sbjct: 91 --LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTL 146
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++YL+DVEEGGET FP + L V PR+G + F + + +++ +
Sbjct: 147 VMYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELT 191
Query: 260 LHGSCPVIKGEKWVATKWIR 279
LHG PV KGEKW+AT+W+R
Sbjct: 192 LHGGAPVTKGEKWIATQWVR 211
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 116/218 (53%), Gaps = 34/218 (15%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 57 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDV 107
Query: 124 KGTRTSSGTFISASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ SE L L IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 108 NDIRTSSGAFLEDSE-----LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 162
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
F AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 163 F--AEHSRSAANNRISTLVMYLNDVEEGGETYFP---------------KLNLSVHPRKG 205
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 206 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 243
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIKRSKIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKWVAT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWVR 211
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 41 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 92 NDIRTSSGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 115/223 (51%), Gaps = 30/223 (13%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +SW PR + NF + +C+ I TA +K S + + G +V T RTS GTFI
Sbjct: 2 IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQNGSSV--TDNIRTSYGTFI 59
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFN------------VLRYEIGQKYDSHYDA 182
D ++E I ++A T P + E VLRY IGQKY +H D+
Sbjct: 60 RRRHDP--VIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDS 117
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY-----DYKKCIGLKV- 236
S R+A+ LLYL D EEGGET FP ++ +L + +C V
Sbjct: 118 LI------DDSPRMATVLLYLHDTEEGGETAFP-DSSSWLTPDLATRMGPFSECAQGHVA 170
Query: 237 -KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+P++GD L+F+S+ P+GT D S+H CPV+KG KW AT W+
Sbjct: 171 FRPKKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWV 213
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 115/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 84/248 (33%), Positives = 126/248 (50%), Gaps = 39/248 (15%)
Query: 50 EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
EV +ESG+P + GD + + +S R PR + F N S E+C +I
Sbjct: 81 EVAVLVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 136
Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
IA A+ R++ S L + E+ RTS+G F E+ ++ L+E +IAR P
Sbjct: 137 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISLVEQRIARLLNWPL 193
Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
+GE VL Y G +Y HYD F P E G P + QR+ + ++YL++ GG T F
Sbjct: 194 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 253
Query: 215 PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVA 274
P +GL++ PRRG+ + F P+ +LHG PV++GEKW+A
Sbjct: 254 P---------------DVGLQIVPRRGNAVFFSYNRPDPAT--KTLHGGAPVLEGEKWIA 296
Query: 275 TKWIRDQE 282
TKW+R++E
Sbjct: 297 TKWLRERE 304
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 109/200 (54%), Gaps = 29/200 (14%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ SE
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 122
Query: 142 GILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASF 199
L L IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++
Sbjct: 123 --LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTL 178
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++YL+DVEEGGET FP + L V PR+G + F + + +++ +
Sbjct: 179 VMYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELT 223
Query: 260 LHGSCPVIKGEKWVATKWIR 279
LHG PV KGEKW+AT+W+R
Sbjct: 224 LHGGAPVTKGEKWIATQWVR 243
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 130 bits (326), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 117/209 (55%), Gaps = 15/209 (7%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
++W+PR + NF + + + +I A ++K S + G++VE RTS GTF+
Sbjct: 4 VAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDN--YRTSYGTFLKRY 61
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D+ I+E IE+++A T +P H E +LRY +GQ+Y H D E G R+A
Sbjct: 62 QDE--IVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDEEAG----VRVA 115
Query: 198 SFLLYLSDVEEGGETMFP---FENGIFLDS-GYDYKKCIGLKV--KPRRGDGLLFYSLFP 251
+ L+YL++ + GGET FP + N + G ++ C V P+RGD LLF+S+ P
Sbjct: 116 TVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWSINP 175
Query: 252 NG-TIDRTSLHGSCPVIKGEKWVATKWIR 279
+G T D + H CPV+ G KW ATKWI
Sbjct: 176 DGNTEDTHASHTGCPVLSGVKWTATKWIH 204
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------QLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + ++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + ++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S + +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVISDEECGELIEMSKNKIKRSTIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + ++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +I+ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + ++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + ++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + ++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSKGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 110/203 (54%), Gaps = 21/203 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSGTFISASED 139
+P L+ F S E+C +I +++RLKPS + + GE E RTS G E+
Sbjct: 108 KPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGE--EKAATGRTSKGMSFYLQEN 165
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLAS 198
+ ++ +E +IA P +GE VL Y IG++Y SH+D F ++ P+ QR+ +
Sbjct: 166 E--FIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGT 223
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL+YL+DV GGET+FP G+ + P++G + F G +DR
Sbjct: 224 FLIYLNDVPAGGETVFP---------------KAGVSIVPKKGSAVYFQYGNSKGEVDRM 268
Query: 259 SLHGSCPVIKGEKWVATKWIRDQ 281
SLH S PV +GEKWVATKWIR +
Sbjct: 269 SLHSSIPVSEGEKWVATKWIRQE 291
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S + +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/220 (35%), Positives = 116/220 (52%), Gaps = 31/220 (14%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +SW PRA + F + +C +I A +L+ S + + ++ + + ++S +
Sbjct: 59 VETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDDIRTSFSASIMY- 117
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-S 193
+T I+ IE +IAR T VLRY GQKYD+H+D F+ E S
Sbjct: 118 ----GETSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGS 162
Query: 194 QRLASFLLYLSDVE--EGGETMFPFENGIFLD------SGYDYKKC---IGLKVKPRRGD 242
R+A+ L+YLSDV+ GGET P LD G Y +C +G+ ++PR+GD
Sbjct: 163 NRMATVLMYLSDVDPAAGGETALPLAEP--LDPHKQSVDGQGYSQCAARMGISIRPRKGD 220
Query: 243 GLLFYSLFPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQ 281
LLF+ + P G I DR +LH SCP G KW ATKWI ++
Sbjct: 221 VLLFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNK 260
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +I+ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S + +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG V KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWVR 211
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S + +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 108/212 (50%), Gaps = 31/212 (14%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
LS +PR P F + E+C +I T+K +L+P E + G R+ G F+
Sbjct: 28 LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
E++ + + I +K+ + + E ++RY G++ +HYD FNP M
Sbjct: 79 GEEEHPVTKNIFNKMKNFVNISDS-CEVMQIIRYNPGEETSAHYDYFNPLTTNGSMKIGL 137
Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
QR+ + L+YL DVEEGGET FP +G+KVKP RGD +LFY+ P
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFPE---------------VGIKVKPIRGDAVLFYNCKP 182
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG +D SLH PV KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVTKGTKWVAIKLINQKSK 214
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/220 (35%), Positives = 108/220 (49%), Gaps = 24/220 (10%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ LSW PRA + NF S E+ + ++ + R+ S + Q V RTS GTFI
Sbjct: 68 IEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSDI---RTSFGTFI 124
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ +LE IE + A + +P H E +LRY GQKY H D G +
Sbjct: 125 PKKYDE--VLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISENGG----K 178
Query: 195 RLASFLLYLSDVEEGGETMFPFENGI-------------FLDSGYDYKKCIGLKVKPRRG 241
R+A+ L++L + EGGET F N + F D GY K G VKP+ G
Sbjct: 179 RIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDCGYRSGK--GFAVKPKVG 236
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LF+S G D S+H SCP + G KW AT WI ++
Sbjct: 237 DAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHER 276
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 75/197 (38%), Positives = 115/197 (58%), Gaps = 31/197 (15%)
Query: 75 FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
+V++ PRA + NF + E+C +I+ AK + S++ R T +
Sbjct: 88 LEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKV--RNALTGLGEESS 145
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
+RTSSGTFI + DK I++ IE +I+ T +PQ +GE V+ YE+GQK++ H+D F
Sbjct: 146 SRTSSGTFIRSGHDK--IVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
QR+A+ L+YLSDV++GGET+FP GI K G+ V+P++GD LL
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFPEAKGI--------KSKKGVSVRPKKGDALL 245
Query: 246 FYSLFPNGTIDRTSLHG 262
F+S+ P+G+ D +S HG
Sbjct: 246 FWSMRPDGSRDPSSKHG 262
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKWVAT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWVR 211
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVISDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 41 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 92 NDIRTSKGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 41 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 92 NDIRTSSGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+ T+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWITTQWVR 227
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 41 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 92 NDIRTSKGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 121/228 (53%), Gaps = 24/228 (10%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
S LP ++I +VL P + F N S E+CQ++I A RL+ S+LA ++
Sbjct: 6 SILPIQSVYSIDNRTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEI 65
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
++ RTSSG F E++ ++ IE +I+ LP H E VL YE GQ++
Sbjct: 66 SSI------RTSSGMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFK 117
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
+H+D F P + + R+++ ++YL+DVEEGG T FP +G+
Sbjct: 118 AHFDFFGP-NHPSSSNNRISTLVVYLNDVEEGGVTTFP---------------NLGIVNV 161
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
P++G + F + + ++ +LH PVI+GEKWVAT+W+R ++ E
Sbjct: 162 PKKGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWMRKKQIRE 209
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 120/225 (53%), Gaps = 30/225 (13%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
+ GD +Q + Q + PR + F N SAE+C+ +IA A+ RL S + R G V +
Sbjct: 78 DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G F E++ I+ +E +IA P GE +LRY G +Y HYD
Sbjct: 135 VD--RTSDGMFFERGENE--IVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDY 190
Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
F+P+E G P + QR+A+ ++YL + E GG T FP +GL+V
Sbjct: 191 FDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEVA 235
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P RG G+ F P+ + RT LHG PV+ GEKWVATKW+R++E
Sbjct: 236 PARGCGVFFSYDRPD-PVTRT-LHGGAPVLAGEKWVATKWLRERE 278
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDRKIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + ++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVIYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQGQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 79/212 (37%), Positives = 111/212 (52%), Gaps = 17/212 (8%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
V+SW+PRA NF + +C I A+ ++ S + G +V RTS GTFI+
Sbjct: 3 VISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+ T ++ +E ++A T P + E VLRY +GQ Y H D+ S R+
Sbjct: 61 YQ--TPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLE------NDSPRM 112
Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
A+ LLYLS+ E GGET FP + + + C+ + KPRRGD LLF+S+
Sbjct: 113 ATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWSVK 172
Query: 251 PNG-TIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+G T D S H CPVI+G KW AT W+ Q
Sbjct: 173 PDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQ 204
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEISKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 83/225 (36%), Positives = 120/225 (53%), Gaps = 30/225 (13%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
+ GD +Q + Q + PR + F N S E+C+ +IA A+ RL S + R G V +
Sbjct: 78 DAGDRQVQVL--QTMR-HPRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLN 134
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G F E++ I+ +E ++A P +GE +LRY G +Y HYD
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190
Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
F+P E G P + QR+A+ ++YL + E+GG T FP +GL+V
Sbjct: 191 FDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFP---------------DVGLEVA 235
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P RG G+ F P+ + RT LHG PV+ GEKWVATKW+R++E
Sbjct: 236 PVRGTGVFFSYDRPD-PVTRT-LHGGAPVLAGEKWVATKWLRERE 278
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/137 (48%), Positives = 87/137 (63%), Gaps = 4/137 (2%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
I+ IE +IA T +P +GE VL Y +GQK++ H+D + R A+FL+Y
Sbjct: 10 IVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGPRKATFLMY 69
Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
LSDVEEGGET+FP N S K G+ VKP+ GD LLF+S+ P+G++D SLHG
Sbjct: 70 LSDVEEGGETVFP--NATAKGSAPSAKS--GISVKPKMGDALLFWSMKPDGSLDPKSLHG 125
Query: 263 SCPVIKGEKWVATKWIR 279
+ PVIKG+KW ATKWI
Sbjct: 126 ASPVIKGDKWSATKWIH 142
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWVR 227
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/248 (33%), Positives = 125/248 (50%), Gaps = 39/248 (15%)
Query: 50 EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
EV + +ESG+P + GD + + +S R PR + F N S E+C +I
Sbjct: 74 EVAVQIHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSNEECDAI 129
Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
IA A+ R++ S L + E+ RTS+G F E+ ++ +E +IAR P
Sbjct: 130 IAAARPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISRVEQRIARLLNWPL 186
Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
+GE VL Y G +Y HYD F P E G P + QR+ + ++YL++ GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246
Query: 215 PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVA 274
P +GL+V PRRG+ + F P +LHG PV++GEKW+A
Sbjct: 247 P---------------DVGLQVVPRRGNAVFFSYNRPEPATK--TLHGGAPVLEGEKWIA 289
Query: 275 TKWIRDQE 282
TKW+R++E
Sbjct: 290 TKWLRERE 297
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S + +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG V KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWVR 211
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 32/218 (14%)
Query: 64 ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
+T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 24 KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 74
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 75 VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 130
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
F AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 131 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKG 173
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 174 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKWVAT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWMR 211
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 112/207 (54%), Gaps = 27/207 (13%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
+PR + F N S E+C +II A+ R+ S +A R G E RTS+G F E+
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 178
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
++ +E +IAR P +GE VL Y G +Y HYD F+PAE G P + Q
Sbjct: 179 P--VVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQ 236
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ ++YL+D E+GG T FP + L+V PRRG+ + F P+ +
Sbjct: 237 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 281
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+LHG PV+ G+KW+ATKW+R++
Sbjct: 282 T--RTLHGGAPVVAGDKWIATKWLRER 306
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 32/218 (14%)
Query: 64 ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
+T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 40 KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRD 90
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 91 VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
F AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKG 189
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 190 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 79/216 (36%), Positives = 119/216 (55%), Gaps = 25/216 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
+SWRPRA + F S +C +I+ AK+ K + + GE+ ES T RTSSG F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103
Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
+D+ ++ IE +IA TMLP +GE+ +LRY G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVK 237
+D + + + R+A+ L+YLS+V+ G +++ P + + + C G VK
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMG-DSLLP-QARLSQPKDETWSDCAEQGFAVK 219
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
P +G +LF+SL PN T+D SLHGSCPVI+GEK V
Sbjct: 220 PAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKVV 255
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 120/225 (53%), Gaps = 30/225 (13%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
+ GD +Q + Q + PR + F N SAE+C+ +IA A+ RL S + R G V +
Sbjct: 78 DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G F E++ I+ +E ++A P +GE +LRY G +Y HYD
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190
Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
F+P E G P + QR+A+ ++YL + E GG T FP +GL+V
Sbjct: 191 FDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEVA 235
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P RG G+ F P+ + RT LHG PV+ GEKWVATKW+R++E
Sbjct: 236 PVRGCGVFFSYDRPD-PVTRT-LHGGAPVLAGEKWVATKWLRERE 278
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 79/234 (33%), Positives = 119/234 (50%), Gaps = 24/234 (10%)
Query: 48 TLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL 107
T EV E + + H D + S + +P L+ S+E+C +I+ ++ RL
Sbjct: 67 TYEVPEIGRKGNILHTSDKDVKVLS-----RNEKPFVLHLDQVLSSEECDELISLSRSRL 121
Query: 108 KPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
+PS L + +G E RTS E++ ++E IE +IA T P +GE +
Sbjct: 122 QPS-LVVDRGSGEERAGSGRTSKSMAFRLKENE--LVERIETRIAELTGYPAENGEGLQI 178
Query: 168 LRYEIGQKYDSHYDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
L Y +G++Y H+D F P + QR+ +FL+YL+DVE+GGET+F
Sbjct: 179 LNYGLGEEYKPHFDFFPPHMADASKGGQRVGTFLIYLNDVEDGGETVF------------ 226
Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
GL P++G + F+ G +DR S+H S PV KGEKW ATKWIR+
Sbjct: 227 ---SKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEKWAATKWIRE 277
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDRSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 74/197 (37%), Positives = 116/197 (58%), Gaps = 31/197 (15%)
Query: 75 FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
+V++ PRA + NF + E+C+ +I+ AK + S++ R T +
Sbjct: 55 LEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKV--RNAITGLGEESS 112
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
+RTSSGTF+ DK I++ IE +I+ T +P+ +GEA V+ YE+GQK++ H+D F
Sbjct: 113 SRTSSGTFLRKGHDK--IVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
QR+A+ L+YLSDV++GGET+FP GI K G+ V+P++GD LL
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFPEAKGI--------KSKKGVSVRPKKGDALL 212
Query: 246 FYSLFPNGTIDRTSLHG 262
F+S+ P+G+ D +S HG
Sbjct: 213 FWSMRPDGSQDPSSKHG 229
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 41 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 92 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 148 --AEHSRSAANNRISTLVIYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDDE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 109/212 (51%), Gaps = 31/212 (14%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
LS PR P F + E+C+ +I T+K +L+P E + G R+ G F+
Sbjct: 28 LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
E+ I + I +K+ + ++ E V+RY G++ SH+D FNP M
Sbjct: 79 GEEDHQITKNIFNKMKSFVNISES-CEVMQVIRYNQGEETSSHFDYFNPLTTNGSMKIGL 137
Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
QR+ + L+YL DVEEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFPE---------------VGIKVKPIKGDAVLFYNCKP 182
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG +D SLH PV+KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVLKGNKWVAIKLINQKSK 214
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 32/218 (14%)
Query: 64 ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
+T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 40 KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 90
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 91 VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
F AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKG 189
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 190 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWMR 227
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 114/218 (52%), Gaps = 32/218 (14%)
Query: 64 ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
+T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 24 KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 74
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 75 VNDIRTSSGAFLDDNE----LTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDY 130
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
F AE+ + R+++ + YL+DVEEGGET FP + L V PR+G
Sbjct: 131 F--AEHSRSAANNRISTLVXYLNDVEEGGETFFP---------------KLNLSVHPRKG 173
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 174 XAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWMR 211
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 41 TEDREIQIISKFE----EPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDV 91
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 92 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 147
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFL----EDN 105
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 106 KLTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWVR 227
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 115/229 (50%), Gaps = 23/229 (10%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+G P D + +P P+ F S ++C +++A ++ RL S + +
Sbjct: 80 AGAPTVYAADGGDRRVPILFRLASPQVQLFQQLLSDDECDALVALSRGRLARSPV-VNPD 138
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E+ RTS G +E ++ IE +IA T +P HGE +L Y+ G +Y
Sbjct: 139 TGDENLIDARTSMGAMFQVAEH--ALIARIEARIAAVTGVPADHGEGLQILNYKPGGEYQ 196
Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
H+D FNP G Q+S QR+A+ ++YL+ E GG T FP +
Sbjct: 197 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RV 241
Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
GL+V P +G+ + F L P+GT+D +LH PV GEKW+ATKW+R++
Sbjct: 242 GLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKWIATKWLRER 290
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 119/228 (52%), Gaps = 24/228 (10%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
S LP ++I +VL P + F N S E+CQ++I A RL+ S+LA ++
Sbjct: 6 SILPIQSVYSIDNRTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEI 65
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
++ RTSSG F E++ ++ IE +I+ LP H E VL YE GQ++
Sbjct: 66 SSI------RTSSGMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFK 117
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
H+D F P + + R+ + ++YL+DVEEGG T FP +G+
Sbjct: 118 PHFDFFGP-NHPSSSNNRICTLVVYLNDVEEGGVTTFP---------------NLGIVNV 161
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
P++G + F + + ++ +LH PVI+GEKWVAT+W+R ++ E
Sbjct: 162 PKKGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWMRKKQIRE 209
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 114/221 (51%), Gaps = 25/221 (11%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
DA + + + PR + F N S E+CQ++I A+ R+ S L ++ E
Sbjct: 82 DAGDRRVEVLMAMANPRVVLFGNLLSPEECQAVIEAARTRMARS-LTVQAASGGEEVNKD 140
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTS G F E++ + +E +IAR P +GE VL Y G +Y HYD F+PA
Sbjct: 141 RTSDGMFFQRGENEA--VARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPA 198
Query: 187 EYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
E G P++ QR+A+ ++YL+D GG T FP + L++ PR+G
Sbjct: 199 EPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFP---------------DVPLEIGPRQG 243
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ + F + +LHG PVI+GEKW+ATKW+R++E
Sbjct: 244 NAVFFS--YGRAHPSSRTLHGGAPVIEGEKWIATKWLRERE 282
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + + RTSSG F+ +E
Sbjct: 77 PLIVVLANVLSDEECDELIELSKNKMERSKIG-----SSRNVNDIRTSSGAFLEENE--- 128
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
IE +I+ T +P HGE ++L Y + Q+Y +HYD F AE+ + R+++ +
Sbjct: 129 -FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYF--AEHSRSAANNRISTLV 185
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 186 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 230
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 231 HGGAPVTKGEKWIATQWMR 249
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 52 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 103
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 104 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 160
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 161 MYLNDVEEGGETFFP---------------KLNLSVNPRKGMAVYFEYFYQDQSLNELTL 205
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 206 HGGAPVTKGEKWIATQWVR 224
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 73/229 (31%), Positives = 115/229 (50%), Gaps = 23/229 (10%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+G P D + +P P+ F + ++C +++A ++ RL S + +
Sbjct: 81 AGTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPD 139
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E+ RTS G +E ++ IE +IA T +P HGE +L Y+ G +Y
Sbjct: 140 TGDENLIDARTSMGAMFQVAEH--ALIARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQ 197
Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
H+D FNP G Q+S QR+A+ ++YL+ E GG T FP +
Sbjct: 198 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RV 242
Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
GL+V P +G+ + F L P+GT+D +LH PV GEKW+ATKW+R++
Sbjct: 243 GLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASGEKWIATKWLRER 291
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 115/221 (52%), Gaps = 32/221 (14%)
Query: 68 ASIQSIPFQV-LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+S +IP + +S +PR P F + E+C+ +I T+K +LKP E + G
Sbjct: 49 SSTDNIPKLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCN---------EISSGV 99
Query: 127 -RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
R+ G F+ E+ + + I +++ L ++ E V+RY G++ +H+D FNP
Sbjct: 100 HRSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLTES-SEVMQVIRYNPGEETSAHFDYFNP 158
Query: 186 AEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
M QR+ + L+YL+DVEEGGET FP + +KVKP +
Sbjct: 159 LTTNGAMKIGLYGQRICTILMYLADVEEGGETSFPE---------------VNVKVKPIK 203
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
GD +LFY+ PNG +D SLH PVIKG KW+A K + +
Sbjct: 204 GDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVNQK 244
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFHQDQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLGNVLSDEECGELIELSKNKLARSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWMR 211
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 65 TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 41 TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 92 NDIRTSKGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
AE+ + R+++ ++YL+DVEEGGET FP + L V PR+G
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ F + + +++ +LHG PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 140
P F N SA +C+++IA A+ RL S + +R G E RTS G F + E++
Sbjct: 90 PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGG--EELNHDRTSHGMFYTRGENE 147
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
++ IE +IAR P +GE VLRY G +Y HYD F+P E G QR
Sbjct: 148 --VIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAAILRRGGQR 205
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF-YSLFPNGT 254
+AS ++YL + EGG T+FP IGLKV+P++G + F Y+L +
Sbjct: 206 VASLIMYLREPGEGGATVFP---------------DIGLKVRPQQGSAVFFSYALAHPAS 250
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ +LHG PV GEKW+ATKW+R++E
Sbjct: 251 L---TLHGGEPVKSGEKWIATKWLRERE 275
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 106/210 (50%), Gaps = 26/210 (12%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + F + ++C+ +I A RL+ S+L + RTS G F
Sbjct: 25 VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNK------VVSEIRTSRGMFFE- 77
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ-R 195
E++ + IE +I+ +P H E VL Y GQ+Y +HYD F P P S R
Sbjct: 78 -EEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN--SPSASNNR 134
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+++ ++YL+DVE GGET+FP + L+VKP RG L F + +
Sbjct: 135 ISTLIIYLNDVEAGGETVFPL---------------LDLEVKPERGSALYFEYFYRQQEL 179
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
+ +LH S PV++GEKWVAT+W+R Q E
Sbjct: 180 NNLTLHSSVPVVRGEKWVATQWMRRQRVRE 209
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 108/210 (51%), Gaps = 31/210 (14%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
+S PR P+F S +C+ +I +K +L+P E + G R+ G F+
Sbjct: 29 MSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCN---------EISSGVHRSGWGLFMKE 79
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
E+ +++ I ++ L + + E V+RY G++ +HYD FNP M
Sbjct: 80 GEEDHDVVKKIFQRMKMLVNLTE-NCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKIGL 138
Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
QR+ + L+YLS+VEEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 139 YGQRVCTILMYLSEVEEGGETSFPE---------------VGVKVKPVKGDAVLFYNCKP 183
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
NG +D SLH PVIKG KWVA K I +
Sbjct: 184 NGEVDPLSLHQGDPVIKGTKWVAIKLINQK 213
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E T
Sbjct: 39 PLIVVLGNVLSDEECGELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNELTT 93
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
I E +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 94 KI----EKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 261 HGSCPVIKGEKWVATKWIR 279
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 109/206 (52%), Gaps = 25/206 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + N SAE+C +II +AK +L S L ++ E RTSSG F + +T
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARS-LTVQTATGGEELNADRTSSGMFFT--RGQT 165
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
+ +E +IAR P +GE VL Y G +Y HYD F+P E G P + QR+
Sbjct: 166 PEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRV 225
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ ++YL++ GG T FP +GL+V P +G + F P+ T
Sbjct: 226 ATLVMYLNEPARGGGTTFP---------------DVGLEVAPVKGSAVFFSYDRPHPTT- 269
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
SLHG PV++GEKWVATKW+R++E
Sbjct: 270 -RSLHGGAPVLEGEKWVATKWLRERE 294
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 111/230 (48%), Gaps = 35/230 (15%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G DA+ + IP + RP+ + F N ++C +I + +L+ Q TV +
Sbjct: 77 GNVIDAADRRIPVLIRCERPQIVVFGNVLDQDECDEMIQRSMHKLE-------QSTTVNA 129
Query: 123 TKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GT RTS GT+ ED ++ IE ++A P +GE VLRY G +Y
Sbjct: 130 ETGTQEVIRHRTSHGTWFQNGED--ALIRRIETRLAALMNCPVENGEGLQVLRYTPGGEY 187
Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
SHYD F P G QR+A+ ++YL+DV GGET+FP
Sbjct: 188 RSHYDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFPEA-------------- 233
Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
G+ V PRRGD + F + +D +LH PV GEKW+ TKW+R++
Sbjct: 234 -GISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKWIMTKWVRER 282
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 111/207 (53%), Gaps = 27/207 (13%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
+PR + F N S E+C +II A+ R+ S +A R G E RTS+G F E+
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 167
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
++ +E +IAR P +GE VL Y G +Y HYD F+P E G P + Q
Sbjct: 168 P--MVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQ 225
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A+ ++YL+D E+GG T FP + L+V PRRG+ + F P+ +
Sbjct: 226 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 270
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+LHG PV+ G+KW+ATKW+R++
Sbjct: 271 T--RTLHGGAPVVAGDKWIATKWLRER 295
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 76/219 (34%), Positives = 115/219 (52%), Gaps = 27/219 (12%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ LSW PRA + NF + ++C+ +IA +K+L+ S + +G+ + RTS GTFI
Sbjct: 91 WTTLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGD-VHSARTSFGTFI 149
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ T L +E ++A + +P H E +LRYE GQ+Y + +
Sbjct: 150 T--RRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGNG-------------EK 194
Query: 195 RLASFLLYLSDVEEGGETMFPFENGI------FLDSGYDYKKC-----IGLKVKPRRGDG 243
R+A+ L++L + E GGET FP + FL S C G V PR+GD
Sbjct: 195 RIATVLMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEGRGFSVIPRKGDA 254
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+LF+S NGT D + H SCP ++G K+ ATKWI ++E
Sbjct: 255 ILFFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKE 293
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 102/205 (49%), Gaps = 23/205 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR L N +C +++A A+ RL+ S + + E+ RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+L+ IE +IA T P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ ++YL+ GG T FP IGL+V P +G+ +LF P+G +D
Sbjct: 218 ATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGALD 262
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
+LH PV GEKW+ATKW+R+
Sbjct: 263 ERTLHAGLPVEAGEKWIATKWLREH 287
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 102/205 (49%), Gaps = 23/205 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR L N +C +++A A+ RL+ S + + E+ RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+L+ IE +IA T P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ ++YL+ GG T FP IGL+V P +G+ +LF P+G +D
Sbjct: 218 ATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGALD 262
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
+LH PV GEKW+ATKW+R+
Sbjct: 263 ERTLHAGLPVEAGEKWIATKWLREH 287
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F N S E+C +IIA A R++ S L + E+ RTS+G F E+
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND- 169
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
++ +E +IAR P +GE VL Y G +Y HYD F P E G P + QR+
Sbjct: 170 -LICRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 228
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
+ ++YL++ GG T FP +GL+V PRRG+ + F P+
Sbjct: 229 GTLVMYLNEPARGGATTFPD---------------VGLQVVPRRGNAVFFSYNRPDPAT- 272
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
+LHG PV++GEKW+ATKW+R++E
Sbjct: 273 -KTLHGGAPVLEGEKWIATKWLRERE 297
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 86/148 (58%), Gaps = 10/148 (6%)
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
K I+ IE KIA T LP+ +GE VLRYE G+KYD H+D F + R+A+
Sbjct: 7 KDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGHRVATV 66
Query: 200 LLYLSDVEEGGETMFP---------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
L+YL+DV +GGET+FP + I D+ D K G VKP+RGD LLF+SL
Sbjct: 67 LMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAK-RGTAVKPKRGDALLFFSLT 125
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
D SLH CPVI+GEKW TKWI
Sbjct: 126 TQAKPDTRSLHAGCPVIEGEKWSVTKWI 153
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 128/276 (46%), Gaps = 53/276 (19%)
Query: 39 VPSIRPKLRTL-------EVVEKENESGLPHGETGDASIQSIP----------------- 74
VP++R +R ++ + E+G PH E +P
Sbjct: 23 VPALRASMRAAGWQDAATDLALAQLEAGFPHVEVARPRRTEMPGPDLEGAPLYIDAGDRR 82
Query: 75 FQVLSW--RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSG 131
QVL PR + F N S E+C+ +IA A+ RL S + R G V + RTS G
Sbjct: 83 VQVLQTMRHPRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVD--RTSEG 140
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-P 190
F E+ I+ +E +IA P GE +LRY G +Y HYD F+P E G P
Sbjct: 141 MFFERGEND--IVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTP 198
Query: 191 QM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+ QR+A+ ++YL + +GG T FP +GL+V P RG G+ F
Sbjct: 199 TILKRGGQRVATLVMYLQEPGQGGATTFP---------------DVGLEVAPVRGTGVFF 243
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
P+ +LHG PV+ GEKWVATKW+R++E
Sbjct: 244 SYEEPDPAT--RTLHGGAPVLAGEKWVATKWLRERE 277
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 74/228 (32%), Positives = 113/228 (49%), Gaps = 28/228 (12%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ LP G+ D +P PR F ++C+++IA ++ RL S + +
Sbjct: 108 NALPAGDGRD-----VPVLFAIESPRIALFQRLLMPDECEALIALSRGRLARSPV-VNPD 161
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E+ RTS G E ++E +E +IA T +P HGE +L Y+ G +Y
Sbjct: 162 TGDENLIDARTSMGAMFQVGEHP--LIERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQ 219
Query: 178 SHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
HYD FNP G QR+A+ ++YL+DV GG T FP +
Sbjct: 220 PHYDFFNPQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP---------------KL 264
Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
GL+V P +G+ + F L +G++D +LH PV +GEKW+ATKW+R+
Sbjct: 265 GLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKWIATKWLRE 312
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 73/233 (31%), Positives = 114/233 (48%), Gaps = 23/233 (9%)
Query: 59 GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGE 118
G P D + +P P+ F + ++C +++A ++ RL S + +
Sbjct: 82 GTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDT 140
Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
E+ RTS G +E ++ IE +IA T +P HGE +L Y+ G +Y
Sbjct: 141 GDENLIDARTSMGAMFQVAEHP--LITRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQP 198
Query: 179 HYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
H+D FNP G Q+S QR+A+ ++YL+ E GG T FP +G
Sbjct: 199 HFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVG 243
Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
L+V P +G+ + F L P+G +D +LH PV GEKW+ATKW+R++ D
Sbjct: 244 LEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFGEKWIATKWLRERPYRSD 296
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 118/241 (48%), Gaps = 32/241 (13%)
Query: 53 EKENESGLPH---------GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
E N++ LP+ G D +S+ + RP + F S E+C+ +I +
Sbjct: 58 EPSNKAKLPYQYETSLVAAGNNIDLFDRSVKVSLRVSRPDIVVVDEFMSGEECEQLIEQS 117
Query: 104 KKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE 163
+++L PS + Q + R+S GT+ E + ++ ++ +I+ P+ HGE
Sbjct: 118 RRKLTPSAIVDPQTGKFQVI-ADRSSEGTYFQRGE--SPLISRLDRRISELMNWPEDHGE 174
Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEN 218
+L Y +G +Y H+D F E G Q QR+A+ ++YL++V EGGET+FP
Sbjct: 175 GIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATLVMYLNEVTEGGETVFPD-- 232
Query: 219 GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+G+ + P+RG F G +D +LHG PV+ GEKW+ATKW+
Sbjct: 233 -------------VGISITPKRGSAAYFAYCNSLGQVDPATLHGGAPVLTGEKWIATKWM 279
Query: 279 R 279
R
Sbjct: 280 R 280
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 110/207 (53%), Gaps = 25/207 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F N S E+C ++IA A+ RL S L + E RTS G F +
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARS-LTVATKTGGEEINDDRTSDGMFFQRGQ-- 170
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 171 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQR 230
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+ + ++YL+ E+GG T FP + L+V P+RG+ + F P+ +
Sbjct: 231 VGTLVMYLNTPEKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 275
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
+LHG PVI GEKW+ATKW+R++E
Sbjct: 276 --RTLHGGAPVIAGEKWIATKWLRERE 300
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 109/207 (52%), Gaps = 15/207 (7%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
V+SW PRA NF + ++ I A+ ++ S + G +V RTS GTFI+
Sbjct: 3 VISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
T ++ +E ++A T +P + E VLRY GQ Y H D+ S RL
Sbjct: 61 Y--ATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLE------NDSPRL 112
Query: 197 ASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKCIGLKV--KPRRGDGLLFYSLFPN 252
A+ LLYLSD E GGET FP + + + +C+ V KPR+GD LLF+S+ P+
Sbjct: 113 ATVLLYLSDPELGGETAFPLAWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFWSVKPD 172
Query: 253 G-TIDRTSLHGSCPVIKGEKWVATKWI 278
G T D S H CPVI+G KW AT W+
Sbjct: 173 GKTEDPLSEHEGCPVIRGVKWTATVWV 199
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/219 (35%), Positives = 108/219 (49%), Gaps = 17/219 (7%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
+ + LS P F ++ I+A + + LKPS + L G + RTS+
Sbjct: 105 DVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTT 164
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
F+S+S K L+ I+ ++A T +P H E VLRYE QKYD H D F P E+
Sbjct: 165 YFLSSS--KHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYF-PVEHHKN 221
Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-IGLKVKPR 239
P + R+ + Y+SDV +GG T+FP G K C GLKV P+
Sbjct: 222 SPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGA--PRPQSMKDCSTGLKVSPK 279
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+ ++FYS+ PNG D SLHG CPV G K+ KW+
Sbjct: 280 KRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 108/198 (54%), Gaps = 24/198 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K R+ S++A + RTSS TFI E++
Sbjct: 39 PLIVVLGNVLSDEECDELIRLSKDRINRSKIA------NANVDNMRTSSSTFIE--ENEN 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ IE +I++ +P +GE +L Y++GQ+Y SH+D F+ + + + R+++ ++
Sbjct: 91 IIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFS-SPHNAINNPRISTLVM 149
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YLSDVE+GGET FP + V P++G + F + + T++ +LH
Sbjct: 150 YLSDVEQGGETYFP---------------KLHFSVSPQKGMAVYFEYFYNDQTLNELTLH 194
Query: 262 GSCPVIKGEKWVATKWIR 279
G PVI G+KW AT+W+R
Sbjct: 195 GGAPVIVGDKWAATQWMR 212
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 117/222 (52%), Gaps = 27/222 (12%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKG 125
DA +++ + +PR + F N SAE+C ++IA A R+ S +A + G E
Sbjct: 97 DAGDRAVNILLAIAKPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGG--EEVND 154
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTS G F E+ +++ IE +IAR P +GE VL Y G +Y HYD F+P
Sbjct: 155 DRTSDGMFFQRGENP--VVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDP 212
Query: 186 AEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
E G P + QR+ + ++YL+ E+GG T FP + ++V P+R
Sbjct: 213 GEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQR 257
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
G+ + F+S RT LHG PVI GEKW+ATKW+R++E
Sbjct: 258 GNAV-FFSYERAHPATRT-LHGGAPVIAGEKWIATKWLRERE 297
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F N S E+CQ+II A+ R+ S L ++ E RTS G F E T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+++ +E +IAR P +GE VL Y G +Y HYD F+P + G QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ ++YL++ +GG T FP + L+V PR+G+ + F P+ +
Sbjct: 219 ATLVIYLNNPRKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
+LHG VI+GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + F N S +C++++ A+ RL S L + E RTS G F + E+
Sbjct: 90 PDLVVFGNLLSDSECEALMEVAQPRLARS-LTVNIKTGGEERNRDRTSQGMFFARGENP- 147
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
+++ +E +IAR P GE VLRY G +Y HYD F+PAE G P + QR+
Sbjct: 148 -LVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRV 206
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ ++YL++ E+GG T+FP IGL+V PRRG + F +P
Sbjct: 207 ATLIMYLNEPEQGGATVFP---------------DIGLQVTPRRGTAVFFS--YPAANPA 249
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
+ HG PV GEKW+ATKW+R++E
Sbjct: 250 SLTRHGGEPVKAGEKWIATKWLRERE 275
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 112/217 (51%), Gaps = 19/217 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ LSW PR NF S E+C+ +I +K+L+ S + ST RTS GTF++
Sbjct: 37 ETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNSDESGAVST--ARTSFGTFVT 94
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+T L+ +E ++A+ + +P H E +LRY GQ+Y +H+D G +R
Sbjct: 95 RRLTET--LQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISENGG----KR 148
Query: 196 LASFLLYLSDVEEGGETMFP------FENGIFLDSGYDYKKC-----IGLKVKPRRGDGL 244
+A+ L++L + GGET FP FL + +C G V P++G+ +
Sbjct: 149 IATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDKLSECGWNDGNGFSVIPKKGEAV 208
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
LF+S NGT D + H SCP + G K+ ATKWI +
Sbjct: 209 LFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHEN 245
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 108/200 (54%), Gaps = 24/200 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I A ++K S++ G T E + RTSS FI +D+
Sbjct: 33 PLIVVLGNVLSDEECDELIQLAGDKVKRSKI----GTTREENE-LRTSSSMFIE--DDEN 85
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ ++ +I+ +P HGE +LRY GQ+Y +H+D F+ + R+++ ++
Sbjct: 86 LIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSD--SKITNNRISTLVM 143
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVE+GGET FP + V PR+G + F + + T++ +LH
Sbjct: 144 YLNDVEQGGETFFPH---------------LKFSVSPRKGMAVYFEYFYSDQTLNDFTLH 188
Query: 262 GSCPVIKGEKWVATKWIRDQ 281
G PV++GEKWVAT+W+R Q
Sbjct: 189 GGAPVVEGEKWVATQWMRKQ 208
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/193 (39%), Positives = 106/193 (54%), Gaps = 15/193 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
PRA F NF + + ++ A +LK S + GE V RTS G FI D
Sbjct: 60 HPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGNDGEGV--VDEIRTSYGMFIRRLADP 117
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQRLASF 199
++ IE +I+ T LP H E VLRY GQ Y +HYD+ + + E GP+ RLA+F
Sbjct: 118 --VITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW--RLATF 173
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLFPN 252
L+YLSDVEEGGET FP +N ++ D + +C + KP+ GD +LFYS +PN
Sbjct: 174 LMYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFYPN 232
Query: 253 GTIDRTSLHGSCP 265
T+D ++H CP
Sbjct: 233 LTMDPAAMHTGCP 245
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 27/198 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C ++I +K +LK S++ + E RTSS TF+ E +
Sbjct: 37 PLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNEN-----DMRTSSSTFMEEGESE- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ +E +I++ +P +GE +L Y+IGQ+Y +H+D F A + R+++ ++
Sbjct: 91 -VVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNAS-----NPRISTLVM 144
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVEEGGET FP + V P++G + F + N ++ +LH
Sbjct: 145 YLNDVEEGGETYFP---------------KLNFSVSPQKGMAVYFEYFYDNQELNDLTLH 189
Query: 262 GSCPVIKGEKWVATKWIR 279
G PVI G+KW AT+W+R
Sbjct: 190 GGAPVIIGDKWAATQWMR 207
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 115/208 (55%), Gaps = 15/208 (7%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
++W+PR + NF + + + +I A ++K S + G++VE + T ++G +
Sbjct: 4 VAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDSYRTLYTAG--VRRY 61
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D ++E IE+++A T + H E +LRY IGQ+Y H D E G R+A
Sbjct: 62 QDD--VVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDDEAG----VRVA 115
Query: 198 SFLLYLSDVEEGGETMFP---FENGIFLDS-GYDYKKCIGLKV--KPRRGDGLLFYSLFP 251
+ L+YL++ E GGET FP + N ++ G ++ C V P+RGD LLF+S+ P
Sbjct: 116 TVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWSIGP 175
Query: 252 NGTI-DRTSLHGSCPVIKGEKWVATKWI 278
+GT D + H CPV+ G KW ATKWI
Sbjct: 176 DGTTEDYHASHTGCPVLSGVKWTATKWI 203
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 109/207 (52%), Gaps = 25/207 (12%)
Query: 80 WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
+ PR + F + S ++C+ +I AK RL S L + E RTSSG F E+
Sbjct: 97 YNPRVVVFGSLLSDQECEQLIGLAKPRLARS-LTVATKTGGEEVNEDRTSSGMFFQRGEN 155
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
+ ++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + Q
Sbjct: 156 E--LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQ 213
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + ++YL + E+GG T FP + L+V P+RG G+ F P+ +
Sbjct: 214 RVGTLVMYLGEPEKGGGTTFP---------------DVHLEVAPKRGHGVFFSYERPHPS 258
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+LHG PV+ GEKW+ATKW+R++
Sbjct: 259 T--RTLHGGAPVLAGEKWIATKWLRER 283
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F N S E+CQ+II A+ R+ S L ++ E RTS G F E T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+++ +E +IAR P +GE VL Y G +Y HYD F+P + G QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ ++YL++ +GG T FP + L+V PR+G+ + F P+ +
Sbjct: 219 ATLVIYLNNPLKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
+LHG VI+GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 25/207 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F N S E+C ++IA A+ R+ S L + E RTS G F +
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARS-LTVATKTGGEEINADRTSDGMFFQRGQ-- 171
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQR 231
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+ + ++YL+ ++GG T FP + L+V P+RG+ + F P+ +
Sbjct: 232 VGTLVMYLNTPDKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 276
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
+LHG PVI G+KW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVIAGDKWIATKWLRERE 301
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 112/224 (50%), Gaps = 23/224 (10%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G T A + + ++ RP+ + F + S E+C +I A+ RLK S + + E
Sbjct: 77 GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 135
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G + ED +E ++H+I+ P HGE +L Y G +Y H+D
Sbjct: 136 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 193
Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
F P + G + QR+A+ ++YLSDVE GGET+FP D+G L V
Sbjct: 194 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP-------DAG--------LAVM 238
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
R+G + F + +D +LHG PV G+KW+ TKW+R++
Sbjct: 239 ARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 282
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 112/224 (50%), Gaps = 23/224 (10%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G T A + + ++ RP+ + F + S E+C +I A+ RLK S + + E
Sbjct: 74 GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 132
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G + ED +E ++H+I+ P HGE +L Y G +Y H+D
Sbjct: 133 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 190
Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
F P + G + QR+A+ ++YLSDVE GGET+FP D+G L V
Sbjct: 191 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP-------DAG--------LAVM 235
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
R+G + F + +D +LHG PV G+KW+ TKW+R++
Sbjct: 236 ARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 279
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 91/155 (58%), Gaps = 2/155 (1%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSG F+S E +++ IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 7 RTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDT 66
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+ QR+A+ L+YLS+ EGGET FP G K GL VKP +GD +LF
Sbjct: 67 FNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCG--GKTVPGLSVKPAKGDAVLF 124
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+S+ +G D S+HG C V+ GEKW ATKW+R +
Sbjct: 125 WSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQK 159
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 92/156 (58%), Gaps = 4/156 (2%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F+S+ E K+ + IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 11 VRTSSGMFLSSEERKSPMA--IEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
+ QR+A+ L+YLSD EGGET FP G K GL VKP +GD +L
Sbjct: 69 TFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGECSCGGKIVK--GLSVKPIKGDAVL 126
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
F+S+ +G D S+HG C V+ GEKW ATKW+R +
Sbjct: 127 FWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQR 162
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 73/198 (36%), Positives = 101/198 (51%), Gaps = 22/198 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR +FP F S E+C + TA+ L+PS L RTS G I + +
Sbjct: 140 PRVEHFPGFLSREECAHVATTAQDLLEPS-FVLDPNSGRPIPHPIRTSDGGAIGPTNENL 198
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ I +IA AT GE+ VLRY GQ+Y H D AE +QR+A+F++
Sbjct: 199 -VVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAE-----NQRIATFIV 252
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+D EGGET FP N ++V+PR GD + F ++ P+GT D +H
Sbjct: 253 YLNDGFEGGETHFPLLN---------------IQVRPRIGDAIRFDTIRPDGTPDPRLVH 297
Query: 262 GSCPVIKGEKWVATKWIR 279
PV G KW+AT+WIR
Sbjct: 298 AGQPVRNGVKWIATRWIR 315
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P N S E+C +I+ +K R+ S++A Q + RTS+ F+ ED +
Sbjct: 33 PFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQENDI------RTSTSVFLP--EDAS 84
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+++ +E +I++ +P HGE +L Y+IGQ+Y +H+D F+P + + R+++ +L
Sbjct: 85 EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLIE--NPRISTLVL 142
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVEEGG+T FP + L V P +G + F + + ++ +LH
Sbjct: 143 YLNDVEEGGDTYFP---------------NLKLSVSPHKGMAVYFEYFYDDPMLNELTLH 187
Query: 262 GSCPVIKGEKWVATKWIR 279
G PV G+KW AT W+R
Sbjct: 188 GGAPVTIGDKWAATMWMR 205
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 81/207 (39%), Positives = 106/207 (51%), Gaps = 25/207 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ +F SA +C+ +I+ A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
T ++ +E +IA T LP +GE +L YE+G + H D A NPA E + Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+DVE GGETMFP G V PRRG L F G
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFEYGNRFGL 261
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +SLH S P+ GEKWVATKWIR +
Sbjct: 262 ADPSSLHTSTPLRVGEKWVATKWIRTR 288
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 105/207 (50%), Gaps = 25/207 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ +F SA +C+ +IA A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
T ++ +E +IA T LP +GE +L YE G + H D A NPA E + Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+DVE GGETMFP G V PRRG L F G
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFEYGNRFGL 261
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +SLH S P+ GEKWVATKWIR +
Sbjct: 262 ADPSSLHTSTPLRAGEKWVATKWIRTR 288
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 122/242 (50%), Gaps = 29/242 (11%)
Query: 50 EVVEKENESGLPHGETGDASIQ----SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKK 105
E++ K + +PH + ++ + + ++P +F S E+C ++I+ A +
Sbjct: 43 ELISKSSNQKIPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQ 102
Query: 106 RLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
+LK S++ G VE + T TS+G + I++ IE +IA P HGE
Sbjct: 103 KLKASRVVDPEDGSFVEHSARTSTSTGY----HRGEIDIIKTIEARIADLINWPVDHGEG 158
Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENG 219
VLRYE G +Y H+D F+PA+ ++ QR+ +FL+YLS+V+ GG T FP
Sbjct: 159 LQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFP---- 214
Query: 220 IFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ +++P +G L F + I+ +LH PV +G K++ATKW+R
Sbjct: 215 -----------NLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKYLATKWLR 263
Query: 280 DQ 281
++
Sbjct: 264 EK 265
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 110/202 (54%), Gaps = 25/202 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
P L N S E+C +I +K +++ S++ A R+ ++ RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAAREVNSI------RTSSGMFFEESENE 92
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
++ IE ++++ + E VL+Y Q+Y +H+D F A + + R+++ +
Sbjct: 93 --LVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP +GL V P +G + F + + ++ +L
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 261 HGSCPVIKGEKWVATKWIRDQE 282
HG PVIKGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/122 (50%), Positives = 79/122 (64%), Gaps = 3/122 (2%)
Query: 161 HGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENG 219
+GE+ +L YE G+KY+ HYD F+ R+A+ L+YLSDV +GGET+FP E+
Sbjct: 8 NGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFPNAESK 67
Query: 220 IFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
+ + +C G VKPR+GD LLF+SL N T D SLHGSCPVI+GEKW ATKW
Sbjct: 68 LSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKWSATKW 127
Query: 278 IR 279
I
Sbjct: 128 IH 129
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 109/207 (52%), Gaps = 25/207 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F N S E+C ++IA A R+ S L + E RTS G F +
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARS-LTVATKTGGEEINDDRTSDGMFFQRGQ-- 171
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 172 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQR 231
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+ + ++YL+ E+GG T FP + ++V P+RG+ + F P+ +
Sbjct: 232 VGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQRGNAVFFSYERPHPST 276
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
+LHG PV+ GEKW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVLAGEKWIATKWLRERE 301
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 105/198 (53%), Gaps = 24/198 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I+ +K R++ S+++ + S RTSS F +E+
Sbjct: 44 PLIVLLGNVLSEEECDQLISLSKDRIERSKISNK------SVHDLRTSSSMFFDDAEN-- 95
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ +E ++++ +P HGE +L Y IGQ+Y +HYD F+ + R+++ ++
Sbjct: 96 DVVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKVN-NPRISTLVM 154
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVE GGET FP + V P++G + F + + T++ +LH
Sbjct: 155 YLNDVEAGGETYFP---------------KLNFYVAPKKGMAVYFEYFYNDTTLNELTLH 199
Query: 262 GSCPVIKGEKWVATKWIR 279
G PV+ G+KW AT+W+R
Sbjct: 200 GGAPVVIGDKWAATQWMR 217
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 108/198 (54%), Gaps = 23/198 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K R++ S++A ++E + RTSS TF E++
Sbjct: 38 PLIVILGNVLSDEECDQLIQQSKDRMQRSKVA----NSLEVDE-LRTSSSTFFHEGENE- 91
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ IE +I++ +P HGE +L Y+IGQ+Y +H+D F+ + R+++ ++
Sbjct: 92 -IVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAAS-NPRISTLVM 149
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVE+GGET FP + V P++G + F + + ++ +LH
Sbjct: 150 YLNDVEQGGETYFP---------------KLNFSVSPQKGMAVYFEYFYNDQNLNDLTLH 194
Query: 262 GSCPVIKGEKWVATKWIR 279
G PV+ G+KW AT+W+R
Sbjct: 195 GGAPVVMGDKWAATQWMR 212
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 110/202 (54%), Gaps = 25/202 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
P L N S E+C +I +K +++ S++ A R+ ++ RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIRLSKDKMQRSKIGAAREVNSI------RTSSGMFFDESENE 92
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
++ IE ++++ + E +L+Y Q+Y +H+D F A + + R+++ +
Sbjct: 93 --LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DVEEGGET FP +GL V P +G + F + + ++ +L
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 261 HGSCPVIKGEKWVATKWIRDQE 282
HG PVIKGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 111/225 (49%), Gaps = 23/225 (10%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP AS +S P S +P + +C +I ++ ++ S + +
Sbjct: 72 LPQARVLTASDRSAPKVFSSEQPVIALVADVLDDTECDRLIEIGREHVQRSSV-VDPDSG 130
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
E T R S G F++AS D ++E I+ +IA P +GE ++LRY +G +Y H
Sbjct: 131 KEITIEERRSEGAFVNASTD--ALVETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPH 188
Query: 180 YDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
YD F + G + QR+A+ +LYL++VE+GG+T FP IGL
Sbjct: 189 YDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQGGDTTFP---------------DIGL 233
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ PRRG L F + G D +LH PV KGEKW+ATKWIR
Sbjct: 234 AIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKWIATKWIR 278
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 106/201 (52%), Gaps = 23/201 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P L N S E+C +I +K +++ S++ + RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAER-----EVNSIRTSSGMFFEESENE- 92
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ IE ++++ + E +L+Y Q+Y +H+D F A + + R+++ ++
Sbjct: 93 -LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLVM 150
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVEEGGET FP +GL + P +G + F + + ++ +LH
Sbjct: 151 YLNDVEEGGETYFP---------------KLGLSISPTKGMAVYFEYFYSDAELNDRTLH 195
Query: 262 GSCPVIKGEKWVATKWIRDQE 282
G PVIKGEKWVAT+W+R Q+
Sbjct: 196 GGAPVIKGEKWVATQWMRKQK 216
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 104/198 (52%), Gaps = 23/198 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C+ +I ++ +LK S++ + RTSS F E++
Sbjct: 39 PLIVILGNVLSDEECEGLIRMSEDKLKRSKIG-----NTRTVDDIRTSSSMFFEEGENE- 92
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ IE ++++ +P HGE +L Y IGQ+Y +H+D F+ + + R+++ ++
Sbjct: 93 -LVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSR-AASNPRISTLVM 150
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVEEGGET FP + V P++G + F + N ++ +LH
Sbjct: 151 YLNDVEEGGETYFP---------------KLNFSVNPQKGSAVYFEYFYDNQDLNDLTLH 195
Query: 262 GSCPVIKGEKWVATKWIR 279
G PVIKG KW AT+W+R
Sbjct: 196 GGAPVIKGSKWAATQWMR 213
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K+RL+ S++ GE S RTSSG F +E
Sbjct: 36 PLIVILGNVLSNEECDELIEHSKERLQRSKI----GEE-RSVNQIRTSSGVFCEENE--- 87
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+ IE +I++ +P HG+ VL Y GQ+Y H+D F + R+++ ++
Sbjct: 88 -TVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRASA-NNRISTLVM 145
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVEEGGET FP N L V P +G + F + N ++ +LH
Sbjct: 146 YLNDVEEGGETTFPMLN---------------LSVFPSKGMAVYFEYFYSNHELNERTLH 190
Query: 262 GSCPVIKGEKWVATKWIRDQ 281
PV KGEKWVAT W+R Q
Sbjct: 191 AGAPVRKGEKWVATMWMRRQ 210
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 112/224 (50%), Gaps = 28/224 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD S++ + + PR + F N S E+C+++IA A R+ S L + E
Sbjct: 116 DCGDRSVEVL---LTMAHPRVVVFGNLLSPEECEALIAAAAPRMARS-LTVATQTGGEEV 171
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F E +++ IE +IA P +GE VL Y G +Y HYD F
Sbjct: 172 NDDRTSHGMFFQRGESP--LVQRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYF 229
Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
+PAE G P + QR+ + ++YL+ E+GG T FP ++V P
Sbjct: 230 DPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFPDAQ---------------IEVAP 274
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+RG+ F P T +LHG PV+ G+KW+ATKW+R++E
Sbjct: 275 QRGNAAFFSYERP--TPSTRTLHGGAPVLAGDKWIATKWLRERE 316
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S +C +I +++RL+ S++ GE S RTSSG F +E T
Sbjct: 36 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-RSVNSIRTSSGVFCEQTETIT 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I E +I++ +P HG+ VLRY GQ+Y HYD F + R+++ ++
Sbjct: 91 RI----EKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 145
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVE+GGET+FP + L V P +G + F + N ++ +LH
Sbjct: 146 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYRNQEVNEFTLH 190
Query: 262 GSCPVIKGEKWVATKWIRDQ 281
VI GEKWVAT W+R Q
Sbjct: 191 AGAQVIHGEKWVATMWMRRQ 210
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 90/147 (61%), Gaps = 7/147 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+++SW PRA + NF E+C+ +I AK ++ S + + G++ +S RTSSGTF+
Sbjct: 79 EIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F Q
Sbjct: 135 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIF 221
R+A+ L+YLSDVEEGGET+FP G +
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNY 219
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 101/207 (48%), Gaps = 32/207 (15%)
Query: 83 RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT------RTSSGTFISA 136
R F NFASA++C + +K+L V T G R S+ ++
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKL---------SRAVAWTDGAFRPVEFRISTAAWLQP 355
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
D ++ + +IA AT L EA V Y IG Y++HYD E R+
Sbjct: 356 DHDD--VVTNLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRI 413
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+F++YL+ VE+GG T FP +G V+P GD + +Y+L P+G D
Sbjct: 414 ATFMIYLNQVEQGGYTAFPR---------------LGAAVEPGHGDAVFWYNLLPDGESD 458
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+LHG+CPV++G KWVA KWI +++
Sbjct: 459 NNTLHGACPVLQGSKWVANKWIHEKKN 485
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 99/194 (51%), Gaps = 25/194 (12%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N S E+C+ +I +K ++ S++ + RTSS TF+ + + IE
Sbjct: 41 NVVSEEECEELIFLSKNKMNRSKIGSQH-----EVSDIRTSSSTFLPEDD----LTNRIE 91
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
++A+ +P HGE ++L Y+ GQ+Y +HYD F + R+++ +LYL+DVEE
Sbjct: 92 KRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLNDVEE 150
Query: 209 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
GGET FP N L + P +G + F + + I+ +LHG PV
Sbjct: 151 GGETYFPHMN---------------LSISPHKGMAVYFEYFYSDPLINERTLHGGSPVTS 195
Query: 269 GEKWVATKWIRDQE 282
GEKW AT W+R ++
Sbjct: 196 GEKWAATMWVRRKQ 209
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 100/207 (48%), Gaps = 25/207 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ F SA +C+ +I A+ RL S + G + G R+S G F E
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNI--VAGHRSSDGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQ 194
T ++ IE +IA T P +GE +L YE G + H D P AE + Q
Sbjct: 158 -TPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+DVE GGET+FP +G V PRRG F +G
Sbjct: 217 RVGTLLMYLNDVESGGETLFP---------------QVGCSVVPRRGQAFYFEYGNGSGR 261
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D SLH S P+ G+KWVATKWIR +
Sbjct: 262 SDPASLHASSPIGSGDKWVATKWIRTR 288
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/215 (34%), Positives = 106/215 (49%), Gaps = 18/215 (8%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LS P+A + F SAE+C +I LK S + + +T RTS GTF+
Sbjct: 4 LSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDT-GVLDDVRTSFGTFLPKK 62
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
D +L IE ++ + + + E +L+Y GQ+Y H D P +R+A
Sbjct: 63 YDD--VLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGLT----SPNGGRRIA 116
Query: 198 SFLLYLSDVEEGGETMFPFENGI---------FLDSGYD--YKKCIGLKVKPRRGDGLLF 246
+ L++L + E+GGET FP + D D ++ GL VKPRRGD +LF
Sbjct: 117 TVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDELSDCAWRDGRGLAVKPRRGDAVLF 176
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+S NG D S H SCP + G KW ATKWI ++
Sbjct: 177 FSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEK 211
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ N S E+C+++I +K ++ S++ + RTSS F+ E
Sbjct: 34 PKIAILGNVVSEEECEALIRLSKDKVNRSKIG-----SDHDVSDIRTSSSAFLPDDE--- 85
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+ IE ++A+ +P HGE ++L Y+ GQ+Y +H+D F + + R+++ +L
Sbjct: 86 -LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAK-NPRISTLVL 143
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVEEGGET FP N L V P +G + F + + I+ +LH
Sbjct: 144 YLNDVEEGGETYFPEMN---------------LTVSPHKGMAVYFEYFYNDPAINERTLH 188
Query: 262 GSCPVIKGEKWVATKWIRDQE 282
G PV GEKW AT W+R Q+
Sbjct: 189 GGSPVTAGEKWAATMWVRRQQ 209
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/187 (36%), Positives = 108/187 (57%), Gaps = 7/187 (3%)
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVEST 123
D + + +V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S
Sbjct: 50 AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD 109
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F
Sbjct: 110 --VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYF 167
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE-NGIFLDSGYDYKKCIGLKVKPRRGD 242
+ + QR+A+ L+YL+D EGGET FP +G + G + GL VKP +GD
Sbjct: 168 SDTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECICGG---RLVRGLCVKPNKGD 224
Query: 243 GLLFYSL 249
+LF+S+
Sbjct: 225 AVLFWSM 231
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 107/219 (48%), Gaps = 23/219 (10%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
A + + +V RP+ + F + S ++C +I A+ RLK S + E R
Sbjct: 102 AHDRDVAVRVRFERPQVIVFDDVLSRDECDELIERARHRLKRS-TTVNPESGREDVIQLR 160
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G + ED +E ++ +I+ P HGE +L Y G +Y H+D F P++
Sbjct: 161 TSEGFWFQRCED--AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQ 218
Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
G + QR+A+ ++YLSDV GGET+FP GL V R+G
Sbjct: 219 SGSVLHTSRGGQRVATLIVYLSDVAGGGETVFP---------------NAGLAVMARQGG 263
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ F L + +D +LHG PV GEKW+ TKW+R++
Sbjct: 264 AIYFRYLNGHRQLDPLTLHGGAPVTNGEKWIMTKWMRER 302
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S +C +I +++RL+ S++ GE S RTSSG F +E T
Sbjct: 41 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-GSVNSIRTSSGVFCEQTETIT 95
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I E +I++ +P HG+ VLRY GQ+Y HYD F + R+++ ++
Sbjct: 96 RI----EKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 150
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVE+GGET+FP + L V P +G + F + N ++ +LH
Sbjct: 151 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYSNQELNDFTLH 195
Query: 262 GSCPVIKGEKWVATKWIRDQ 281
VI GEKWVAT W+R Q
Sbjct: 196 AGTQVIHGEKWVATMWMRRQ 215
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 107/212 (50%), Gaps = 21/212 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++ P + F + S ++ + A+ LK + + + G+ V ++ RTS G
Sbjct: 308 LKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTVHV-NGKYV--SRRVRTSKGA 364
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQ 191
++ D + IE ++ T L EA+N++ Y +G Y +HYD FN + +
Sbjct: 365 WLE--RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSE 422
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A+ L YLSDVE+GG T+FP + L V P RG L +Y+L
Sbjct: 423 TGDRIATVLFYLSDVEQGGATVFP---------------NLKLAVSPERGMALFWYNLLD 467
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NGT D +LHG CPV+ G KWV T WI ++ Q
Sbjct: 468 NGTGDTRTLHGGCPVLVGSKWVMTLWIHERAQ 499
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 110/225 (48%), Gaps = 17/225 (7%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
+ + LS P F ++ I+ + + LKPS + L G + RTS+
Sbjct: 198 DVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTT 257
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
F+ + D ++ I+ +++ T +P H E VLRYE QKYD H D F P E+
Sbjct: 258 YFLPS--DAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYF-PVEHHKN 314
Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPR 239
P + R+ + Y+SDV +GG T+FP G + K C GL V P+
Sbjct: 315 APHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGAPRPTSM--KDCTTGLNVPPK 372
Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+ ++FYS+ PNG D SLHG CPV +G K+ KW+ ++ ++
Sbjct: 373 KRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWVWNKARY 417
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 103/210 (49%), Gaps = 37/210 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++A A+ RL S ETV+++ G RTS G F
Sbjct: 92 PRVVVFGGLLSDEECDELVALARPRLARS-------ETVDNSTGGSEVNAARTSDGMFFE 144
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----- 190
E ++E IE +IA P GE VLRY G +Y H+D F+PA G
Sbjct: 145 RGEKP--LIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILR 202
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ QR+ + ++YL+ GG T FP +GL+V+P +G+ + F
Sbjct: 203 RGGQRVGTVVMYLNTPAGGGATTFP---------------EVGLEVQPVKGNAVFFSYER 247
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
P + +LHG PV+ GEKWVATKW+R+
Sbjct: 248 PLAST--RTLHGGAPVLDGEKWVATKWMRE 275
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 140/281 (49%), Gaps = 35/281 (12%)
Query: 15 PTALLLCSFFFLAGFYGS--TFLSRDVPSIRPKLRTLEVVEKENESGLPH--GETGDASI 70
P A +LC F L G TF++ V S R ++ LEV ++ SG+ + G + +
Sbjct: 15 PPAAVLCGIFVLGIGSGVILTFIAAQVVSRRSEI--LEVKPQKLRSGMLNAPGTSLSPAD 72
Query: 71 QSIPFQVLSWR----PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTK 124
+ F + + PR N + E+C+S+ + L + G E VES+
Sbjct: 73 KKNEFWIETISDLPGPRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESS- 131
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
TRT++ ++ + ++ +E+ +A+ T +GE +L Y+ Q++ H+D F+
Sbjct: 132 -TRTNTAAWLEYHQGP--VVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFD 188
Query: 185 PAEYGPQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
PA P+ RLA+ ++YL + EEGGET D+ K I KVKP
Sbjct: 189 PATDPPENFEPGGNRLATAIIYLQNAEEGGET--------------DFMK-IDTKVKPEA 233
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
G +LFY L P+G++D+ ++H P GEKWVATKWI ++
Sbjct: 234 GSAVLFYDLKPDGSVDKLTIHSGNPPKGGEKWVATKWIHER 274
>gi|90022913|ref|YP_528740.1| hypothetical protein Sde_3273 [Saccharophagus degradans 2-40]
gi|89952513|gb|ABD82528.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 478
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 123/262 (46%), Gaps = 41/262 (15%)
Query: 39 VPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPN--------- 89
V + RP + +++ + P + ++Q+ PR ++ PN
Sbjct: 43 VMNFRPTVPVDQLINPFKRAASPQQTVNNQNLQTAANHGAPTDPRKIFIPNALKLNSDKL 102
Query: 90 -------FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
F + E+C+ IIA + +L+PS+L+ + ES K RTS + +D
Sbjct: 103 EMYALGEFLTTEECERIIANIRSKLRPSELS-----SQESDKTYRTSRTCDLGTIDDP-- 155
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYGPQMSQRLASF 199
+ ++ +I + + ++ E YE+GQ++ +H D F E+G M QR +
Sbjct: 156 FIHYVDSRICKLVGIDPSYSEVIQGQLYEVGQEFKAHTDYFEIKEMPEHGAVMGQRTYTV 215
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++YL+DVEEGGET FP +G +KPR G L++ SL NG + S
Sbjct: 216 MIYLNDVEEGGETDFPAADG---------------AIKPRAGLALIWNSLQSNGAPNPHS 260
Query: 260 LHGSCPVIKGEKWVATKWIRDQ 281
+H + PV+KG K V TKW R Q
Sbjct: 261 MHQAYPVLKGHKAVITKWFRSQ 282
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 114/225 (50%), Gaps = 28/225 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + + PR + F N +AE+C ++IA A++++K S + +
Sbjct: 81 QAGDRQVQVLASLL---HPRVIVFGNLLAAEECDALIALARRQIKRSPV-FDPDTGQDQQ 136
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F + + +E +IA P +GE VLRY G +Y+ HYD F
Sbjct: 137 HQARTSEGMFFGRGANP--LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYF 194
Query: 184 NPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
+PA G +++ QR+AS ++YL+ +GG T FP + L+V P
Sbjct: 195 DPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAH---------------LEVAP 239
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G+ + F P+ +LHG PV++GEKWVATKW+R++
Sbjct: 240 IKGNAVYFSYDRPHPMTG--TLHGGAPVVEGEKWVATKWLRERRH 282
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 101/205 (49%), Gaps = 23/205 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + N S ++C +I A ++ R S + + +RTS I E T
Sbjct: 92 PRIVVLGNVLSDDECDAIAAMSRTRFARST-TIDNASGINRFDDSRTSESAHIQRGE--T 148
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQRL 196
++ I+ ++A + P HGE + +Y+ G +Y H+D F+PA G + QRL
Sbjct: 149 ELIARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRL 208
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ +LYL+DVEEGG T FP IGL V P++G L F + P G D
Sbjct: 209 ATIILYLTDVEEGGGTSFP---------------GIGLDVHPQKGGALFFRNTTPYGVPD 253
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
R + H PV KG K +A KW+R++
Sbjct: 254 RKTQHAGLPVEKGTKIIANKWLREK 278
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 100/200 (50%), Gaps = 25/200 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P L N S +C +I A R++ +++ + RTSS F SE++
Sbjct: 32 PLILILDNVLSWAECDLLIDLASARMQRAKIG-----SSHDVSEVRTSSSMFFEESENEC 86
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+ +E ++A +P +H E VLRY+ G++Y H+D F G M+ R+++ ++
Sbjct: 87 --IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQ---GSSMNNRISTLVM 141
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVEEGGET FP + V P++G + F + + ++ +LH
Sbjct: 142 YLNDVEEGGETYFP---------------SLHFSVTPKKGSAVYFEYFYNDTRLNELTLH 186
Query: 262 GSCPVIKGEKWVATKWIRDQ 281
PV GEKWVAT+W+R Q
Sbjct: 187 AGHPVEAGEKWVATQWMRRQ 206
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 77/205 (37%), Positives = 104/205 (50%), Gaps = 25/205 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A++ NF SA++C+ +IA A+ RL S + G V +T R+S G F E
Sbjct: 101 RPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATH--RSSHGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
T ++ IE +IA T P +GE +L YE G + H D N A E + Q
Sbjct: 158 -TPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL DVE GGET+FP +G + P+RG L F G
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP---------------QVGWSIVPQRGHALYFEYGNRYGM 261
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D +SLH S P+ G+KWVATKWIR
Sbjct: 262 CDPSSLHASTPLRTGDKWVATKWIR 286
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 75/213 (35%), Positives = 109/213 (51%), Gaps = 24/213 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPS----QLALRQGETVESTKGTRTSSGTFISAS 137
PR Y NF SA++ ++A + + PS A QG + + TRTS F +
Sbjct: 2 PRVFYVHNFLSADEADELVAFS---MAPSTGGTHKAWNQGGS-NAKLTTRTSMNAFDITT 57
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQM 192
+ I + ++ R + + +LRYE+GQ Y +H+D F N + P
Sbjct: 58 KLSFRI-KRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPSK 116
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG-----LKVKPRRGDGLL 245
S R A+ LYLSDVE GG+T+ E +D+G K + L V PRRGD +L
Sbjct: 117 GGSNRFATIFLYLSDVEVGGQTL---EKDAGVDAGSWEDKLVDQCYSKLAVPPRRGDAIL 173
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
FYS +P+G +D SLHG+CP++KG KW A W+
Sbjct: 174 FYSQYPDGHLDPNSLHGACPILKGTKWGANLWV 206
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 100/205 (48%), Gaps = 25/205 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ F + +C +IA A+ RL S + G V + G R+S GTF +E
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAA--GHRSSDGTFFRLAE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQ 194
T ++ +E +IA T L +GE +LRY+ G + H D E + Q
Sbjct: 158 -TPLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL+DVE GGET+FP +G V PRRG L F G
Sbjct: 217 RVGTLLMYLNDVEGGGETVFP---------------QVGCSVVPRRGQALYFEYCNRAGV 261
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
D SLH S P+ GEKWVATKWIR
Sbjct: 262 CDPASLHASTPLRSGEKWVATKWIR 286
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/220 (33%), Positives = 108/220 (49%), Gaps = 30/220 (13%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
D IP V + P S +C +I A+ RL+ + G+ +
Sbjct: 80 DGRTIGIPLSVDA--PALRVLDGLLSERECADLIELARPRLQRALTVDSDGK--QQIDQR 135
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTS G F A E T ++ IE ++A+ +P +HGE +L Y GQ+Y+ HYD F+PA
Sbjct: 136 RTSEGMFFRAGE--TPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPA 193
Query: 187 EYG-----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
G + QR+AS ++YL+ E GG T FP IGL V RRG
Sbjct: 194 LPGYDKLTARAGQRIASVVMYLNTPERGGGTAFP---------------EIGLTVTARRG 238
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ F + G D++SLH PV++GEKW+AT W+R++
Sbjct: 239 AAVYF--AYEGG--DQSSLHAGLPVLQGEKWIATHWLRER 274
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 113/235 (48%), Gaps = 30/235 (12%)
Query: 52 VEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ 111
V ++N++ L + GD ++ + +L PR + +F S +C ++IA A+ RL S+
Sbjct: 71 VLQDNDASL--LDLGDRQVRVLVSLLL---PRVVVLGDFLSDAECDALIALAQPRLARSR 125
Query: 112 LALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
+ + RTS + +D + + IE +IAR P HGE VLRY
Sbjct: 126 -TVDNDNGAQIVHAARTSDSMCLQLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYA 182
Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
G +Y HYD F+P G + QRLAS ++YL+ E GG T FP
Sbjct: 183 TGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERGGATRFPD---------- 232
Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ L V +G+ + F P+ SLH PV+ GEKWVATKW+R++
Sbjct: 233 -----VHLDVAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 103/206 (50%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F + SA +C +I ++ RLK S + E RTS G + ED+
Sbjct: 96 RPQLVVFADVLSAAECAELIERSRHRLKRST-TVNPLTGREDVIRNRTSEGVWYRRGEDQ 154
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQR 195
++ +E +IA T P +GE VL Y +Y H+D F P + G Q QR
Sbjct: 155 --LIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQR 212
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP GL V + G + F + +
Sbjct: 213 VATLIIYLNDVADGGETVFP---------------TAGLSVAAQAGGAVYFRYMNAERQL 257
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D ++LHG PV+ G+KW+ TKW+R++
Sbjct: 258 DPSTLHGGAPVLAGDKWIMTKWMRER 283
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 78/207 (37%), Positives = 104/207 (50%), Gaps = 25/207 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A++ +F SA++C+ +IA A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNV--VAGHRSSHGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
T ++ IE +IA T P +GE +L YE G + H D N A E + Q
Sbjct: 158 -TPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+ + L+YL DVE GGET+FP IG V P+RG L F G
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP---------------QIGWSVAPQRGHALYFEYGNRFGL 261
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +SLH S P+ G+KWVATKWIR +
Sbjct: 262 CDPSSLHASTPLRVGDKWVATKWIRTR 288
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 88/153 (57%), Gaps = 3/153 (1%)
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F++ E K ++ IE +I+ + +P +GE VLRYE Q Y H+D F +
Sbjct: 2 FLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRG 61
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
QR+A+ L+YLSD EGGET FP G K GL VKP +G+ +LF+S+ +
Sbjct: 62 GQRIATMLMYLSDNVEGGETYFPNIGSGQCSCG--GKTVEGLSVKPTKGNAVLFWSMGLD 119
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
G D S+HG C V+ GEKW ATKW+R Q+ H+
Sbjct: 120 GQSDPLSVHGGCEVLAGEKWSATKWMR-QKAHQ 151
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/146 (41%), Positives = 90/146 (61%), Gaps = 5/146 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS RPRA + F S +C +++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL 222
A+ L+YL+DV++GGE +FP G L
Sbjct: 154 ATVLMYLTDVKKGGEAVFPDAEGSHL 179
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 90/146 (61%), Gaps = 5/146 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL 222
A+ L+YLS+VE+GGET+FP G L
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLL 180
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 99/205 (48%), Gaps = 23/205 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + S +C +++ A+ RL S + + E+ RTS G E
Sbjct: 90 PSIRLYQHLLSDAECDALVELARGRLARSPV-INPDTGDENLIDARTSMGAMFQVGEHT- 147
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRL 196
+++ IE +IA +P HGE +L Y+ G +Y H+D FNP G QR
Sbjct: 148 -LIQRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRT 206
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ ++YL+ + GG T FP IGL+V P +G+ + F L P+G +D
Sbjct: 207 ATLVIYLNTPQAGGATAFP---------------RIGLEVAPVKGNAVYFSYLQPDGKLD 251
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
+LH PV GEKW+ATKW+R+
Sbjct: 252 ERTLHAGLPVQSGEKWIATKWLREH 276
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 109/227 (48%), Gaps = 40/227 (17%)
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
GD + + L PR + F S ++C ++A A+ RL LR ETV+++
Sbjct: 23 AGDREVHVLATMAL---PRVVVFGGLLSEQECDELVALAQPRL------LRS-ETVDNST 72
Query: 125 G------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
G RTS G F E T ++E IE +IA P GE VL Y G +Y
Sbjct: 73 GGSEVNAARTSDGMFFERGE--TPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQYKP 130
Query: 179 HYDAFNPAEYGP-----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
H+D F+PA G + QR+ + ++YL+ GG T FP +G
Sbjct: 131 HHDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFP---------------EVG 175
Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
L+V+P +G+ + F P + +LHG PV+ GEKWVATKW+R+
Sbjct: 176 LEVQPIKGNAVFFSYERPLAST--RTLHGGAPVLDGEKWVATKWLRE 220
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 99/203 (48%), Gaps = 23/203 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + S +C +I ++R++ S + + E R S G F++ S D
Sbjct: 91 PVVALLADVLSPRECDRLIEIGRERVRRSSV-VDPDSGGEVLIDARKSEGAFVNGSTDP- 148
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
++ I+ +IA P +GE ++LRY G +Y H+D F + G + QR+
Sbjct: 149 -LVATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRI 207
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ +LYL+ VEEGG+T FP IGL + PRRG L F + G D
Sbjct: 208 ATLILYLNQVEEGGDTTFPD---------------IGLTIHPRRGAALYFEYVNALGQTD 252
Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
+LH PV +GEKW+ATKW+R
Sbjct: 253 PRTLHAGMPVERGEKWIATKWMR 275
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 101/211 (47%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS G +
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDGMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 111/217 (51%), Gaps = 24/217 (11%)
Query: 73 IPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTS 129
+P+ + +S P + + N S + +S+ A A K+L+P+ + + +G TR +
Sbjct: 93 MPYAIEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIA 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
FI ++++ + I ++ T L E V+ Y I +Y HYD F PA+ G
Sbjct: 153 KMAFIL--DEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTF-PAKSG 209
Query: 190 PQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+ RLA+ +LYLSDVE GG T+F I ++V PR+G+ +++
Sbjct: 210 DRSHPSHDRLATAILYLSDVERGGATVF---------------TNINVRVLPRKGNVIIW 254
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
Y+ P+G + +LH CPV+ G KW+A KWI+ + Q
Sbjct: 255 YNYLPDGNLHPGTLHAGCPVLVGSKWIANKWIQSKGQ 291
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P+ + N S E+C +IIA R S + + +G RTS FI E +
Sbjct: 82 QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEG-RTSEMAFIQRGEAE 140
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+ E IE ++A P E F + +Y+ Q+Y HYD +P G + QR
Sbjct: 141 --VAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQR 198
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
LA+F+LYLSDVE+GG T+FP +GL+V P++G L F + N
Sbjct: 199 LATFILYLSDVEQGGGTVFP---------------GLGLEVYPKKGSALWFLNTDINHQP 243
Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
D+ +LHG PV++G K +A KW+R
Sbjct: 244 DKRTLHGGAPVVRGTKIIANKWLR 267
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/227 (29%), Positives = 111/227 (48%), Gaps = 23/227 (10%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP G A+ + + + +P S E+C +I A +L+ S + +
Sbjct: 73 LPPGNRIIAADREVQVALRVEQPVLAVLDGVLSHEECDELIRRAAAKLQRSTI-VDPTTG 131
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
T R+S GTF + D + ++ +I+ LP HGE +L Y G +Y H
Sbjct: 132 KHETIADRSSEGTFFEINADD--FIARLDRRISALMNLPVDHGEGLQILHYGPGGEYKPH 189
Query: 180 YDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
+D F P + G QM+ QR+++ ++YL++VE+GG T+FP +GL
Sbjct: 190 FDFFPPGDPGSAVQMATGGQRVSTLVMYLNEVEDGGATIFP---------------ELGL 234
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
V P++G + F G +D +LHG PV++GEKW+ TKW+R +
Sbjct: 235 SVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRGEKWIVTKWMRQR 281
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/209 (34%), Positives = 103/209 (49%), Gaps = 37/209 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F F S ++C +++A A+ RL S ETV++ G RTS G F
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARS-------ETVDNDTGGSEVNEARTSQGMFFM 152
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 192
E + ++ IE +IA P +GE VL Y G +Y HYD F+PA+ G P +
Sbjct: 153 RGEGE--LISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILK 210
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+ + ++YL+ E GG T FP N L+V P +G+ + F +
Sbjct: 211 RGGQRVGTLVMYLNTPERGGGTTFPDVN---------------LEVAPIKGNAVFFS--Y 253
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
SLHG PV+ GEKWVATKW+R
Sbjct: 254 ERAHPSTRSLHGGAPVLAGEKWVATKWLR 282
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGED- 167
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 168 -AFIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 227 VATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQRQL 271
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG PV G+KW+ TKW+R++
Sbjct: 272 DPLTLHGGAPVHAGDKWIMTKWMRER 297
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 114/235 (48%), Gaps = 43/235 (18%)
Query: 71 QSIPFQVLSWRPR------------ALYFPNFA------SAEQCQSIIATAKKRLKPSQL 112
Q+ P +++S PR + PN A S E+C +I ++ ++K SQ+
Sbjct: 92 QASPIRLISQLPRFTVADREVELAAVMSNPNIAVIRGLLSDEECDEVIRLSRGKMKTSQV 151
Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
R+ G + ES+ R S G+ E++ ++ IE +++ LP GE +L Y
Sbjct: 152 VDRESGGSYESS--VRKSEGSHFERGENE--LVRRIEARLSALVDLPVNRGEPLQILHYG 207
Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
G +Y +H D F P + G + QR+ + ++YL+DV EGGET FP
Sbjct: 208 PGGEYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFP----------- 256
Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
IG KP +G + F +G +D LH PVI+G+KW+ TKW+R++
Sbjct: 257 ----DIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKWIMTKWLRER 307
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 102/198 (51%), Gaps = 28/198 (14%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N SA +C +IA A+ RL+ + +G + RTS G F + D+ ++ IE
Sbjct: 102 NILSARECDELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTL--DEVPLVGRIE 157
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
++A +P +HGE +L Y GQ Y+ H+D F+P + G + QR+AS ++YL
Sbjct: 158 RRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASVVMYL 217
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+ GG T FP +GL V RRG + F + G D +SLH
Sbjct: 218 NTPARGGGTAFP---------------ALGLTVTARRGAAVYF--AYEGG--DCSSLHAG 258
Query: 264 CPVIKGEKWVATKWIRDQ 281
PV++GEKW+ATKW+R++
Sbjct: 259 LPVLEGEKWIATKWLRER 276
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGED- 167
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 168 -AFIERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 227 VATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQRQL 271
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG PV G+KW+ TKW+R++
Sbjct: 272 DPLTLHGGAPVRAGDKWIMTKWMRER 297
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 103/198 (52%), Gaps = 28/198 (14%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N +AE+C+ +IA A+ RLK + G + RTS G F + +E ++ IE
Sbjct: 102 NLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR--RTSEGMFFTLNE--LPLVGRIE 157
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
++A +P +HGE +L Y GQ+Y+ H+D F+P + G QR+AS ++YL
Sbjct: 158 QRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRVASVVMYL 217
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+ +GG T FP +GL V RRG + F + G D+ SLH
Sbjct: 218 NTPAQGGGTAFP---------------ELGLTVTARRGAAVYF--AYEGG--DQQSLHAG 258
Query: 264 CPVIKGEKWVATKWIRDQ 281
PV +GEKW+ATKW+R++
Sbjct: 259 LPVQRGEKWIATKWLRER 276
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 96/190 (50%), Gaps = 20/190 (10%)
Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
S +A E RTSS ++S + D ++ I ++A LP E VL
Sbjct: 4 STIAEAGNEAKNGVGSARTSSTAWLSKTADP--LVAKIRTRVAELVKLPMELAEDMQVLH 61
Query: 170 YEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSG 225
Y Q Y +H+D F+P Y ++ R + YLSDVEEGGET+FPF NG
Sbjct: 62 YSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDRRV- 120
Query: 226 YDYKKCI-GLKVKPRRGDGLLFYSLF---------PNG---TIDRTSLHGSCPVIKGEKW 272
D+ C GLKVKP+ G+ ++FYS+ P+ +D SLHG C VIKG+KW
Sbjct: 121 TDFADCSRGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIKGDKW 180
Query: 273 VATKWIRDQE 282
A WI +++
Sbjct: 181 AANYWIANKK 190
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 107/212 (50%), Gaps = 26/212 (12%)
Query: 77 VLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+LS R PR + F S E+C +++ A+ RL S+ + G RTS G F
Sbjct: 102 LLSMRNPRVMVFGGLLSDEECDAMVDLARPRLARSE-TVHNGSGGSEVNAARTSDGMFFD 160
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 192
E + IE +IA P +GE VLRY G +Y +H+D F+PA+ G P +
Sbjct: 161 RGEFP--LCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILK 218
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+ + ++YL+ GG T FP +GL+V P +G+ +F+S
Sbjct: 219 RGGQRVGTVVMYLNHPIRGGGTAFP---------------DVGLEVAPFKGNA-VFFSYD 262
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ RT LH PV++GEKWVATKW+R+ E
Sbjct: 263 RAHPMTRT-LHAGTPVLEGEKWVATKWVREGE 293
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/212 (33%), Positives = 109/212 (51%), Gaps = 25/212 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+VL+ P + + AS + +I AK R+ S+ +R GE RTS ++
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRI--SRATIRDDGEP--QVSNARTSQNAWL 358
Query: 135 SASEDKTGILELIEHKIARATM-LPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEY-GPQ 191
A +D+ ++ ++ ++ T L Q E V Y +G Y +H+D A Y G +
Sbjct: 359 DAGDDR--VVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLR 416
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
+ R+A+ + YLSDVE GG T+FP +GL V PR+G +L+Y+L+
Sbjct: 417 VGNRIATVMFYLSDVEIGGATVFP---------------QLGLAVFPRKGSAILWYNLYR 461
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG DR +LH +CPV+ G KWVA +WI + Q
Sbjct: 462 NGKGDRRTLHAACPVLSGSKWVANQWIHEYHQ 493
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 101/211 (47%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 216
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ G+KWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 290
>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
Length = 212
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 97/200 (48%), Gaps = 23/200 (11%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
++F S E+C +IA KPS++ + T G R+ T S S DK I+
Sbjct: 15 VHFSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRS---TVASPSADKYPII 71
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLL 201
+ + +I+ + + + E VL Y G +YD HYD+F E PQ+ R+ + LL
Sbjct: 72 KAVRRRISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSF--LEGSPQLENGGNRMLTVLL 129
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YL+DVE+GG T FP I + P G G+LF + R SLH
Sbjct: 130 YLNDVEQGGWTQFPH---------------IMANIVPNVGTGILFRNTDAQNLQLRESLH 174
Query: 262 GSCPVIKGEKWVATKWIRDQ 281
PVI GEKW+A+ WIR++
Sbjct: 175 AGLPVIDGEKWIASIWIREK 194
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 103/206 (50%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S ++C +I ++ RLK S + + E RTS G + ED
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTI-VDPATGREDVIRNRTSEGIWYQRGED- 180
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
++E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 181 -ALIERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQR 239
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 240 VATLVVYLNDVPDGGETIFPEA---------------GLSVAAQQGGAVYFRYMNGRRQL 284
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 285 DPLTLHGGAPVLSGDKWIMTKWVRER 310
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 102/216 (47%), Gaps = 20/216 (9%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ I ++ S +PR + + N + E+ ++ A+ RL+ S + E TK R +
Sbjct: 330 IQPIKMELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTK-YRIA 388
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
F+ SE + + +I T L T E V Y IG Y+ HYD E
Sbjct: 389 KAAFLQNSEHDHIVK--MTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQ 446
Query: 190 PQMS--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
R+A+++ Y+SDVE GG T+FP I L + P++G ++
Sbjct: 447 KDFGWGNRIATWMFYMSDVEAGGATVFP---------------QINLALWPQKGSAAFWF 491
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L PNG D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 492 NLHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQ 527
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEDVIRNRTSEGIWYQRGEDA 175
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 176 --FIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQR 233
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 234 VATLVIYLNDVPDGGETIFPEA---------------GLSVAAKQGGAVYFRYMNGQRQL 278
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG PV G+KW+ TKW+R++
Sbjct: 279 DPLTLHGGAPVRAGDKWIMTKWMRER 304
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 104/198 (52%), Gaps = 28/198 (14%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N S ++C+ +IA A+ RL+ + +G + RTS G F + +E ++ IE
Sbjct: 102 NILSTQECEELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTLNE--VPLVGRIE 157
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS---QRLASFLLYL 203
++A +P +HGE +L Y GQ+Y+ H+D F+P + YG + QR+AS ++YL
Sbjct: 158 QRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMYL 217
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+ GG T FP +GL V RRG + F + G D +SLH
Sbjct: 218 NTPARGGGTAFP---------------ELGLTVTARRGSAVYFA--YEGG--DPSSLHAG 258
Query: 264 CPVIKGEKWVATKWIRDQ 281
PV+ GEKW+ATKW+R++
Sbjct: 259 LPVLDGEKWIATKWLRER 276
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 26 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDSMCLR 78
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 79 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 136
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 137 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 181
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 182 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 210
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 106/208 (50%), Gaps = 19/208 (9%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
Q +SW+PRA+ + NF S ++ + II A +++K S + + E V RTS GTF+
Sbjct: 41 IQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGV--VDDIRTSYGTFL 98
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++D ++ IE ++A + +P +H E VLRY KY H D +
Sbjct: 99 RRAQDP--VIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHIDGL----------E 146
Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCIG-LKVKPRRGDGLLFYSLFP 251
R+A+ L+YL E G + P ++ + G + KP+RGD L+F+ + P
Sbjct: 147 RVATVLMYLVG-ESPGPDLAPVSACECMYAEQSNPSACAKGHVAYKPKRGDALMFFDVKP 205
Query: 252 N-GTIDRTSLHGSCPVIKGEKWVATKWI 278
+ T D S+H CPV+ G KW A KWI
Sbjct: 206 DYTTTDGHSMHTGCPVVAGVKWNAVKWI 233
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 119/268 (44%), Gaps = 29/268 (10%)
Query: 14 LPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI 73
+P A+ + + G T LS D P I +L ++ K++ LP + +
Sbjct: 36 VPMAVRIVDAVLRSEGEGGT-LSYDPPHITRRLPAIDTSAKQSVLHLPGADA------VV 88
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
F+ L+ PR + NF S+E+C + A+ P+ + + V + S
Sbjct: 89 TFEQLA--PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQL 146
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+A + ++ +E +I R T P E + RY GQ Y HYD F Q
Sbjct: 147 PAAGSE---LVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGG 203
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
QRLA+ +LYL E GG T F +G+++ PR+G L F +P+
Sbjct: 204 QRLATLILYLRAPEAGGATYF---------------ANLGMRIAPRKGSALFF--TYPDP 246
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ +LHG V+ GEKW+AT+W RD+
Sbjct: 247 GNNSGTLHGGEAVLAGEKWIATQWFRDR 274
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 102/201 (50%), Gaps = 35/201 (17%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILE 145
SAE+C+++IA A+ RL PS +V+ G R+S G F E+ +
Sbjct: 109 SAEECEALIALARPRLAPST-------SVDPLTGRNRLGAQRSSLGMFFRLREN--AFVA 159
Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFL 200
++ +++ LP +GE VL Y G + H+D P+ Q S QR+++ +
Sbjct: 160 RLDERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLV 219
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
YL++VEEGGET+FP ++G+ V P+RG + F G +D SL
Sbjct: 220 AYLNEVEEGGETVFP-------ETGW--------SVSPQRGGAVYFEYCNSLGQVDHASL 264
Query: 261 HGSCPVIKGEKWVATKWIRDQ 281
H PV+ GEKWVATKW+R +
Sbjct: 265 HAGAPVLSGEKWVATKWMRQR 285
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 107/206 (51%), Gaps = 25/206 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
P + NF +AE+C +IA A+ +++ + + GE V+ RTS + +E
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQD--RTSMNAAFARAEHP 148
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QR 195
++ +E +IA A P +GE VLRY G +Y +H+D F+ G + + QR
Sbjct: 149 --LIARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQR 206
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+ +FL+YL DV+ GG T FP + +++P++G L F + PNG
Sbjct: 207 VGTFLVYLCDVDAGGATRFP---------------ALNFEIRPKKGMALFFANTLPNGEG 251
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
+ +LH PV+ G K++A+KW+R++
Sbjct: 252 NPLTLHAGVPVVSGVKYLASKWLREK 277
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 110/230 (47%), Gaps = 31/230 (13%)
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
PH G+ S++ + + + P F S ++C ++I AK RL+ ++ G
Sbjct: 77 PHAVIGERSVRVM---LAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDG--A 131
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
+ RTS G F E ++E IE +IA +P HGE VL Y GQ+Y+ H
Sbjct: 132 QQIDEHRTSDGMFFGLGEQP--LIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQ 189
Query: 181 DAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
D F+P + G QR+AS ++YL+ + GG T FP IGL
Sbjct: 190 DWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFPE---------------IGLT 234
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
V RG + F + +G D SLH PV +GEKW+ATKW+R++ E
Sbjct: 235 VTALRGSAVCF--TYESG--DVFSLHAGLPVTRGEKWIATKWLRERPYRE 280
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 99/199 (49%), Gaps = 25/199 (12%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
N A +C+++I AK RL PS L G V S K R S G F E+ ++ +
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDK--RASWGMFFRLCEND--LVARL 162
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
+ +++ LP +GE ++L Y G + H+D P + S QR+++ + Y
Sbjct: 163 DRRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTY 222
Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
L+D EGG+T+FP +GL V P RG+ F NG +D SLH
Sbjct: 223 LNDAPEGGQTVFPQ---------------LGLAVSPIRGNACYFEYCDGNGRVDARSLHA 267
Query: 263 SCPVIKGEKWVATKWIRDQ 281
S PV +G+KWV TKW+R++
Sbjct: 268 SAPVTRGDKWVMTKWMRER 286
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 195 RLASFLLYLSDVEEGGETMFPFE 217
R+A+ L+YL+D GGET FP E
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQE 198
>gi|410637601|ref|ZP_11348175.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
gi|410142794|dbj|GAC15380.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
Length = 280
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 108/205 (52%), Gaps = 25/205 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
R + + NF +A++C++++A K +L+PS++ R+G+ KG RTSS + ++D
Sbjct: 84 RVQMIKIDNFLTAQECEALVALTKSKLRPSEIPEREGDQY---KGFRTSSTCDLPFTKDP 140
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQR 195
+ I+ KI A L E Y IGQ++ +H D F P Y QR
Sbjct: 141 --LAHEIDQKIVDALGLGVGEKEVIQAQHYAIGQEFKAHCDYFVPGSKDFKTYSKDGGQR 198
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+F++YL+++ EGGET F +G+K KP++G L++ +L +G+I
Sbjct: 199 TWTFMIYLNELCEGGETEFV---------------KLGIKFKPKQGTALVWNNLHEDGSI 243
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRD 280
+ +LH + P+ GEK V TKW R+
Sbjct: 244 NEDTLHHAHPIESGEKVVITKWFRE 268
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 108/226 (47%), Gaps = 28/226 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + S ++C ++IA A+ +L S+ + + E
Sbjct: 81 DLGDRQVQVLVSLML---PRVVVLGGLLSDDECDALIALARPQLARSR-TVDNRDGSEIV 136
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IAR P HGE VLRY G +Y HYD F
Sbjct: 137 HAARTSHSMALQPGQD--ALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 194
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
P G + QR+AS ++YL+ E GG T FP + L V
Sbjct: 195 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAA 239
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+G+ + F P+ + RT LH PV+ GEKWVATKW+R++ H
Sbjct: 240 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 283
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 168
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 169 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 226
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 227 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 271
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 272 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 300
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/228 (31%), Positives = 111/228 (48%), Gaps = 31/228 (13%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVEST 123
G + +S + +S PR N + E+C +++ A ++ + L G + VEST
Sbjct: 66 GGMAKKSTWIETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVEST 125
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
TRT+ ++ +D +++ +E KIA+ T GE VL Y Q++ H+D F
Sbjct: 126 --TRTNKQAWLDFQQDD--VVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYF 181
Query: 184 NPAEYGPQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
+PA P+ RL + ++YL EEGGET F N LK+
Sbjct: 182 DPATDPPENYEKGGNRLITVIVYLQAAEEGGETHFGAAN---------------LKLTAA 226
Query: 240 RGDGLLFYSL------FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+GD ++FY+L +D+ +LH P IKGEKWVATKWI ++
Sbjct: 227 KGDAVMFYNLKHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHER 274
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++IA A+ L S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDEECDALIALARPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ G+KWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 85/145 (58%), Gaps = 5/145 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+ +SW PR + F S +C ++ AKK+++ S +A + G++V+S RTSSG F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D ++ IE +IA T LPQ + E VLRYE GQKY+ H+D F+ +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 195 RLASFLLYLSDVEEGGETMFPFENG 219
R A+ L+YLS V EGGET+FP G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 95/193 (49%), Gaps = 23/193 (11%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
+ E CQ++IA + L+P+ + Q E G R S + D IL+ + I
Sbjct: 73 TPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDYPILQSLAEGI 129
Query: 152 ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYLSDVEE 208
A+ T +P E +L Y G +Y HYDAF A P + Q R A+ +LYL+ VEE
Sbjct: 130 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQATLILYLNAVEE 187
Query: 209 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
GGET FP +GL+V P G G+ F +L G SLH PV K
Sbjct: 188 GGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 232
Query: 269 GEKWVATKWIRDQ 281
GEKW+AT+WIR +
Sbjct: 233 GEKWIATQWIRQE 245
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++IA A+ L S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDEECDALIALAQPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ G+KWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 38 DVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQ 97
+VP T E ++ S G T D + + LS PRA F F + E+C
Sbjct: 46 EVPRADATTTTREAPRADDASASSLGPTRDIGVGDARVEKLSDSPRAYLFREFLTKEECA 105
Query: 98 SIIATAKKRLKPSQLALRQ---GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
+I + LK S + GE RTS+G F+ D ++ +E ++
Sbjct: 106 HLIEISTPHLKRSTVVGDDALLGEADGRRSDYRTSTGAFLPKLYDD--VVTRVERRVEAF 163
Query: 155 TMLP---QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGE 211
+ LP Q +A ++LRYE+GQ+Y H D F G +R+A+ L++L++ EEGGE
Sbjct: 164 SRLPFENQEQLQARSLLRYELGQEYRDHVDGFATENGG----KRVATVLMFLAEPEEGGE 219
Query: 212 TMFP---------------FENGIFLDSGYDYKKCI----------GLKVKPRRGDGLLF 246
T FP G D + G VKPR GD +LF
Sbjct: 220 TAFPNGEPSEAVAARVAAQRARGELSDCAWRGGGGGTAGGGRGNLRGFAVKPRLGDAVLF 279
Query: 247 YSLFPNGT-------IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+S + + S H SCP +G KW ATKWI ++
Sbjct: 280 FSYDADDDGGYDGAEVSHASTHASCPTTRGVKWTATKWIHER 321
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 104/208 (50%), Gaps = 23/208 (11%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I +VL P + F + S+ + + A+ L+ S + ++ V+ R S+GT
Sbjct: 310 IKMEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERS-MVVKYRANVQGKH--RISAGT 366
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + + IE +IA L E F V+ Y IG +Y +H+D F
Sbjct: 367 WVERKYN--NLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVE--- 421
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
RLA+ L Y++DVE+GG T+FP +G V+ +RG+ L +Y++ N
Sbjct: 422 DNRLATVLFYMNDVEQGGATVFP---------------RLGQTVRAKRGNALFWYNMQHN 466
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRD 280
GT+D +LHG CP++ G KW+ T+WI D
Sbjct: 467 GTVDDRTLHGGCPILVGSKWIFTQWISD 494
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 22/215 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ +S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +L+ H + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YLSDVE GG T FPF + L V P RG L +Y+
Sbjct: 434 GDLHGNRIATAIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 478
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 110/212 (51%), Gaps = 22/212 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ + +P+ + F + S + + + AK L+ + +A +Q E +K + S
Sbjct: 319 PFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSS 378
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
F ++ + I ++A T L E V+ Y +G +YD H+D F+ + +
Sbjct: 379 WF---PDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFHWGKL--K 433
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A+ L Y+SDV GG T+FP +G+ ++ R+G +Y+L
Sbjct: 434 EVNRIATVLFYMSDVSIGGATVFP---------------KLGVTLEARKGTAAFWYNLHS 478
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G +D ++LHG+CPV+ GEKWVA KWIR++ Q
Sbjct: 479 SGELDYSTLHGACPVLIGEKWVANKWIRERGQ 510
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 106/204 (51%), Gaps = 21/204 (10%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LSW PR + F S +C +I+ A + +Q V S G I
Sbjct: 62 LSWHPRVFLYEGFLSDMECDHLISMAHGK--------KQSSLVVGGSAGNNSQGASI--- 110
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED I+ IE +I+ + LP+ GE+ +L+YE+ + ++Y++ + + + RL
Sbjct: 111 EDT--IVSTIEDRISVWSFLPKDFGESMQILKYEVNKSDYNNYESQSSSGH-----DRLV 163
Query: 198 SFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+ L+YLSDV+ GGET FP G ++ +C G V+P RG+ +L ++L P+G I
Sbjct: 164 TVLMYLSDVKRGGETAFPRSELKGTKVELAAP-SECAGYAVQPVRGNAILLFNLKPDGVI 222
Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
D+ S + C V++GE+W+A K I
Sbjct: 223 DKDSQYEMCSVLEGEEWLAIKHIH 246
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 98/197 (49%), Gaps = 35/197 (17%)
Query: 94 EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
E+C +I + +L+ S TV+ G R+S GTF + D + +
Sbjct: 109 EECDELIRRSADKLQRST-------TVDPVNGGYEVIAARSSEGTFFPVNADD--FIARL 159
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
+ +IA P +GE VL Y G +Y H+D F+P + G + QR+++ L+Y
Sbjct: 160 DRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIY 219
Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
L+DV +GG T+FP +GL+V PR+G + F +G +D +LHG
Sbjct: 220 LNDVAQGGATVFP---------------TLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHG 264
Query: 263 SCPVIKGEKWVATKWIR 279
PV KGEKW+ TKW+R
Sbjct: 265 GEPVEKGEKWIITKWMR 281
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 104/202 (51%), Gaps = 13/202 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
L+W PR + F S +C ++ A+ ++ S L + T+ + + F +
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNI--TQNSTDARFKF-QLA 119
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+ K ++ IE +I+ + +P+ HGE+ +L+Y Q D + D + G RL
Sbjct: 120 DSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 174
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYD---YKKCIGLKVKPRRGDGLLFYSLFPNGT 254
+ L+YLSDV++GGET+FP D+ +C G VKP +GD +L ++L P+G
Sbjct: 175 TILMYLSDVKQGGETVFPRSE--LKDTQAKEGALSECAGYAVKPVKGDAILLFNLRPDGV 232
Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
D S + C V++GEKW+A K
Sbjct: 233 TDSDSHYEDCSVLEGEKWLAIK 254
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 98/199 (49%), Gaps = 35/199 (17%)
Query: 94 EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
++C+ +IA A+ RL PS TV+ G R+S G F E+ + +
Sbjct: 111 QECEELIALARPRLAPST-------TVDPLSGRDLVGEQRSSLGMFFRLREN--AFIARL 161
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
+ +++ LP +GE VL Y G + H+D P+ + S QR+++ + Y
Sbjct: 162 DQRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSY 221
Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
L++VEEGGET+FP +C G V PRRG + F G +D SLH
Sbjct: 222 LNEVEEGGETIFP--------------EC-GWSVPPRRGSAVYFEYCNSLGQVDHASLHA 266
Query: 263 SCPVIKGEKWVATKWIRDQ 281
PV+ GEKWVATKW+R +
Sbjct: 267 GGPVLHGEKWVATKWMRQR 285
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 84/145 (57%), Gaps = 5/145 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+ +SW PR + F S +C ++ AKK+ + S +A + G++V+S RTSSG F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D ++ IE +IA T LPQ + E VLRYE GQKY+ H+D F+ +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 195 RLASFLLYLSDVEEGGETMFPFENG 219
R A+ L+YLS V EGGET+FP G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 5/136 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGET 212
A+ L+YLS+VE+GGET
Sbjct: 175 ATVLMYLSNVEKGGET 190
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+E ++ +I+ P +GE +LRY +Y H+D F P + G Q QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQR 218
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP G+ V +G + F + +
Sbjct: 219 VATLVIYLNDVPDGGETIFPEA---------------GMSVAASQGGAVYFRYMNGRRQL 263
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 264 DPLTLHGGAPVLSGDKWIMTKWMRER 289
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/243 (32%), Positives = 112/243 (46%), Gaps = 44/243 (18%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TR 127
+ + LS PR NF E+ SII A L +Q A R + TKG TR
Sbjct: 205 VVLETLSHSPRVFSLYNFMDMEEADSIIEDA---LGMTQEAYRLKRSSTGTKGKAISKTR 261
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTH---GEAFNVLRYEIGQKYDSHYDAFN 184
TS F++ T + ++ +I + + + H + VLRY Q Y +H+D
Sbjct: 262 TSDNAFVT----HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLE 317
Query: 185 PAEYGPQMSQ-----RLASFLLYLSDVEEGGETMFPFENGI-----------------FL 222
AE S+ R A+ +LY +DV EGGET+F GI L
Sbjct: 318 SAEGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLENL 377
Query: 223 D---SGYDYKKCIGLK----VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVAT 275
D SG++ K + + V P+RG +LFY+ P+G D +S HG+CPVI G+KW A
Sbjct: 378 DLPRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAAN 437
Query: 276 KWI 278
W+
Sbjct: 438 LWV 440
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRS-TTVNPETGKEDVIRNRTSEGIWYQRGED- 162
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+E ++ +I+ P +GE +L Y +Y H+D F P + G Q QR
Sbjct: 163 -AFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 221
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP G+ V R+G + F + +
Sbjct: 222 VATLVIYLNDVPDGGETIFPEA---------------GISVAARQGGAVYFRYMNGQRQL 266
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 267 DPLTLHGGAPVLGGDKWIMTKWMRER 292
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 95/198 (47%), Gaps = 23/198 (11%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ + E CQ++IA + L+P+ + Q E G R S + D IL+
Sbjct: 16 WAGLLTPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDHPILQS 72
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYL 203
+ IA+ T +P E +L Y G +Y HYDAF A P + Q R + +LYL
Sbjct: 73 LAEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQGTLILYL 130
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+ VEEGGET FP +GL+V P G G+ F +L G SLH
Sbjct: 131 NAVEEGGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAG 175
Query: 264 CPVIKGEKWVATKWIRDQ 281
PV KGEKW+AT+WIR +
Sbjct: 176 LPVRKGEKWIATQWIRQE 193
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 99/211 (46%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
P + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 108/226 (47%), Gaps = 28/226 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 83 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
P G + QR+AS ++YL+ E GG T FP + L V
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAA 241
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+G+ + F P+ + RT LH PV+ GEKWVATKW+R++ H
Sbjct: 242 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 285
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 99/211 (46%), Gaps = 37/211 (17%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
P + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 108/226 (47%), Gaps = 28/226 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
P G + QR+AS ++YL+ E GG T FP + L V
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPD---------------VHLDVAA 261
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+G+ + F P+ + RT LH PV+ GEKWVATKW+R++ H
Sbjct: 262 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 305
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 104/215 (48%), Gaps = 22/215 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + A+ +S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLLVQLHQVIGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S ++ +L+ H + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YLSDVE GG T FPF + L V P +G L +Y+
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 107/207 (51%), Gaps = 34/207 (16%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASE 138
R R F FAS E+C+ + K+RL+ + +A G + VE R S+ ++
Sbjct: 337 RQRLQVFRQFASPEECRHLQHAGKRRLERA-VAWTDGRFQPVE----FRISTAAWLQPDH 391
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD----AFNPAEYGPQMSQ 194
D I++ I +I AT + + EA + Y +G Y+ H+D NP +
Sbjct: 392 D--AIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDHSSRGTNPD------GE 443
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
RLA+F++YL+ V++GG T FP +G V+P GD + +Y+L P+G
Sbjct: 444 RLATFMIYLNPVKQGGFTAFPR---------------LGAAVQPGYGDAVFWYNLQPSGV 488
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG+CPV++G KWVA KWI ++
Sbjct: 489 GDPLTLHGACPVLRGSKWVANKWIHER 515
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 100/206 (48%), Gaps = 23/206 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+E ++ +I+ P +GE +L Y +Y H+D F P + G Q QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 218
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ ++YL+DV +GGET+FP G+ V +G + F + +
Sbjct: 219 VATLVIYLNDVPDGGETIFPEA---------------GMSVAASQGGAVYFRYMNDRRQL 263
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 264 DPLTLHGGAPVLAGDKWIMTKWMRER 289
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 102/208 (49%), Gaps = 21/208 (10%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSI--IATAKKRLKPSQLALRQGETVESTKGTRTSS 130
IP + +S P + + + + Q ++I I+ +K P+ L G E+T+ +
Sbjct: 290 IPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQVSIFCC 349
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
T++ + D ++E I T L + E V Y IG Y HYD+ A P
Sbjct: 350 STWLEDAYDP--VVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPEDP 407
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
QRLA+ + YLS+VE GG T+FP +G+ V+P++G L + +L
Sbjct: 408 L--QRLATMMFYLSNVEIGGATIFPR---------------LGVAVRPQKGSALFWINLK 450
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
NG +R +LH +CPV+ G KW+A KWI
Sbjct: 451 RNGLTNRQTLHAACPVVIGSKWIANKWI 478
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 96/205 (46%), Gaps = 25/205 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F S +C ++IA A+ RL S+ + RTS + +D
Sbjct: 96 PRVMVLGGFLSDAECDAMIALAQPRLARSR-TVDNANGAHVVHAARTSDSMCLQLGQD-- 152
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+ + IE +IAR P +GE VLRY G +Y HYD F+P G + QR+
Sbjct: 153 ALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRV 212
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
AS ++YL+ + GG T FP + L + +G+ + F P+
Sbjct: 213 ASLVMYLNTPDRGGATRFPD---------------VHLDIAAIKGNAVFFSYDRPHPMT- 256
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
SLH PV+ GEKWVATKW+R++
Sbjct: 257 -RSLHAGAPVLAGEKWVATKWLRER 280
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 105/202 (51%), Gaps = 20/202 (9%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LSW PR + F S +C +++ + ++ S LA T G R SS I
Sbjct: 63 LSWHPRIFLYEGFLSDMECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI--- 110
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED ++ IE +I+ + LP+ +GE+ VL+Y + + + + RLA
Sbjct: 111 EDI--VVSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLA 163
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDY-KKCIGLKVKPRRGDGLLFYSLFPNGTID 256
+ L+YLSDV++GGET+FP + +C G V+P +G+ +L ++L P+G D
Sbjct: 164 TILMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETD 223
Query: 257 RTSLHGSCPVIKGEKWVATKWI 278
+ S + CPV++GEKW+A K I
Sbjct: 224 KDSQYEECPVLEGEKWLAIKHI 245
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 104/202 (51%), Gaps = 20/202 (9%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LSW PR + F S +C +++T + + S LA T G R SS I
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMD-SSLAF--------TDGDRNSSYNNI--- 356
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED ++ IE +I+ + LP+ +GE VL+Y + ++ + LA
Sbjct: 357 EDI--VVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLA 409
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDY-KKCIGLKVKPRRGDGLLFYSLFPNGTID 256
+ L+YLSDV++GGET+FP + +C G V+P +G+ LL ++L P+G ID
Sbjct: 410 TILIYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNALLLFNLRPDGEID 469
Query: 257 RTSLHGSCPVIKGEKWVATKWI 278
+ S + CPV++GEKW+A K I
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHI 491
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 100/202 (49%), Gaps = 24/202 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
L+W PR + F S +C ++ A+ ++ S L G R I+ +
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLV---------NAGARN-----ITQN 108
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
++ IE +I+ + +P+ HGE+ +L+Y Q D + D + G RL
Sbjct: 109 STDDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 163
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYD---YKKCIGLKVKPRRGDGLLFYSLFPNGT 254
+ L+YLSDV++GGET+FP D+ +C G VKP +GD +L ++L P+G
Sbjct: 164 TILMYLSDVKQGGETVFPRSE--LKDTQAKEGALSECAGYAVKPVKGDAILLFNLRPDGV 221
Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
D S + C V++GEKW+A K
Sbjct: 222 TDSDSHYEDCSVLEGEKWLAIK 243
>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
Length = 239
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 105/208 (50%), Gaps = 23/208 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LS P + +FA + C+ +I A+ L +++ R+G + R +S +++A
Sbjct: 31 LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEVQTRRGSAART--AIRRASSCWLAAR 88
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYE--IGQKYDSHYDAFNPAEYGPQMS-Q 194
D+ LE +E I P+ E F+V+RY G++Y +H DAF + Q
Sbjct: 89 GDEA--LEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQ 146
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
RL + LLYLSDV GG T+FP +GL V PRRG L+F ++ + T
Sbjct: 147 RLTTALLYLSDVGAGGATVFP---------------ALGLSVAPRRGRLLVFANVADDTT 191
Query: 255 IDRTSLHGSCPVI-KGEKWVATKWIRDQ 281
+D ++H P+ EKW+A KW+R++
Sbjct: 192 VDARTVHAGEPIAGDTEKWIANKWVRER 219
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 107/228 (46%), Gaps = 27/228 (11%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
H T I + Q ++ P + + + S ++ +II+ +K + S + + V
Sbjct: 296 HNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDDHEKAVS 355
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
T RTSS ++ + ++ + + T L T E V Y IG Y HYD
Sbjct: 356 KT---RTSSNAWLD--DVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYD 410
Query: 182 AFNPAEYGPQM------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
+ AE G ++ R+A+ + YLSDV GG T+FP +GL
Sbjct: 411 -YAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFP---------------QLGLG 454
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
V P++G + +Y+L NGT+D +LHG+CPV G KWV KWI ++ Q
Sbjct: 455 VFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKWVGNKWIHERGQ 502
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 102/215 (47%), Gaps = 22/215 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + + +S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +L+ H + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YLSDVE GG T FPF + L V P +G L +Y+
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 110/215 (51%), Gaps = 23/215 (10%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++ P + + + S + + AK +LK +++ T + +K TRT+
Sbjct: 279 LKMELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSK-TRTAKLA 337
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
+ + ++ + E + +I T E V+ Y +G Y H+D FN + GP +
Sbjct: 338 WFLDTFNQ--LTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTK-GPHI 394
Query: 193 SQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
+Q R+A+ L YL+DVE+GG T+FP + KK V P+RG +++Y+
Sbjct: 395 TQINGDRIATVLFYLNDVEQGGATVFP-----------EIKKA----VFPKRGSAIMWYN 439
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +G +R +LH CPVI G KWV KWIR++EQ
Sbjct: 440 LKDDGEGNRDTLHAGCPVIVGSKWVCNKWIREREQ 474
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/205 (32%), Positives = 104/205 (50%), Gaps = 26/205 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR F + S +C ++IA ++ RL+ S++ +G E TRTS G + + E+
Sbjct: 126 PRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSG-EFVDDTRTSYGAYFNKGENS- 183
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMS--QRL 196
++ I+ +IA T P TH E +L Y +G +Y H+D F P + G P S QR+
Sbjct: 184 -LVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESGGQRI 242
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF-YSLFPNGTI 255
A+ ++YL+DVE GG T+FP N L+ +PR+G + F Y L +I
Sbjct: 243 ATVVMYLNDVEAGGGTIFPHLN---------------LETRPRKGGAIYFSYQLAVARSI 287
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRD 280
+ + I KW+AT+W RD
Sbjct: 288 RSRCM--AARRIARRKWIATQWFRD 310
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 93/203 (45%), Gaps = 25/203 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F S +C I+A A RL S + RTS G F + E
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLARSH-TVDTATGASEVNAARTSDGMFFTRGEHP- 159
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
+ E +IA P +GE VL Y G +Y HYD F+P + G P + QR+
Sbjct: 160 -VCARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRV 218
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ + YL+ GG T FP IGL+V P +G + F P+ +
Sbjct: 219 ATLVTYLNTPTRGGGTTFP---------------DIGLEVTPLKGHAVFFSYDRPHPST- 262
Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
SLHG PV++G+KWVATKW+R
Sbjct: 263 -RSLHGGAPVLEGDKWVATKWLR 284
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 103/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + F S E+ +++ AK RL + + Q + +T R S ++S E+
Sbjct: 342 RPRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKL-TTAHYRVSKSAWLSGYENP 400
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
I+ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 401 --IVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 455
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V PR+G + +Y+LFP+
Sbjct: 456 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPS 499
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 530
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 98/199 (49%), Gaps = 31/199 (15%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ + + + S + +++ A+ L SQ G V S RTS F+ ++
Sbjct: 308 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 357
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
G + I +IA T L E +V Y IG +Y H+D G ++++R A+FL+
Sbjct: 358 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 411
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
Y+SDVE GG T+F +G+ VKP +G + +Y+L NG +D + H
Sbjct: 412 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 456
Query: 262 GSCPVIKGEKWVATKWIRD 280
CPV+ G KWVA KWI +
Sbjct: 457 AGCPVLVGNKWVANKWIHE 475
>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 71/216 (32%), Positives = 108/216 (50%), Gaps = 21/216 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSII-ATAKKRLKPSQLALRQGETVE---STKGTRT 128
+ +VLS PRA NF S + I+ T +L S A T + ST+ TRT
Sbjct: 1 MTLKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRT 60
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTH-GEAFNVLRYEIGQKYDSHYDAFNP-- 185
S T++ +K+ I++ I + A ++ + EA ++ Y++GQ+Y +H+D +P
Sbjct: 61 SLNTWVY--REKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDI 118
Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
EY P R + LLYL++ EGG T FP + + GL V+P+ G +
Sbjct: 119 DNEYQPA---RYCTLLLYLNEGMEGGATQFP--------RWVNAETRNGLDVEPKIGKAV 167
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
LFYS P+G +D S H + PV GEKW+ W D
Sbjct: 168 LFYSQLPDGNMDDWSHHAAMPVRVGEKWLMNLWTWD 203
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 98/199 (49%), Gaps = 31/199 (15%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ + + + S + +++ A+ L SQ G V S RTS F+ ++
Sbjct: 326 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 375
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
G + I +IA T L E +V Y IG +Y H+D G ++++R A+FL+
Sbjct: 376 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 429
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
Y+SDVE GG T+F +G+ VKP +G + +Y+L NG +D + H
Sbjct: 430 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 474
Query: 262 GSCPVIKGEKWVATKWIRD 280
CPV+ G KWVA KWI +
Sbjct: 475 AGCPVLVGNKWVANKWIHE 493
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 107/226 (47%), Gaps = 28/226 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
P G + QR+AS ++YL+ E GG T P + L V
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAA 261
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+G+ + F P+ + RT LH PV+ GEKWVATKW+R++ H
Sbjct: 262 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 305
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 107/226 (47%), Gaps = 28/226 (12%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 83 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
P G + QR+AS ++YL+ E GG T P + L V
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAA 241
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
+G+ + F P+ + RT LH PV+ GEKWVATKW+R++ H
Sbjct: 242 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 285
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/222 (28%), Positives = 104/222 (46%), Gaps = 30/222 (13%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ I + +P + + + S ++ +++ AK R K + +R +T E
Sbjct: 324 IQPIKMEEALLKPMIVVYHDVMSDDEIETVKKMAKPRFKRA--TIRNSKTGELEPANYRI 381
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
S + SE+ IL+ + ++ T L + E V+ Y IG Y+ H+D
Sbjct: 382 SKSAWLKSEEHDHILK-VTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTE 440
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
AF +G R+A++L Y+SDVE GG T+FP G V PR+G
Sbjct: 441 AFKELGWG----NRIATWLFYMSDVEAGGATVFP---------------PTGAAVWPRKG 481
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L+PNG + + H +CPV+ G KWV+ +WI + Q
Sbjct: 482 SAAFWYNLYPNGKGNELTRHAACPVLSGSKWVSNRWIHEHRQ 523
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 101/218 (46%), Gaps = 19/218 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P F ++ I+ + L PS + L+ G RTS+ ++ +S
Sbjct: 3 PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHP- 61
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------------EYG 189
+++ I+ + A +P +H E+ VLRYE Q YD H D F+ EYG
Sbjct: 62 -VVQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYG 120
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDGLLFYS 248
R+ + Y+SDV +GG T F G+ S K C G+ V P++ ++FYS
Sbjct: 121 --YKNRMITVFWYMSDVAKGGHTNFARSGGLPRPSSN--KDCSQGISVAPKKRKVVVFYS 176
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
+ PNG D SLH CPV +G K KWI ++ + +D
Sbjct: 177 MLPNGEGDPMSLHAGCPVEEGIKLSGNKWIWNKPRSDD 214
>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 750
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 112/229 (48%), Gaps = 48/229 (20%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
F +F SA +C ++A A L+ S++ G+ E RTSS TF++ + + ++
Sbjct: 534 FDHFLSAVECDDLVAIAAPDLRRSRVT--DGKLSEG----RTSSSTFLTGCKQEEPLVRA 587
Query: 147 IEHKIARA----TMLP---------QTHG--------------------EAFNVLRYEIG 173
IE ++ RA T++ + HG E V+RY G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
Q Y +HYD +R A+F++YL+DV GG T FP + + G G
Sbjct: 648 QMYTAHYDNKQGC------LRRTATFMMYLTDVHSGGATHFPRAVPVSMRDGC--GDAAG 699
Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+++ P+RG L+F+S+ G D SLH + PVI+GEKW+ATKW+R+ E
Sbjct: 700 IRIWPKRGRALVFWSV-SGGIEDVRSLHEAEPVIEGEKWIATKWLREDE 747
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 102/217 (47%), Gaps = 23/217 (10%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTS 129
+ + Q +S PRA + S ++C ++I A+ RL S + G+ V + S
Sbjct: 121 RQVSVQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCS 180
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----N 184
+F E+ + + + I + AR P H E RY G+++ H D F N
Sbjct: 181 CASF--PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLN 238
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
+ R+A+ LLYL++VE GG T FP G +V+P++G L
Sbjct: 239 NDKIMGSSGHRIATVLLYLNEVEAGGATFFPNP---------------GFEVRPQKGGAL 283
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
F +G++D TSLH C V +GEKW+AT W R++
Sbjct: 284 YFAYQQADGSMDPTSLHEGCAVTQGEKWIATLWFRER 320
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI AK L + + G E RT+ GT
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKDGSYEEDP--ARTTKGT 331
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + + +++ + T L + F VL Y IG Y +H+D E G
Sbjct: 332 WLV---ENSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADTEMG-NF 387
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
S R+A+ + YLSDV +GG T+FP +GL V P++G LL+Y+L
Sbjct: 388 SNRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 432
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CP I G +WV TKWI ++EQ
Sbjct: 433 GDGDNRTAHSACPTIVGSRWVMTKWINEREQ 463
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/132 (41%), Positives = 77/132 (58%), Gaps = 5/132 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 80 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE +IA + +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195
Query: 195 RLASFLLYLSDV 206
R+A+ L+YL+DV
Sbjct: 196 RIATVLMYLTDV 207
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 95/192 (49%), Gaps = 27/192 (14%)
Query: 100 IATAKKRLKPSQLALRQGETVESTKGT----RTSSGTFISASEDKTGILELIEHKIARAT 155
IA ++ KP L++ STK T +T + + + E + +I T
Sbjct: 339 IAELQEMAKPE---LKRATVYNSTKNTNQFVKTRTAKVAWFLDTFNQLTERLNQRIMDMT 395
Query: 156 MLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGE 211
E V+ Y +G Y H+D FN P +SQ R+A+ L YL+DVE+GG
Sbjct: 396 NFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTT-NPHISQINGDRIATVLFYLNDVEQGGA 454
Query: 212 TMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEK 271
T+FP + KK V P+RG +++Y+L +G +R +LH +CPVI G K
Sbjct: 455 TVFP-----------EIKKA----VFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSK 499
Query: 272 WVATKWIRDQEQ 283
WV KWIR++EQ
Sbjct: 500 WVCNKWIREREQ 511
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 71/228 (31%), Positives = 109/228 (47%), Gaps = 40/228 (17%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + S E+C +++ ++ RL R+ TV++
Sbjct: 87 DAGDRQVQLLASLML---PRVVVLGGLLSDEECDALVELSRPRL-------RRSTTVDAQ 136
Query: 124 KG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
G RTS GTF + IE +IAR P +GE VL Y G ++
Sbjct: 137 TGGSQVHADRTSRGTFFERGAHP--VCATIEARIARLLEWPVENGEGLQVLHYPPGAEFR 194
Query: 178 SHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
HYD F+P E G ++ QR+A+ ++YL+ GG T FP +
Sbjct: 195 PHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPARGGATTFPDAH-------------- 240
Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
L+V +G+ + F P+ + RT LHG PV +GEKW+ATKW+R+
Sbjct: 241 -LEVAAVKGNAVFFSYDRPH-PMTRT-LHGGAPVTEGEKWIATKWLRE 285
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 99/215 (46%), Gaps = 28/215 (13%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
P +V LS P + F + + +++ AK ++ + + G RTS
Sbjct: 324 PLKVEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVT---GHNSSVVSNARTSQF 380
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
TF+ + K +L I+ ++A T L + E + Y IG Y H D F P +
Sbjct: 381 TFLPKTRHK--VLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETK 438
Query: 189 ---GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
P+M R+ + L YLSDVE+GG T FP + ++P++
Sbjct: 439 QVSNPEMGNRIGTVLFYLSDVEQGGATAFP---------------ALKQLLRPKKHAAAF 483
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
+Y+L +G D ++HG+CP+I G KWV +WIR+
Sbjct: 484 WYNLHASGVGDARTMHGACPIIVGSKWVLNRWIRE 518
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +L+ + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YLSDVE GG T FPF + L V P RG L +Y+
Sbjct: 434 GDLHGNRMATGIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 478
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 109/244 (44%), Gaps = 21/244 (8%)
Query: 43 RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
R +LR +E L G ++ + L P + + SA +
Sbjct: 292 REELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDTAELQHL 351
Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
A+ L+ SQ+ R G + RTS GT ++ I++ + H +A + L
Sbjct: 352 ARPELQRSQVYSRTGHE-HISANFRTSQGTTFEYTDHP--IMQKMSHHVAEISGLDMRSA 408
Query: 163 EAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM--SQRLASFLLYLSDVEEGGETMFPFENG 219
E + Y IG Y+ H D+F + +Y M + RLA+ + YLS+VE GG T FPF
Sbjct: 409 EPLQIANYGIGGHYEPHMDSFPDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFPF--- 465
Query: 220 IFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+ L V P RG L +Y+L P+G D + H +CPV++G KW+A WIR
Sbjct: 466 ------------LPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKWIANVWIR 513
Query: 280 DQEQ 283
Q
Sbjct: 514 LSNQ 517
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 97/199 (48%), Gaps = 31/199 (15%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ + + + S + +++ A+ L SQ G V S RTS F+ D+
Sbjct: 53 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVISE--IRTSQSVFL----DEV 102
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
G + I +IA T L E +V Y IG +Y H+DA G +++R A+FL+
Sbjct: 103 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDA------GGDVNERTATFLI 156
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
Y+SDVE GG T+F +G+ VKP +G + + +L NG +D + H
Sbjct: 157 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWNNLHKNGELDLKTKH 201
Query: 262 GSCPVIKGEKWVATKWIRD 280
CPV+ G KWVA KWI +
Sbjct: 202 AGCPVLVGNKWVANKWIHE 220
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 104/226 (46%), Gaps = 24/226 (10%)
Query: 62 HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+ T A ++ P ++L P + + + S + + + A RLK +++ + Q
Sbjct: 299 YNSTNSAFLRLAPLKMELLQLDPYMVLYHDAISPREIEDLQFLAMPRLKRAKV-VDQVTH 357
Query: 120 VESTKGTRTSSGTFISASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
RTS T++ D T + + +I + E V+ Y +G Y S
Sbjct: 358 RNMMVKERTSKVTWLG---DATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGGHYAS 414
Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVK 237
HYD N R+A+ + YLSDVE+GG T+FP + +F
Sbjct: 415 HYDFLNATSKTRLNGDRIATVMFYLSDVEQGGATVFPKIQKAVF---------------- 458
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
P+RG +++Y+L NG D ++H +CPVI G KWV KWIR+ EQ
Sbjct: 459 PQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKWVCNKWIRENEQ 504
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 104/211 (49%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + F + S E+ + + +K RL+ + ++ +E T R S ++S E+
Sbjct: 343 RPRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYENP 401
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 402 --VVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 456
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 457 -NRIATWLFYMSDVAAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 500
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 501 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 531
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V PR+G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFP 492
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 129/244 (52%), Gaps = 24/244 (9%)
Query: 42 IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
IR +LR +VV +E L H + S Q LSW+PRA + F S E+C +I+
Sbjct: 25 IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83
Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
A K +LA G++ V + ++S G E + IE +I+ T LP+
Sbjct: 84 LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137
Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP- 215
+ E V++Y+ QKY+ ++ + +++G + +A+ LL+LS+V GGE FP
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFPE 193
Query: 216 --FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
++GI D + GL+ P +G+ +LF+++ PN + D++S + CPV++GE W
Sbjct: 194 SESKSGILSDCT---ESSSGLR--PVKGNAILFFNVHPNASPDKSSSYARCPVLEGEMWC 248
Query: 274 ATKW 277
ATK+
Sbjct: 249 ATKF 252
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 80/158 (50%), Gaps = 18/158 (11%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
RTS GT+I D + + IE +I L + E F V+ Y +G Y +H D +
Sbjct: 345 RTSKGTWIE--RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDT 402
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
+ R+A+ L YL+DVE+GG T+F N V P+RG L
Sbjct: 403 WADKKEEDDRIATVLFYLTDVEQGGATVFTILNQ---------------AVSPKRGTALF 447
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L NGT D +LHG CPV+ G KW+ T WIR++ Q
Sbjct: 448 WYNLHRNGTGDTRTLHGGCPVLVGSKWIMTLWIRERMQ 485
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + + +S+ +A+ +K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S K +L+ H + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--KNAATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YLSDVE GG T FPF + L V P +G L +Y+
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461
Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
I V P+RG +++Y+L NG ID +LH +CPVI G KWV KWIR+
Sbjct: 462 ----------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511
Query: 281 QEQ 283
+EQ
Sbjct: 512 REQ 514
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461
Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
I V P+RG +++Y+L NG ID +LH +CPVI G KWV KWIR+
Sbjct: 462 ----------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511
Query: 281 QEQ 283
+EQ
Sbjct: 512 REQ 514
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 105/217 (48%), Gaps = 28/217 (12%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P + + + + + ++ +K +K + + + RTS+
Sbjct: 375 PFKTELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 434
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++++ E+ ++E +E ++ T + E + ++ Y IG Y H D F PQ
Sbjct: 435 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TPQ 488
Query: 192 MSQR-----LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+ R +A+ L YLSDV +GG T+FP N + V+PR+GD LL+
Sbjct: 489 LEHRGGGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLW 533
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
Y+L G + ++H SCP+IKG KW KWI + Q
Sbjct: 534 YNLNDRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 570
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 51/126 (40%), Positives = 70/126 (55%), Gaps = 18/126 (14%)
Query: 161 HG-EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
HG E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 HGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRIATVLFYLTDVEQGGATVFP-- 464
Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
I V P+RG +++Y+L +G ID +LH +CPVI G KWV KW
Sbjct: 465 -------------NIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKWVCNKW 511
Query: 278 IRDQEQ 283
IR++EQ
Sbjct: 512 IREREQ 517
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +L+ + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YL+DVE GG T FPF + L V P RG L +Y+
Sbjct: 434 GDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 478
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 190 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 249
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +L+ + + L + E V Y IG Y+ H+D+F + +
Sbjct: 250 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 307
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YL+DVE GG T FPF + L V P RG L +Y+
Sbjct: 308 GDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 352
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 353 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 387
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461
Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
I V P+RG +++Y+L NG ID +LH +CPVI G KWV KWIR+
Sbjct: 462 ----------NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511
Query: 281 QEQ 283
+EQ
Sbjct: 512 REQ 514
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/244 (28%), Positives = 122/244 (50%), Gaps = 29/244 (11%)
Query: 41 SIRPKLRTLEVVEKENESGLPHGETGDAS-IQSIPFQVLSWRPRALYFPNFASAEQCQSI 99
SIR +LR EV ++E+ + G + + I + LSWRPR + F + E+C +
Sbjct: 26 SIRKELRDKEV---KHETIIQLGSSVQTNRISLLQVVQLSWRPRVFLYKGFLTDEECDRL 82
Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI----LELIEHKIARAT 155
I+ A + + KG + + +++SE ++ I L IE +I+ T
Sbjct: 83 ISLA-----------HGAKEISKGKGDGSRNNIQLASSESRSHIYDDLLARIEERISAWT 131
Query: 156 MLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+P+ + + V+ Y I + + H+D F+ +S +A+ +LYLS+V GGE +FP
Sbjct: 132 FIPKENSKPLQVMHYGIEEARE-HFDYFDNKTLISNVSL-MATLVLYLSNVTRGGEILFP 189
Query: 216 ---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
++ ++ D D ++P +G+ +L ++ N + D S HG CPV++GE W
Sbjct: 190 KSELKDKVWSDCTKDSSI-----LRPVKGNAVLIFNAHLNASADSRSTHGRCPVLEGEMW 244
Query: 273 VATK 276
ATK
Sbjct: 245 CATK 248
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 105/235 (44%), Gaps = 29/235 (12%)
Query: 55 ENESGLPHG--ETGDASI----QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
E+ SGL H + G + + IP PR F + +C +++A A+ RL
Sbjct: 99 EHASGLGHAGAQAGRNAAHFAGREIPILFTLAAPRVTLFQQLLTDAECDALVALARGRLA 158
Query: 109 PSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
S + + E+ RTS G E ++E IE IA T + GE +L
Sbjct: 159 RSPV-INPDTGDENLIEARTSLGAMFQVGEHP--LIERIEDCIAAVTGIAAERGEGLQIL 215
Query: 169 RYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLD 223
Y+ G +Y HYD FNP G QR+ + ++YL+ GG T FP
Sbjct: 216 NYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRVGTLVIYLNSPLAGGATAFPK------- 268
Query: 224 SGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+GL+V P +G+ + F +G +D +LH PV GEKW+ATKW+
Sbjct: 269 --------LGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKWIATKWL 315
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 100/203 (49%), Gaps = 24/203 (11%)
Query: 84 ALYFPNFASAE--QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
+Y +SAE + + + + KR + +L + E V+ TRTS + S +
Sbjct: 330 VIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVK----TRTSKVAWFPDSYNSL 385
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFL 200
+ + +I T + E ++ Y +G YD HYD FN E ++ R+A+ L
Sbjct: 386 TLR--LNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRIATVL 443
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
Y+SDVE+GG T+FP I V P+RG +++Y+L +G D +L
Sbjct: 444 FYMSDVEQGGATVFP---------------NIYKTVYPQRGTAVMWYNLKDDGQPDEQTL 488
Query: 261 HGSCPVIKGEKWVATKWIRDQEQ 283
H +CPV+ G KWV KWIR++ Q
Sbjct: 489 HAACPVLVGSKWVCNKWIRERAQ 511
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 80/160 (50%), Gaps = 19/160 (11%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
TRTS + S + + + +IA T E ++ Y +G YD HYD FN
Sbjct: 372 TRTSKVAWFPDSYNPLTVR--LNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNT 429
Query: 186 --AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
+ R+A+ L YL+DVE+GG T+FP I V P+RG
Sbjct: 430 INSNLTAMSGDRIATVLFYLTDVEQGGATVFP---------------NIRKAVFPQRGSV 474
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++Y+L NG D +LH +CPVI G KWV KWIR++EQ
Sbjct: 475 IMWYNLQDNGQTDNKTLHAACPVIVGSKWVCNKWIREREQ 514
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 104/213 (48%), Gaps = 22/213 (10%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P + + + + + ++ +K +K + + + RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 371
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
++++ E+ ++E +E ++ T + E + ++ Y IG Y H D F P G
Sbjct: 372 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQHRGG 429
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
R+A+ L YLSDV +GG T+FP N + V+PR+GD LL+Y+L
Sbjct: 430 --GDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 472
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H SCP+I+G KW KWI + Q
Sbjct: 473 DRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 505
>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
Length = 235
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P +L P + + +C +I A+ L+PS++ G + + T RTS
Sbjct: 39 PSNLLHDNPFIGCYEKVVTQTECHQLIDLARHGLQPSKVI---GNSEQKTSAVRTSDT-- 93
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----Y 188
I T + I +IA LP + E + RY++G K+++H+D FNP+ Y
Sbjct: 94 IGFQHHLTELTLQICKRIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMY 153
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
+ QR+ + LLYL++V GGET FP N ++V P G L+F +
Sbjct: 154 LSENGQRIITALLYLNNVSAGGETSFPLLN---------------IQVAPSEGTLLVFEN 198
Query: 249 LFPNGTIDRT-SLHGSCPVIKGEKWVATKWIRDQEQH 284
N S+H C V +GEKW+AT W ++ Q+
Sbjct: 199 CKKNSNERHALSIHEGCAVHEGEKWIATLWFHEKSQY 235
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 104/215 (48%), Gaps = 23/215 (10%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++S P + + + S + + + A LK + + +Q K TRTS T
Sbjct: 321 LKMELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVK-TRTSKVT 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYG 189
++ + ++ I + +I T E V+ Y +G YD HYD FN A+
Sbjct: 380 WLLDTLNQLTIR--LNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLT 437
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
R+A+ L YL+DVE+GG T+FP E +F P+ G +++Y+
Sbjct: 438 RLNGDRIATVLFYLTDVEQGGATVFPNIEKAVF----------------PKSGTAVVWYN 481
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +G D +LH +CPVI G KWV KWIR+++Q
Sbjct: 482 LRHDGNGDPQTLHAACPVIVGSKWVCNKWIRERQQ 516
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 198 EMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 252
Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
I V P+RG +++Y+L NG ID +LH +CPVI G KWV KWIR+
Sbjct: 253 ----------NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 302
Query: 281 QEQ 283
+EQ
Sbjct: 303 REQ 305
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 86/176 (48%), Gaps = 24/176 (13%)
Query: 115 RQGETVESTKGT---RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
R G + ST RTS FI+A+ K +L I+ ++A T L + E + Y
Sbjct: 362 RAGVVINSTSTVSKKRTSQHIFIAATRHK--VLRTIDQRVADMTNLNMQYAEDHQLADYG 419
Query: 172 IGQKYDSHYDAFNPAEYG----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD 227
IG Y H+D F ++ +M R+A+ L YLSDV +GG T FP +
Sbjct: 420 IGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPILKQL------- 472
Query: 228 YKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+KP++ +Y+L +G D +LHG CP+I G KWV +WIR+ +Q
Sbjct: 473 --------LKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWIREYDQ 520
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 100/211 (47%), Gaps = 22/211 (10%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++L P + + + S + I+ A++R+ + + T + TRT+ G
Sbjct: 310 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTAMGA 366
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ S + + I ++ + L E V+ Y IG Y H D F ++ M
Sbjct: 367 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 422
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
RLA+ L YL+DVE+GG TMF KV PRRG L +Y+L +
Sbjct: 423 GNRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLHTD 467
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CP+I G KWV T+WIR++ Q
Sbjct: 468 GEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 498
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 117/263 (44%), Gaps = 37/263 (14%)
Query: 26 LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASI----QSIPFQVLSWR 81
LA F ++ RD P +++ GE G A Q +LS
Sbjct: 16 LAAFLAASTTLRDTP-------FSGTASTQHQQPFKPGEWGYADADWLRQHYNITMLSED 68
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF----ISAS 137
P + F NF S E+ +I+ AK + S + + RTSS + + +
Sbjct: 69 PPVIQFNNFISQERIDAILHFAKPKFARSTSGIER-----EVSNYRTSSTAWMLPDVLGN 123
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+ L+ +E +IAR LP + E F VL+Y+ Q Y H D P R+A
Sbjct: 124 DPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDYIEEQRQQP-CGIRVA 182
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-ID 256
+F LYL+DVEEGG T FP + L V+P +G+ +L+YS +PN T +D
Sbjct: 183 TFFLYLNDVEEGGGTRFP---------------NLNLTVQPAKGNAVLWYSAYPNTTRMD 227
Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
+ H + PV KG K+ A KWI
Sbjct: 228 SRTDHEAMPVAKGMKYGANKWIH 250
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
Length = 311
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 108/207 (52%), Gaps = 19/207 (9%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLALRQGETVESTKGTRTSSGTFISA 136
+SWRPR + F S E+C +I+ A PS+ + G TV + SSG ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE+++A T+LP+ H F +++Y G++ Y N + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK-----VKPRRGDGLLFYSLFP 251
A+ +LYLSD GGE +FP +S K G + ++P +G+ +LF+S+
Sbjct: 173 ATVVLYLSDSASGGEILFP-------ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHL 225
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
N + D++S H P+ GE WVATK++
Sbjct: 226 NASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 107/221 (48%), Gaps = 25/221 (11%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
GETG +Q I + + +P + + +IA AK RL+ S+ +
Sbjct: 307 GETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363
Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
+RTSS T+++ + + + ++ + T+ + E + + Y IG Y H+
Sbjct: 364 GPPSRTSSNTWLNDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHH 423
Query: 181 DAF----NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 236
D F P++ G + R+A+ ++Y+SDVEEGG T+FP +G++V
Sbjct: 424 DYFEEFQTPSK-GNRFGNRVATLMIYMSDVEEGGATVFP---------------SLGVRV 467
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
P++GD + ++++ + + + H CPV+ G KW+A KW
Sbjct: 468 SPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLYGSKWIANKW 508
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 372
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 373 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 429
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 430 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 472
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 473 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 504
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 105/212 (49%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V PR+G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFP 492
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVFNKWLHERGQ 524
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 289 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 345
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 346 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 402
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 403 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 445
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 446 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 477
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 100/211 (47%), Gaps = 22/211 (10%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++L P + + + S + I+ A++R+ + + T + TRT+ G
Sbjct: 335 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTALGA 391
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ S + + I ++ + L E V+ Y IG Y H D F ++ M
Sbjct: 392 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
RLA+ L YL+DVE+GG TMF KV PRRG L +Y+L +
Sbjct: 448 GNRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLHTD 492
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CP+I G KWV T+WIR++ Q
Sbjct: 493 GEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 523
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 106/225 (47%), Gaps = 30/225 (13%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I + S P + + + Q + + + +L S +G+ V S RTS
Sbjct: 337 LQPIKVEEQSLDPMIVVLHDLITERQTEILRQLGEPKLATSLHRGGEGKFVRSM--IRTS 394
Query: 130 SGTFISASEDKTGILELIEHKIARATML---PQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
++ E+ + L I H++ AT L P+T E F + Y IG Y +H D
Sbjct: 395 KNAWLQEHENAS--LPAIRHRMELATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHP 452
Query: 187 EYGPQ--------MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
+ P+ + R+A+ ++YLSDVE GG T+FP G+ P
Sbjct: 453 DVRPEDQDPWNLYVGDRIATLMVYLSDVEAGGATVFPRA---------------GVTCWP 497
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
R+G +++L+ +G D T+ HG+CPV+ G KWV+ KWIR +Q
Sbjct: 498 RKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKWVSNKWIRQYDQ 542
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 104/211 (49%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + + + S + + + AK RL+ + ++ +E T R S +++A ED
Sbjct: 345 RPRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLE-TAHYRISKSAWLTAYEDP 403
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+++ I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 404 --VVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 458
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L+Y+SDV GG T+F +G V P++G + +Y+LFP+
Sbjct: 459 -NRIATWLIYMSDVPSGGATVFT---------------DVGAAVWPKKGSAVFWYNLFPS 502
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 503 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 533
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 98/211 (46%), Gaps = 21/211 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI K +L + + G E RT+ GT
Sbjct: 187 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 244
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + +++ + T + F VL Y IG Y H+D AE
Sbjct: 245 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 300
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
S R+A+ + YLSDV +GG T+FP +GL V P++G LL+Y+L
Sbjct: 301 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 345
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CP + G +WV TKWI ++EQ
Sbjct: 346 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 376
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 98/211 (46%), Gaps = 21/211 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI AK L + + G E RT+ GT
Sbjct: 310 LKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQDGNDKEDP--ARTTKGT 367
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + + +++ + T + F VL Y IG Y +H+D E G
Sbjct: 368 WLV---ENSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFDFLEDTEMG-HF 423
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
S R+A+ + YLSDV +GG T FP +GL V P +G LL+Y+L
Sbjct: 424 SDRIATAVFYLSDVPQGGATTFP---------------DLGLSVFPEKGAALLWYNLDHK 468
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CP I G +WV TKWI ++EQ
Sbjct: 469 GVGDNRTAHSACPTIVGSRWVMTKWINEREQ 499
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/249 (30%), Positives = 129/249 (51%), Gaps = 29/249 (11%)
Query: 42 IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
IR +LR +VV +E L H + S Q LSW+PRA + F S E+C +I+
Sbjct: 25 IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83
Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
A K +LA G++ V + ++S G E + IE +I+ T LP+
Sbjct: 84 LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137
Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP- 215
+ E V++Y+ QKY+ ++ + +++G + +A+ LL+LS+V GGE FP
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFPE 193
Query: 216 -------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
++GI D + GL+ P +G+ +LF+++ PN + D++S + CPV++
Sbjct: 194 SELKNSQSKSGILSDCT---ESSSGLR--PVKGNAILFFNVHPNASPDKSSSYARCPVLE 248
Query: 269 GEKWVATKW 277
GE W ATK+
Sbjct: 249 GEMWCATKF 257
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 100/198 (50%), Gaps = 29/198 (14%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
NF + E+C ++I +AK + P+ + G +RTSS ++ A ED L +
Sbjct: 8 NFLTEEECDALIDSAKDHMTPAPVV---GPGNGEVSVSRTSSTCYL-ARED----LPSVC 59
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
K+ T P H E V RY G+ Y HYDAF+ + + QR+A+ L+YL
Sbjct: 60 TKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATVLVYL 119
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+DVE GGET F +G+++KPR+G+ L+F+ +G +D+ LH +
Sbjct: 120 NDVERGGETSF---------------SKLGVRIKPRKGNALIFFPATLDGVLDQNYLHAA 164
Query: 264 CPVIKGEKWVATKWIRDQ 281
P + KWV+ WIR +
Sbjct: 165 EPAVD-PKWVSQIWIRQR 181
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 83/139 (59%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYL+DV EGGET+FP
Sbjct: 170 ATVLLYLTDVPEGGETVFP 188
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 15/134 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 190 PQMSQRLASFLLYL 203
QR+A+ L+YL
Sbjct: 208 KNGGQRIATLLMYL 221
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 106/248 (42%), Gaps = 55/248 (22%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKG-------TRTSSGTF 133
PR Y NF SA + + + P ++A G T ++ +G TRTS F
Sbjct: 205 PRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSENAF 264
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---------- 183
++ + + ++ R + + +LRY++GQ Y +H+D F
Sbjct: 265 DITTKQSFDVKKRA-FRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKDFNW 323
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP---------------------------- 215
+P G S R A+ LYLSDV GG+T+FP
Sbjct: 324 DPLSGG---SNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESPSASELKE 380
Query: 216 -FENGIFLDSGYD---YKKCI-GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE 270
N ++ ++ KC V PRRGD +LFYS P+G +D SLHG+CP++ G
Sbjct: 381 FVSNAGLMEGSWEDNLIHKCYEKFAVPPRRGDAILFYSQRPDGLLDTNSLHGACPILNGT 440
Query: 271 KWVATKWI 278
KW A W+
Sbjct: 441 KWGANLWV 448
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 83/146 (56%), Gaps = 3/146 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 195 RLASFLLYLSDVEEGGETMFPFENGI 220
R+A+ L+YLSD E GET FP +
Sbjct: 200 RIATMLMYLSDNIERGETYFPLAGSV 225
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 98/211 (46%), Gaps = 21/211 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI K +L + + G E RT+ GT
Sbjct: 310 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 367
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + +++ + T + F VL Y IG Y H+D AE
Sbjct: 368 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 423
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
S R+A+ + YLSDV +GG T+FP +GL V P++G LL+Y+L
Sbjct: 424 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 468
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CP + G +WV TKWI ++EQ
Sbjct: 469 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 499
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 74/129 (57%), Gaps = 5/129 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE KIA T +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 195 RLASFLLYL 203
R+A+ L+YL
Sbjct: 195 RIATVLMYL 203
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 110/218 (50%), Gaps = 29/218 (13%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSS 130
PF++ S P + + N S ++ + II +K LK S + GE+ + RTS
Sbjct: 3 PFKLEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMV----GESFSKEVSNERTSQ 58
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEY 188
+++ + + +++++ + T L + E+ V Y IG Y H+D N E
Sbjct: 59 NAWLADYDFE--LVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEE 116
Query: 189 GPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
+ + R+A+ + YLSDVE+GG T+FP IG+ V P++G +
Sbjct: 117 PYKDMGLGNRIATLMYYLSDVEQGGATVFP---------------QIGVGVFPKKGSAIF 161
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L P+GT D +LHG+CPV+ G KWVA KWI Q
Sbjct: 162 WYNLLPDGTGDERTLHGACPVLLGSKWVANKWIHQYHQ 199
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 103/202 (50%), Gaps = 29/202 (14%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
F F + ++C +I+ + L+P+ + R G + RTS G F A ED ++
Sbjct: 126 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--VRTSDGGIFGPAREDL--VI 181
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
+ I +IA A+ + GE +LRY +GQ+Y H+D P + +QR + L+YL
Sbjct: 182 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 235
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
++ GGET+FP +GL VK R+GD LLF + G ++H
Sbjct: 236 NEGYAGGETIFPR---------------LGLSVKGRKGDALLFRNTDAQGQAAEAAVHLG 280
Query: 264 CPVIKGEKWVATKWIRDQEQHE 285
PV+ G+KW+ T+WIR ++H+
Sbjct: 281 APVMAGQKWLCTRWIR-HDRHD 301
>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
Length = 190
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 97/211 (45%), Gaps = 31/211 (14%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED-- 139
PR S +C II K ++ S + G+ T TRTS ++ S
Sbjct: 1 PRVFLVREMLSEFECDHIIELGTKVVRKSMV----GQGGGFTSKTRTSENGWLRRSASPI 56
Query: 140 ------KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+ G + I+H + R+ + E V+RY+ Q+Y H+D + PQ
Sbjct: 57 LENIYKRFGDVLGIDHDLLRSG----KNAEELQVVRYDRSQEYAPHHDFGDDGT--PQ-- 108
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
QR + LLY+ EEGG T FP N +G++V P RGD +LFYS+ P+G
Sbjct: 109 QRFLTLLLYIQLPEEGGATSFPKAN-----------DGMGVQVVPARGDAVLFYSMLPDG 157
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
D +LH PV KG+KWV W+ D +H
Sbjct: 158 NADDLALHAGMPVRKGQKWVCNLWVWDPHRH 188
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 103/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + + + ++ + I +K RL+ + ++ +E T R S +++A E
Sbjct: 341 RPRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLAAYEHP 399
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+++ I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 400 --VVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 454
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G VKP +G + +Y+LFP+
Sbjct: 455 -NRIATWLFYMSDVAAGGATVFP---------------EVGAAVKPLKGTAVFWYNLFPS 498
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 499 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 529
>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
Length = 248
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 98/226 (43%), Gaps = 34/226 (15%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I + +S PR +F S + I+ + + + + TRTS T
Sbjct: 35 ITLRTVSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTRTSMNT 94
Query: 133 FISASEDKTGILELIEHKIARATML---------PQTH---------GEAFNVLRYEIGQ 174
+I E T I++ I ++A + P H E ++ Y+ G+
Sbjct: 95 WIYREE--TAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVHYDPGE 152
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
+Y +H+D P R + LLYL+DVEEGGET FP + GL
Sbjct: 153 EYTAHHDFGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP--------------RWGGL 198
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
VKP +G +LFY L +G D S H + PVIKGEKW++ WI D
Sbjct: 199 DVKPVKGKAVLFYMLTADGNSDDLSQHAALPVIKGEKWMSNLWIWD 244
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 80/170 (47%), Gaps = 23/170 (13%)
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G RTS TFI + K +L I+ ++A T L E + Y IG Y
Sbjct: 362 GNNASVVSNARTSQFTFIPKTRHK--VLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHY 419
Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
H D F+P + +M R+A+ L YL+DVE+GG T FP +
Sbjct: 420 AQHMDWFSPNAFETKQVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQL---------- 469
Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
+KP++ +Y+L +G D ++HG+CP+I G KWV +WIR+
Sbjct: 470 -----LKPKKYAAAFWYNLHASGAGDVRTMHGACPIIVGSKWVLNRWIRE 514
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 108/228 (47%), Gaps = 25/228 (10%)
Query: 62 HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+ T A ++ P +++ P + + + SA + + A LK + + G
Sbjct: 308 YNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRR 367
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
E K TRTS + + ++ + E + +IA T E + Y +G YD H
Sbjct: 368 SEVVK-TRTSKVAWFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKH 424
Query: 180 YDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
YD FN A ++Q R+A+ L YL+DVE+GG T+FP I
Sbjct: 425 YDFFN-ASTATNLTQMNGDRIATVLFYLTDVEQGGATVFP---------------NIRKA 468
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
V P+RG +++Y+L +G + +LH +CPV+ G KWV KWIR++ Q
Sbjct: 469 VFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQ 516
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 104/206 (50%), Gaps = 24/206 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + N S ++ + I AK RL ++ +R +T V +T R S ++ +D
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRL--ARATVRDPKTGVLTTAPYRVSKSAWLEGEDDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
+++ + +I T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 393 --VIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRPFDSNLKVDGNRLA 449
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+FL Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G D
Sbjct: 450 TFLNYMSDVEAGGATVFP-----------DF----GASIWPRKGTAVFWYNLFRSGEGDY 494
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 RTRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 103/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP + + + S ++ +++ AK RL+ + ++ +E T R S +++ E
Sbjct: 348 RPYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLE-TASYRISKSAWLTGYEHP 406
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++E+I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 407 --VIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 461
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF N
Sbjct: 462 -NRIATWLFYMSDVAAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFAN 505
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 506 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 536
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 102/218 (46%), Gaps = 22/218 (10%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +V+ +P+ + F + + + + A A RL+ + + +E + R S
Sbjct: 321 LQPAKEEVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAE-YRIS 379
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
++S ED ++ + H+I + T L E V Y +G Y+ H+D E
Sbjct: 380 KSAWLS--EDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEIN 437
Query: 190 P----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
R+A+FL Y+SDVE GG T+FP +G ++ P +G
Sbjct: 438 AFKSLNTGNRIATFLFYMSDVEAGGATVFP---------------QVGARLIPEKGSAAF 482
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L NG D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 WYNLLKNGEGDYSTRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 108/228 (47%), Gaps = 25/228 (10%)
Query: 62 HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+ T A ++ P +++ P + + + SA + + A LK + + G
Sbjct: 273 YNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRR 332
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
E K TRTS + + ++ + E + +IA T E + Y +G YD H
Sbjct: 333 SEVVK-TRTSKVAWFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKH 389
Query: 180 YDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
YD FN A ++Q R+A+ L YL+DVE+GG T+FP I
Sbjct: 390 YDFFN-ASTAANLTQMNGDRIATVLFYLTDVEQGGATVFP---------------NIRKA 433
Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
V P+RG +++Y+L +G + +LH +CPV+ G KWV KWIR++ Q
Sbjct: 434 VFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQ 481
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + + S + +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 RPRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLE-TAPYRISKSAWLSGYEHS 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
T +E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 395 T--IERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+F +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFT---------------DVGAAVWPKKGTAVFWYNLFPS 493
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 524
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 103/209 (49%), Gaps = 26/209 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + + + S E+ + AK RL+ + ++ +E+ + R + ++S ED
Sbjct: 326 KPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQ-YRITKSAWLSGYEDP 384
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ------MSQ 194
++ + +I T L + E V Y IG +Y+ H+D +Y P
Sbjct: 385 --VVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLR--KYEPDAFKKLGTGN 440
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A++L Y+SDVE GG T+FP +G V P++G + +Y+L +G
Sbjct: 441 RVATWLFYMSDVEAGGATVFPE---------------VGAAVYPKKGTAVFWYNLLESGE 485
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 486 GDYSTRHAACPVLVGNKWVSNKWIHERGQ 514
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 102/210 (48%), Gaps = 21/210 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V++ P + + AS + +I + ++ S + + V + RTS ++++
Sbjct: 314 EVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMVGDAAKKEVSKS---RTSQNSWLT 370
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
D + L A L +T E+ V Y IG Y HYD P+++
Sbjct: 371 -DYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGIGGHYLPHYDWSREENPYPELNTG 429
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
R+A+ + YLSDVEEGG T+FP +G+ V P++G + +Y+L +G
Sbjct: 430 NRIATLMFYLSDVEEGGATVFPH---------------LGVGVFPKKGTAIFWYNLRASG 474
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D +LHG+CPV+ G KWVA KWI ++ Q
Sbjct: 475 KGDEKTLHGACPVLIGSKWVANKWIHERHQ 504
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 99/216 (45%), Gaps = 22/216 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ LS P S++ + I A+ ++K S + G RTS G
Sbjct: 316 PFKLEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +++ + + L E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RNAATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSFPENHIYDE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G R+A+ + YLSDVE GG T FPF + L V P +G L +Y+
Sbjct: 434 GDDRGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
L +G D + H +CPV++G KW+A WIR++ QH
Sbjct: 479 LHESGDQDYRTKHAACPVLQGSKWIANVWIRERNQH 514
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 75/251 (29%), Positives = 120/251 (47%), Gaps = 48/251 (19%)
Query: 44 PKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
P LR LE ++ S LP+ E +AS PR N + ++C+ ++ A
Sbjct: 42 PSLRGLESLK----SNLPYLEQINAS------------PRVFRIRNLLTKQECEHLMLLA 85
Query: 104 KKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
++ + + G + VEST TRT+ G ++ +D ++ +E + + T
Sbjct: 86 FRKGLSKTMIMPYGTHKLVEST--TRTNDGAWLDFLQDD--VVRRLEETLGKLTKTTPQQ 141
Query: 162 GEAFNVLRYEIG-QKYDSHYDAFNPAEYGP----QMSQRLASFLLYLSDVEEGGETMFPF 216
GE VL Y G Q + HYD F+PA P Q R + ++YL EGGET FP
Sbjct: 142 GENLQVLHYSNGAQFFQEHYDYFDPARDPPESFEQGGNRYITVIVYLEAALEGGETHFP- 200
Query: 217 ENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL--FPNGT----IDRTSLHGSCPVIKGE 270
+GLK+ + GD L+FY+L +GT +++ ++H + P ++GE
Sbjct: 201 --------------ELGLKLTAQPGDALMFYNLKEHCSGTDPDCVEKKTIHAALPPVRGE 246
Query: 271 KWVATKWIRDQ 281
KWVA KWI ++
Sbjct: 247 KWVAVKWIHEK 257
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 101/215 (46%), Gaps = 25/215 (11%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +V++ P + + S + +I A+ +K S + + E + R S
Sbjct: 315 LKLEVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAVGDTRSEQISKI---RISQNA 371
Query: 133 FISASEDKTGILELIEHKIAR--ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
+ D I+E + + AR A L + E V Y +G Y HYD A P
Sbjct: 372 WFENEHDP--IVETLNQR-ARDMAGGLNEPSYELLQVNNYGLGGFYSIHYDWSTSANPFP 428
Query: 191 Q--MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
M R+A+ + YLSDV+EGG T+FP N L V+PR+G + +Y+
Sbjct: 429 NKGMGNRIATLMFYLSDVQEGGSTVFPRLN---------------LAVRPRKGTAIFWYN 473
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L NG ++ +LH +CPV+ G KWVA KWI ++ Q
Sbjct: 474 LHRNGKGNKKTLHAACPVLIGSKWVANKWIHERHQ 508
>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
Length = 226
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 98/202 (48%), Gaps = 30/202 (14%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK-TGI 143
Y P+F A C ++A + S + ES + RTS S D+ +
Sbjct: 43 FYHPDFLDAATCDRLVALIDANRRRSTVLAE-----ESVQDFRTSD----SCDMDRWSPD 93
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLAS 198
+ + IA + HGE RY +GQ + +H+D FN A+ Y P+M QR +
Sbjct: 94 VRPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWT 153
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
++YL+DVEEGG T FP IG++V P++G L + ++ P+G +
Sbjct: 154 AMIYLNDVEEGGATWFP---------------TIGIRVAPKKGLLLTWNNMKPDGDRNTA 198
Query: 259 SLHGSCPVIKGEKWVATKWIRD 280
+LH PV++G K++ TKW R+
Sbjct: 199 TLHEGMPVVQGTKYIVTKWFRE 220
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 105/212 (49%), Gaps = 22/212 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V P L F N S + +++ A+ RL + + +E R S
Sbjct: 317 PFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKV 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + L ++ ++A T L + E F V+ Y IG Y+ H+D + + P
Sbjct: 376 AWLEDQEHQH--LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PA 431
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
+ R+ + L YLSDVE+GG T+FP I + V P++G +++++L P
Sbjct: 432 IGSRIETVLFYLSDVEQGGATVFP---------------EIQVSVWPQKGSAVVWFNLHP 476
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D+ + H CPV+ G KW+ATKWI ++ Q
Sbjct: 477 SGDGDQRTKHAGCPVLIGSKWIATKWIHERGQ 508
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 74/134 (55%), Gaps = 15/134 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ +IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 190 PQMSQRLASFLLYL 203
QR+A+ L+YL
Sbjct: 209 KNGGQRMATLLMYL 222
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 99/204 (48%), Gaps = 21/204 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
P + + + SA++ + + A R++ S + L G+ +S R S +++
Sbjct: 331 PYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKS--AFRVSKNAWLAYESHP 388
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASF 199
T +E + + AT L T+ E V Y +G Y+ H+D F +P Y + R+A+
Sbjct: 389 T--MEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATA 446
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
+ YLSDVE+GG T FPF + VKP+ G+ L +Y+L + +D +
Sbjct: 447 IFYLSDVEQGGATAFPF---------------LDFAVKPQLGNVLFWYNLHRSLDMDYRT 491
Query: 260 LHGSCPVIKGEKWVATKWIRDQEQ 283
H CPV+KG KW+ WI D Q
Sbjct: 492 KHAGCPVLKGSKWIGNVWIHDMTQ 515
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 105/216 (48%), Gaps = 24/216 (11%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSS 130
PF ++LS P + + + + + ++ +K +K + + V RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 371
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
++++ E+ ++E +E ++ T + E + ++ Y IG Y H D F + P
Sbjct: 372 SVWLASHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQ-AP 428
Query: 191 QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
+ R+A+ L YLSDV +GG T+FP N + V+PR+GD LL+Y
Sbjct: 429 EHRGGGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWY 473
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L G + ++H SCP+I+G KW KWI + Q
Sbjct: 474 NLNDRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 509
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 101/202 (50%), Gaps = 20/202 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P L F N S + +++ A+ RL + + +E R S ++ E +
Sbjct: 321 PDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKVAWLEDQEHQH 379
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
L ++ ++A T L + E F V+ Y IG Y+ H+D + + P + R+ + L
Sbjct: 380 --LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PAIGSRIETVLF 435
Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
YLSDVE+GG T+FP I + V P++G +++++L P+G D+ + H
Sbjct: 436 YLSDVEQGGATVFPE---------------IQVSVWPQKGSAVVWFNLHPSGDGDQRTKH 480
Query: 262 GSCPVIKGEKWVATKWIRDQEQ 283
CPV+ G KW+ATKWI ++ Q
Sbjct: 481 AGCPVLIGSKWIATKWIHERGQ 502
>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
[Cucumis sativus]
Length = 311
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 107/207 (51%), Gaps = 19/207 (9%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLALRQGETVESTKGTRTSSGTFISA 136
+SWRPR + F S E+C +I+ A PS+ + G TV + SSG ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE+++A T+LP+ H F +++Y G++ Y N + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK-----VKPRRGDGLLFYSLFP 251
A+ +LYLSD GGE +FP +S K G + ++P +G+ +L +S+
Sbjct: 173 ATVVLYLSDSASGGEILFP-------ESKVKSKFWSGRRKKNNFLRPVKGNAILXFSVHL 225
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
N + D++S H P+ GE WVATK++
Sbjct: 226 NASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 97/211 (45%), Gaps = 32/211 (15%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +VL +P + F + S + + A LK + + S KGTRTS G
Sbjct: 295 LKMEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGI 354
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++S S + + + I +I+ T + V+ Y + Y H D FN AE
Sbjct: 355 WLSRSHN--NLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYALHTDYFNTAE----- 407
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
LSDVE+GG+T+FP F KP RG LL+Y+L N
Sbjct: 408 ----------LSDVEQGGDTVFPRIEQAF---------------KPERGKALLWYNLHRN 442
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
GT D+ + HG+CPV+ G KW+ T+WI ++ Q
Sbjct: 443 GTGDKRTEHGACPVLVGSKWIMTQWINERPQ 473
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ QG V RT+ G + ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFADVGYS--------VYPQAGTAIFWYNLDTN 486
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 103/209 (49%), Gaps = 28/209 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK RL ++ +R +T V +T R S ++ ED
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
+++ + +I T L E V Y +G +Y+ H+D E P +RL
Sbjct: 395 --VIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 450
Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
A+FL Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G
Sbjct: 451 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTSVFWYNLFRSGE 495
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 496 GDYRTRHAACPVLVGSKWVSNKWIHERGQ 524
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 98/211 (46%), Gaps = 29/211 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I+ A + +K +++ ET T RT+ G ++ ++
Sbjct: 327 PYVVLYHEVLSAREISMLISKAAQNMKNTRV---HRETKPKTNRGRTAKGHWLKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQ---- 194
+ I +I T E F V+ Y IG Y H D F+ A GP+ Q
Sbjct: 383 -LTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVL 441
Query: 195 --RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+ L YLSDVE+GG T+F +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLSDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDTD 486
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H SCPVI G KWV T+WIR+ Q
Sbjct: 487 GNGDPLTRHASCPVIVGSKWVMTEWIRESRQ 517
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 103/202 (50%), Gaps = 29/202 (14%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
F F + ++C +I+ + L+P+ + R G + RTS G F A ED ++
Sbjct: 143 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--IRTSDGGIFGPAREDL--VI 198
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
+ I +IA A+ + GE +LRY +GQ+Y H+D P + +QR + L+YL
Sbjct: 199 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 252
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
++ GGET+FP +GL VK R+G+ LLF + G ++H
Sbjct: 253 NEGYAGGETIFPR---------------LGLSVKGRKGNALLFRNTDAQGQAAEAAVHLG 297
Query: 264 CPVIKGEKWVATKWIRDQEQHE 285
PV+ G+KW+ T+WIR ++H+
Sbjct: 298 APVMAGQKWLCTRWIR-HDRHD 318
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ QG V RT+ G + ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKATQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFADVGY--------SVYPQAGTAIFWYNLDTN 486
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTKHAACPVIVGSKWVMTEWIREKRQ 517
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 103/209 (49%), Gaps = 28/209 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + +F S E+ + I AK +L ++ +R ++ V +T R S ++ ED
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKL--ARATVRDPKSGVLTTASYRVSKSAWLEGEEDP 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
I+ + +I T L E V Y +G +Y+ H+D E P +RL
Sbjct: 397 --IIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 452
Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
A+FL Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G
Sbjct: 453 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFKSGE 497
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 498 GDYRTRHAACPVLVGNKWVSNKWIHERGQ 526
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP + + + S + I AK RL+ + ++ +E T R S +++A ED
Sbjct: 350 RPYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLE-TAPYRISKSAWLTAYEDP 408
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 409 --VVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 463
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 464 -NRIATWLFYMSDVSAGGATVFP---------------DVGASVGPQKGTAVFWYNLFAS 507
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 508 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 538
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 103/210 (49%), Gaps = 28/210 (13%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----- 194
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTG 447
Query: 195 -RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +G
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 492
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 99/217 (45%), Gaps = 26/217 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
I + + LS P + + + + +I +LK + + E+V S RTS
Sbjct: 290 IAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATIT-STNESVVS--NVRTS 346
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY- 188
TF+ +EDK +L I+ ++A T + E Y IG Y H D F +
Sbjct: 347 QFTFLPVTEDK--VLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFD 404
Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
P+M R+A+ L YLSDV +GG T FP + + +KP++
Sbjct: 405 AGLVSSPEMGNRIATVLFYLSDVTQGGGTAFPH---------------LRVLLKPKKYAA 449
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
+Y+L +G D + HG+CP+I G KWV +WIR+
Sbjct: 450 AFWYNLHASGVGDPRTQHGACPIISGSKWVQNRWIRE 486
>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
Length = 478
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 122/293 (41%), Gaps = 68/293 (23%)
Query: 39 VPS----IRPKLRTLEVVEKENESGLPHGETG-DASIQSIPFQVLSWRPRALYFPNFASA 93
VPS + P +RT + L ETG DA +S+ LS RP+ F
Sbjct: 114 VPSGVHFVWPTVRTGHATVTD----LADAETGVDAGHRSV-VTTLSMRPQVFRISQFMMG 168
Query: 94 EQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
+ + +I K R+KPS++ L R G+ TRTS+ + +AS + I
Sbjct: 169 HETEKLIERNKPRIKPSEVGLVGRSGDK------TRTSTNAWDTASP-------VARDVI 215
Query: 152 ARATMLPQTHG-----EAFNVLRYEIGQKYDSHYDAFNPAEYG----------------P 190
RA L + + VL YE Q Y H D F G
Sbjct: 216 GRAFRLLKIDAHRKLEDGLQVLHYERPQWYKPHVDYFTSRNAGGGGASEDAFSNAIPTAN 275
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP-------FENGIFLDSGYDYK-------------- 229
+ R A+ LYL++ GGET+FP ++ G +G +
Sbjct: 276 NGTNRFATVFLYLNNAGSGGETVFPLSTTHEIYQGGRLTQAGTNRTPGFIRDADAAWVCD 335
Query: 230 -KCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
K L+V PR GD +LFYS + ++D SLHGSCP+ GEKW A W+ ++
Sbjct: 336 TKSEALRVTPRTGDSVLFYSQRGDASLDGYSLHGSCPMGDGEKWAANLWVWNR 388
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/251 (31%), Positives = 117/251 (46%), Gaps = 39/251 (15%)
Query: 35 LSRDVPSIRPKLRTLEVVEKE-NESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASA 93
L+ D P + ++R L E+E N P ET SI+ + R L F+SA
Sbjct: 97 LAVDDPVAQQQMRLLNAAEEEENWRTEPVSET--PSIRMV---------RHL----FSSA 141
Query: 94 EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIAR 153
E C + + RL+PS + L RTS G +S E+ ++ ++ +IA
Sbjct: 142 E-CAYLQQMSAPRLRPSTI-LDPQTGARRPDPVRTSVGAALSPVEEDL-VVGMLNRRIAA 198
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
AT + GE ++LRY Q+Y H+DA E +QR + ++YL+ EGGET
Sbjct: 199 ATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE-----NQRSHTLIVYLTADYEGGETA 253
Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
FP +G +++ R+GD LLF +L +G D H P G KW+
Sbjct: 254 FPE---------------LGFRLRGRQGDALLFANLREDGRPDLRMRHAGLPATSGAKWI 298
Query: 274 ATKWIRDQEQH 284
AT+WIR + H
Sbjct: 299 ATRWIRTRPYH 309
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 103/217 (47%), Gaps = 20/217 (9%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+ I +V +P+ F + + + + + A +L + + GE + +T R S
Sbjct: 314 LSPIKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATVHGENGELLHAT--YRIS 371
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-- 187
++S S+D G ++ I+ +I T L + E V+ Y IG +Y+ HYD E
Sbjct: 372 KSGWLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDT 431
Query: 188 -YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
R+++ L+Y+SDVE+GG T+FP +G ++ P + +
Sbjct: 432 FTSLGSGNRISTLLIYMSDVEKGGATVFP---------------GVGARLVPIKRAAAYW 476
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
++L +G D ++ H CPV+ G KWV KWI ++ Q
Sbjct: 477 WNLKRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQ 513
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 105/213 (49%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLS RP + + +F S + + I A+ L+ S +A G+ ++T R S ++
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVA--TGDK-QATAEYRISKSAWLK 392
Query: 136 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
S T + ++ KI+ T L HGE V+ Y IG Y+ H+D A +P+ +
Sbjct: 393 GSAHST--VSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 450
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+F++YLS VE GG T F + N V + + +++L
Sbjct: 451 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 495
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 496 RNGEGDADTLHAGCPVLIGDKWVANKWIHEYGQ 528
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 185 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 242
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 243 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 298
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 299 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 341
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 342 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 373
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 85/163 (52%), Gaps = 24/163 (14%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
TRTS T+++ S + + + +I+ T E V+ Y +G YD H+D FN
Sbjct: 372 TRTSKVTWLTDSLNPLTVR--LNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFN- 428
Query: 186 AEYGPQMSQ----RLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRR 240
A +++ R+A+ L YL+DVE+GG T+FP + IF P++
Sbjct: 429 ATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFPNIKQAIF----------------PKK 472
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G +++Y+L N D +LH +CPVI G KWV KWIR+ +Q
Sbjct: 473 GTAVMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWIREHQQ 515
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + + S + +++ AK RL S+ + ET + +T R S ++S ED
Sbjct: 293 KPRIVRYHEIISDAEIETVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 350
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 351 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 406
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 407 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 449
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 450 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 481
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 383
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 384 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 439
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 440 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 482
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 483 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 514
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 394
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 395 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 450
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 451 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 493
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 254 KPRIIRFHDIISDAENEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 311
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 312 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 367
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 368 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 410
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 411 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 442
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 102/209 (48%), Gaps = 26/209 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 448
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +G
Sbjct: 449 RIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGE 493
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 215
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 93/201 (46%), Gaps = 24/201 (11%)
Query: 85 LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
++F S ++C +IA KPS + + T G R S T ++ S D I
Sbjct: 17 VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 73
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
+ I +I + + Q + E +L Y G KYD HYDAF ++ PQ+ RL + L
Sbjct: 74 ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 131
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
LYL+DVE GG T FP I + P G G+LF + R SL
Sbjct: 132 LYLNDVEYGGWTQFPH---------------IMANIVPNAGSGILFRNTDAQNRQLRESL 176
Query: 261 HGSCPVIKGEKWVATKWIRDQ 281
H PV GEKW+A+ WIR+
Sbjct: 177 HAGLPVTHGEKWIASIWIREN 197
>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 213
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 93/201 (46%), Gaps = 24/201 (11%)
Query: 85 LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
++F S ++C +IA KPS + + T G R S T ++ S D I
Sbjct: 15 VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 71
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
+ I +I + + Q + E +L Y G KYD HYDAF ++ PQ+ RL + L
Sbjct: 72 ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 129
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
LYL+DVE GG T FP I + P G G+LF + R SL
Sbjct: 130 LYLNDVEYGGWTQFPH---------------IMANIVPNAGSGILFRNTDAQNRQLRESL 174
Query: 261 HGSCPVIKGEKWVATKWIRDQ 281
H PV GEKW+A+ WIR+
Sbjct: 175 HAGLPVTHGEKWIASIWIREN 195
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F S + + + AK RL+ + ++ +E T R S ++S ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYEDP 395
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 396 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 450
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 451 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 494
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 106/205 (51%), Gaps = 19/205 (9%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT---SSGTFI 134
+SW+PR + F + E+C +I+ A+ + S+ G+ +S + R +S T +
Sbjct: 61 VSWQPRVFVYKGFLTDEECDHLISLAQGTKETSE-----GKDDDSGRIERNRLFASSTSL 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D IL IE +++ T+LP+ + + V+ Y I + +++D F
Sbjct: 116 LNMDD--NILSRIEERVSAWTLLPKENSKPLQVMHYGI-EDAKNYFDYFGNKSAIISSEP 172
Query: 195 RLASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
+A+ + YLS+V +GGE FP +N I+ D I ++P +G+ +LF+++ P
Sbjct: 173 LMATLVFYLSNVTQGGEIFFPKSEVKNKIWSDCTK-----ISDSLRPIKGNAILFFTVHP 227
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATK 276
N + D S H CPV++GE W ATK
Sbjct: 228 NTSPDMGSSHSRCPVLEGEMWYATK 252
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 101/213 (47%), Gaps = 21/213 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K + PS+ + V S RTS
Sbjct: 322 IKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFPSETVNAANDFVVSK--FRTSKSV 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYGP 190
++ ++ + + ++A AT L H E F V+ Y IG ++SH+D + +
Sbjct: 380 WLDRDANEATVK--LTQRLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVG 437
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
R+A+ L YL+DV +GG T FP N + V PR G L +Y+L
Sbjct: 438 GFIDRIATTLFYLNDVPQGGATHFPGLN---------------ITVFPRLGAALFWYNLD 482
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 483 TQGMLQVRTMHTGCPVIVGSKWVVSKWIDDKGQ 515
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 102/209 (48%), Gaps = 26/209 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E T R S ++S ED
Sbjct: 307 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLE-TVHYRISKSAWLSGYEDP 365
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 366 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 421
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +G
Sbjct: 422 RIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGE 466
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 467 GDYSTRHAACPVLVGNKWVSNKWLHERGQ 495
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 91/197 (46%), Gaps = 34/197 (17%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
+ + NF S +C+ II A ++K S + + V RTS GTF+ D ++
Sbjct: 1 MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGV--VDDIRTSYGTFLRRVPDP--VI 56
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
IEH++A + LP +H E VLRY KY H D +R+A+ L+YL
Sbjct: 57 AAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHIDGL----------ERVATVLIYLG 106
Query: 205 DVEEGGETMFPFENGIFLDSGYDYKKCIGLKV--KPRRGDGLLFYSLFPN-GTIDRTSLH 261
E + +C +V KP+RGD L+F+ P+ D S+H
Sbjct: 107 QAERA-----------------NLSQCARGRVAYKPKRGDALMFFDTMPDYKQTDVHSMH 149
Query: 262 GSCPVIKGEKWVATKWI 278
CPV++G KW A KW+
Sbjct: 150 TGCPVVEGVKWNAVKWL 166
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 102/211 (48%), Gaps = 32/211 (15%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + N S + + I AK RL ++ +R +T V +T R S ++ ED
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++E + +I T L E + Y +G +Y+ H+D AF G
Sbjct: 397 --VIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTG--- 451
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +
Sbjct: 452 -NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIYPKKGTAVFWYNLFRS 495
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 496 GEGDYRTRHAACPVLVGCKWVSNKWIHERGQ 526
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 80/173 (46%), Gaps = 23/173 (13%)
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G RTS TFI AS K +L I+ ++A T L + E Y IG Y
Sbjct: 362 GHNESLVSNVRTSQFTFIPASAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHY 419
Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
H D F + P+M R+A+ L YLSDV +GG T FP +
Sbjct: 420 GQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTL---------- 469
Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+KP++ +++L +G D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 470 -----LKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIREFDQ 517
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 76/151 (50%), Gaps = 6/151 (3%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I + +SW PRA + NF S +C + KR+ S L + RTS G
Sbjct: 6 IRIETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRS-LVVDSKTGQSKLDDIRTSYGA 64
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQ 191
ED ++ +E +IA T LP +GE +LRY GQKYD+H+D F +P +
Sbjct: 65 AFGRGEDP--VIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAY 122
Query: 192 MSQ--RLASFLLYLSDVEEGGETMFPFENGI 220
+ + R A+ LLYLS VE GGET P + I
Sbjct: 123 LHEGNRYATVLLYLSGVEGGGETNLPLADPI 153
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 100/219 (45%), Gaps = 23/219 (10%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ + + +P + + + E+ +++ A R K + + +E+ K R S
Sbjct: 335 IQPLKMEEAFLKPLLVIYHDVIFDEEIETVKKLAHPRFKRTTVMNSATGKLETAK-YRIS 393
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY- 188
F+ E +L++ ++ T L + E V Y IG Y+ H+D E
Sbjct: 394 KAAFLKNKEHHH-VLKM-SRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETI 451
Query: 189 ----GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
R+A++L Y+SDVE GG T+FP + + + P++G
Sbjct: 452 GFNKDSGWRNRIATWLFYMSDVEAGGATVFP---------------ALNVALWPQKGSAA 496
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+LFPNG + + H +CPV+ G KWVA KWI ++ Q
Sbjct: 497 FWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIHEKNQ 535
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 78/161 (48%), Gaps = 21/161 (13%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
R + F+ SE ++ + ++ T L E V Y IG Y H+D
Sbjct: 373 RIAKAAFLKDSEH--NLIVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDYARQG 430
Query: 187 E-YGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
E +GP+ R+A++L Y+SDVE GG T+FP +G + P++G
Sbjct: 431 EIHGPRDLDWGNRIATWLFYMSDVEAGGATVFP---------------AVGAALWPQKGS 475
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L PNG D +LH CPV+ G KWV+ KWI ++ Q
Sbjct: 476 AAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIHERSQ 516
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 44 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 101
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 102 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 157
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 158 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 200
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 201 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 232
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 345
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 346 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 401
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 402 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 444
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 445 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 476
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 508
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 105/220 (47%), Gaps = 26/220 (11%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQGETVESTKGTRTSS 130
++ VLS PR +F S + + ++ A KR LK S + + TRTS+
Sbjct: 278 NMTMTVLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTST 337
Query: 131 GTFISASED---------KTGILELIEH--KIARATMLPQ---TH---GEAFNVLRYEIG 173
+I +D +L++ E + R + +P+ +H E ++ Y++G
Sbjct: 338 NDWIPRHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVG 397
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
Q+Y H+D P Q S R A+ L YL+D +GGET FP + G
Sbjct: 398 QQYTPHHDFTMPGLVNMQPS-RFATLLFYLNDDMDGGETAFPRWLHADEEGG-------S 449
Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
LKVKP +G +LFY+L P+G D S H + PV +GEKW+
Sbjct: 450 LKVKPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKWL 489
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 302 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 359
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 360 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 415
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 416 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 458
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 459 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 490
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 104/217 (47%), Gaps = 26/217 (11%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ LS P +Y F +C+++I A+ R+K + ++L V ++G RT S ++
Sbjct: 50 METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGSSGV--SQG-RTGSNCWL 106
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-----PAEYG 189
E+ + I ++A+ P + E V+ Y Q+Y HYDA++
Sbjct: 107 RYQEEP--LARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCT 164
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
Q QR+ + LLYL++VEEGG T FP G++V PR+G +F ++
Sbjct: 165 RQGGQRMVTALLYLNEVEEGGATAFP---------------NAGVEVAPRKGRIAIFNNV 209
Query: 250 FPN-GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
+ G SLHG PV GEKW A+ W R + HE
Sbjct: 210 GADPGRPHPRSLHGGMPVKSGEKWAASIWFRARPAHE 246
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 24/209 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR +++ N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 208 PRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 267 --VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 369
Query: 258 TSLHGSCPVIKGEKWVATKWI--RDQEQH 284
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQEFH 398
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP + + + S + +++ AK RL+ + + Q + +T R S ++ A E
Sbjct: 331 RPHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQL-TTAPYRVSKSAWLGAFEHP 389
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+++ I +I T L + E V Y +G +Y+ HYD AF G
Sbjct: 390 --VVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTG--- 444
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++LLY+S+V+ GG T+F IG V P++G + +Y+L P+
Sbjct: 445 -NRIATWLLYMSEVQAGGATVFT---------------DIGASVSPKKGSAVFWYNLHPS 488
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 489 GDGDYRTRHAACPVLLGNKWVSNKWIHERGQ 519
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 481
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 482 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 179 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 236
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 237 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 292
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 293 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 335
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 336 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 367
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 159
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 160 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 215
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 216 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 258
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 259 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 290
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 481
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 482 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 481
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 482 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 79/144 (54%), Gaps = 18/144 (12%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP---QMSQRLASF 199
+++ + +I+ T L E + Y +G +Y+ H+D +++G ++ R+A+F
Sbjct: 50 VIKRVCQRISDVTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEVGNRIATF 109
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
L Y+S+VE+GG T+F G+ V+P +G + +Y+L P+G D +
Sbjct: 110 LTYMSNVEQGGSTVFLHP---------------GIAVRPIKGSAVFWYNLLPSGAGDERT 154
Query: 260 LHGSCPVIKGEKWVATKWIRDQEQ 283
H +CPV+ G KWV+ KWI +++Q
Sbjct: 155 RHAACPVLTGVKWVSNKWIHERDQ 178
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 95/208 (45%), Gaps = 25/208 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKP--SQLALRQGETVESTKGTRTSSGTFISASE 138
+P+ N S + + I A+ RL+P +Q G + S R S ++ E
Sbjct: 320 KPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSY---RISKNAWLYYWE 376
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQR 195
+ ++ ++ ++ AT L E V+ Y IG Y+ H+D E P R
Sbjct: 377 HR--LINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDR 434
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ L Y+SDVE GG T+FP +G +V P +G G +Y+L +G
Sbjct: 435 IATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNLLKSGEG 479
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D + H CPV+ G KWV+ WI ++ Q
Sbjct: 480 DMLTEHAGCPVLVGSKWVSNMWIHERGQ 507
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 318 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 376
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 377 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 433
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L
Sbjct: 434 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 478
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 479 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 511
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 83/156 (53%), Gaps = 29/156 (18%)
Query: 135 SASEDKTGILELIEHKIARATML-PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----- 188
SAS DK + +++ AT L + + E F V Y IG Y+ H+D F+ +Y
Sbjct: 8 SASADK------LSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFD-FSKVKYFTNPV 60
Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
QM R+A+F++YL+DVE GG T+FP N L ++P + + ++
Sbjct: 61 LNEQMGDRIATFMIYLNDVEAGGRTVFPRLN---------------LVIEPIKNSAVFWH 105
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L +G D ++HG+CPV+ G KWVA KWI + Q
Sbjct: 106 NLLDDGQQDDRTIHGACPVVLGRKWVANKWIHEYGQ 141
>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
Length = 195
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 98/218 (44%), Gaps = 39/218 (17%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + + +CQ +I +KK ++P+Q GE R S T++
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
G++ + IA A LP H E RYE+G K+D+H D + N E
Sbjct: 59 YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G QRL + +LYL+ V GGET FP + L V P G L+F +
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGKLLVFEN 158
Query: 249 LFPNGTID--RTSLHGSCPVIKGEKWVATKWIRDQEQH 284
GT + SLH C V +GEKW+AT W R++ Q+
Sbjct: 159 C-KRGTNEPHPLSLHEGCAVHEGEKWIATLWFREKPQY 195
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 319 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 377
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 378 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 434
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L
Sbjct: 435 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 479
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 480 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 512
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 99/216 (45%), Gaps = 20/216 (9%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ I + +PR + + + S E+ ++I A+ R + + + ++ E ++ R +
Sbjct: 334 IQPIKMEEALLKPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSR-YRIA 392
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
++ E + I ++ T L E V Y IG Y+ HYD E
Sbjct: 393 KSAWLKHEEHD--YVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQ 450
Query: 190 PQMSQ--RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
R+A++L Y+SDVE GG T+FP N L + P++G ++
Sbjct: 451 QDFGWGGRIATWLFYMSDVEAGGATVFPKLN---------------LSLWPQKGSAAFWF 495
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L+PNG + + H CPV+ G KWVA WI ++ Q
Sbjct: 496 NLYPNGEGNEMTQHAGCPVLTGSKWVANYWIHERGQ 531
>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
Length = 421
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 104/252 (41%), Gaps = 44/252 (17%)
Query: 60 LPHGETGDASIQSIP-------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
P E GD +P +V S PR L +F + E+C +I++AK + S +
Sbjct: 172 FPTCEVGDTFDIPLPSANRNSIAKVRSISPRVLEVEDFLTPEECHELISSAKPLMSRSTV 231
Query: 113 ALRQGETVESTKGTRTSSGTFISASE--------DKTGILELI-----EHKIARATMLPQ 159
+ V + +RTSS ++ D+ L I EH +
Sbjct: 232 SAEGDSAVSLQESSRTSSTAWLPPHSHTLANKLYDRVSSLVGIDFRKHEHVVVEDLQAID 291
Query: 160 THG----EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGE 211
G A+ VLRYE+ Q Y H+D F+P + + R + YL+DVE G
Sbjct: 292 KRGGSSVTAWQVLRYEVNQHYHIHHDYFDPVLHRGFLQGDGRNRFITAFFYLTDVERGDP 351
Query: 212 TMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDGLLFYSLFPNGT----IDRTSLHGSCPV 266
DY C GL+V P+RG ++FYSL +G +D S HG C V
Sbjct: 352 RPIT-----------DYSDCNRGLRVPPKRGKAIIFYSLLADGQRSGGLDVASWHGGCDV 400
Query: 267 IKGEKWVATKWI 278
G KW A WI
Sbjct: 401 HNGTKWAANYWI 412
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 95/194 (48%), Gaps = 30/194 (15%)
Query: 99 IIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASEDKTGILELIEHKIARATML 157
++ TA R+ + + +T + +T R S +++A E +++ I +I T L
Sbjct: 338 VLETAHYRISKRRATVHDPQTGKLTTAQYRVSKSAWLAAYEHP--VVDRINQRIEDITGL 395
Query: 158 PQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMSQRLASFLLYLSDVEEG 209
E V Y +G +Y+ H+D AF G R+A++L Y+SDV G
Sbjct: 396 NVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG----NRIATWLFYMSDVAAG 451
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T+FP +G VKP +G + +Y+LFP+G D ++ H +CPV+ G
Sbjct: 452 GATVFPE---------------VGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVG 496
Query: 270 EKWVATKWIRDQEQ 283
KWV+ KWI ++ Q
Sbjct: 497 NKWVSNKWIHERGQ 510
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 533
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 102/221 (46%), Gaps = 24/221 (10%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
GETG +Q I + + +P + + +IA AK RL+ S+ +
Sbjct: 307 GETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363
Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
RTSS T++ + + + ++ + T+ + E + + Y IG Y H+
Sbjct: 364 GPPPRTSSNTWLDDDDAPVAARVNQYLQSLLGLGTLYGKDEAEKYQLANYGIGGHYVPHH 423
Query: 181 D----AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 236
D + ++ R+A+ ++Y+SDVEEGG T+FP +G++V
Sbjct: 424 DYLEESLTSSKKHRLFGDRVATLMIYMSDVEEGGATVFP---------------SLGVRV 468
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
PR+GD + ++++ + D + H CPV+ G KW+A KW
Sbjct: 469 SPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLYGSKWIANKW 509
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + N S + + + AK RL+ + ++ +E T R S ++ A E
Sbjct: 395 PHIVRYHNIVSEKDMEKVKELAKPRLRRATISNPVTGVLE-TAHYRISKSAWLGAYEHP- 452
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 193
+++ I I T L E V Y +G +Y+ H+D AF G
Sbjct: 453 -VVDKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTG---- 507
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
R+A++LLY++DV+ GG T+F IG VKP++G + +Y+L+P+G
Sbjct: 508 NRIATWLLYMTDVQAGGATVFT---------------DIGAAVKPKKGTAVFWYNLYPSG 552
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 553 EGDYRTRHAACPVLLGNKWVSNKWIHERGQ 582
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL+ + + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLR--RATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 78/164 (47%), Gaps = 24/164 (14%)
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS + +D + + IE +IAR P HGE VLRY G +Y HYD
Sbjct: 2 VHAARTSDSMCLRVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDY 59
Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
F+P G + QR+AS ++YL+ E GG T FP + L V
Sbjct: 60 FDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVA 104
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+G+ + F P+ SLH PV+ GEKWVATKW+R++
Sbjct: 105 AVKGNAVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLRER 146
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ + V RT+ G ++ ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTD 486
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 193
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 103/209 (49%), Gaps = 23/209 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ LS PRA NF + + I+ +K+ +++ T TRTSS T+++
Sbjct: 2 KALSCAPRAFQVENFLTDVEADHIVGLVQKKND-----MQRSSTNGHISETRTSSTTWLA 56
Query: 136 ASEDKTGILELIEHKIARA-----TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
D +++ I ++A ML + E ++ Y +GQ+Y +H+D F + P
Sbjct: 57 RHSDP--VIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHD-FGYPKGDP 113
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
R +F +YL+DV GG+T FP + N + L V P++G ++FY +
Sbjct: 114 GSPSRSINFCMYLNDVPAGGQTSFPRWRNA---------ETNGALNVVPKKGTAMIFYMV 164
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
P+G +D + H + PVI+GEK+ + WI
Sbjct: 165 NPDGNLDDLTHHAALPVIEGEKFFSNLWI 193
>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
Length = 201
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 107/217 (49%), Gaps = 25/217 (11%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P Q+L+ +P +P+ S+E CQS+I A+ +L P+ + + G V R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSHV---RISELAW 60
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
+ ++ +++ I +IA P + E V Y G K+++H D ++ E +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFL 118
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
QRL + +LYL+DV GGET FP + ++V P G L+F +
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163
Query: 250 FPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
P+ +I D SLHGS + GEKW+ T W ++ Q++
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 96/212 (45%), Gaps = 29/212 (13%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P + + SA + ++ A + +K +++ Q E +T RT+ G ++ ++
Sbjct: 325 KPYVVLYHEVLSAREISMLMGKAAQNMKNTRV---QSEKAVNTNRERTAKGYWLKKESNE 381
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQ 191
+ I +I T E F V+ Y IG Y H+D F A +
Sbjct: 382 --MTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIV 439
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
+ R+A+ L YL+DVE+GG T+F +G V P+ G + +Y+L
Sbjct: 440 LGDRIATVLFYLTDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDT 484
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D + H SCPV+ G KWV T+WI + Q
Sbjct: 485 DGNGDPLTRHASCPVVVGSKWVMTEWIHEARQ 516
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 103/213 (48%), Gaps = 21/213 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S + I +++K ++ PS+ + E K + S
Sbjct: 301 IKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPSE-TVNAANEFEVAKFRTSKSVW 359
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 360 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 416
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L
Sbjct: 417 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 461
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 462 TEGLLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 494
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ + V RT+ G ++ ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTD 486
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
Length = 201
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 107/217 (49%), Gaps = 25/217 (11%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P Q+L+ +P +P+ S+E CQS+I A+ +L P+ + + G V R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSHV---RISELAW 60
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
+ ++ +++ I +IA P + E V Y G K+++H D ++ E +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFL 118
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
QRL + +LYL+DV GGET FP + ++V P G L+F +
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163
Query: 250 FPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
P+ +I D SLHGS + GEKW+ T W ++ Q++
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 23/165 (13%)
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS TFI + K +L I+ ++A T L + E Y IG Y H D F
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
+ P+M R+A+ L YLSDV +GG T FP + +KP
Sbjct: 431 QTTFDAGLVSSPEMGNRIAAVLFYLSDVAQGGGTAFP---------------QLRTLLKP 475
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
++ +++L +G D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 476 KKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIRENDQ 520
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP + + + S + + + AK RL+ + ++ +E T R S +++ +D
Sbjct: 415 RPYIVRYLDIISDAEIERVKQLAKPRLRRATISNPITGVLE-TASYRISKSAWLTEYDDP 473
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 474 --MIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 528
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 529 -NRIATWLFYMSDVSAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFAS 572
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 573 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 603
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 23/165 (13%)
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS TFI + K +L I+ ++A T L + E Y IG Y H D F
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
+ P+M R+A+ L YLSDV +GG T FP + +KP
Sbjct: 431 QTTFDAGLVSSPEMGNRIATVLFYLSDVAQGGGTAFP---------------QLRTLLKP 475
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
++ +++L +G D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 476 KKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIRENDQ 520
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/231 (30%), Positives = 108/231 (46%), Gaps = 35/231 (15%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +VLS PR + FP+F + +C+ + ++++L +++ L G RT+
Sbjct: 46 LQPFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYL-GGPPEGGFSLRRTN 104
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH--YDAFNPAE 187
++S +D +L + +IA AT L T E + V Y +G Y H Y F A+
Sbjct: 105 KVAWMS--DDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQ 162
Query: 188 YGPQMS--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
S RLA+ L+YL+DV GG T F + L VKP G L
Sbjct: 163 GDIYKSSGNRLATMLIYLADVAGGGATAF---------------INMRLAVKPTLGTALF 207
Query: 246 FYSLFP-NGTI------------DRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L P +G I D + H CPV+ G KW+ TKWI ++EQ
Sbjct: 208 WYNLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKWIVTKWIHEREQ 258
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 94/209 (44%), Gaps = 26/209 (12%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LS P + + + + I R+ + + L TV + RTS TFI+ +
Sbjct: 324 LSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVTLTNQSTVSNV---RTSQITFIAKT 380
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQ 191
E + +L+ I+ ++A T L + E Y IG Y H D F + +
Sbjct: 381 EHE--VLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTE 438
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
M R+A+ L YLSDV +GG T FP+ + ++P++ +++L
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPY---------------LKQHLRPKKYAAAFWHNLHA 483
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
G D + HG+CP+I G KWV +WIR+
Sbjct: 484 AGRGDARTQHGACPIIAGSKWVLNRWIRE 512
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 294
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 96/202 (47%), Gaps = 31/202 (15%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
+F S +C ++IA A + S + GE ES RTSS F+ A ED L +
Sbjct: 108 DFFSGPECDALIALAGNYMIVSPVVGAGAGEVSES----RTSSSCFL-ARED----LPTV 158
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLY 202
HK+ T P H E V RY QKY +H+DAF + + QR+ + L+Y
Sbjct: 159 CHKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAFDLNTEDGRRFAQNGGQRVCTVLVY 218
Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
L+DV GG T FP +G+KV+PR+G ++F+ +G +D LH
Sbjct: 219 LNDVPSGGCTAFPQ---------------LGMKVQPRKGMAVVFFPATLDGVLDSRLLHA 263
Query: 263 SCPVIKGEKWVATKWIRDQEQH 284
+ P I KWV+ WIR H
Sbjct: 264 AEPAID-TKWVSQIWIRQGAYH 284
>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
Length = 508
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 112/223 (50%), Gaps = 36/223 (16%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ +S P + + + + Q +IA A+ RL+P+++ + + E+ R++ G
Sbjct: 295 PFKMEEISLEPYIVVYHDILPDKDMQQLIALAEPRLRPTEVF--EEDKSEARTSDRSALG 352
Query: 132 TFISASE-DKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE- 187
TF+ + + +G +L+ + ++ T + H FN+++Y G +Y +++D FN
Sbjct: 353 TFLPFKDMNPSGGPLLDRLTQRMRDITGIQIRHENTFNIIKYGFGSQYATNFDFFNGTNS 412
Query: 188 ----YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
YG R+A+ L YL+D GG T+FP I +KV RG
Sbjct: 413 EMEGYG----DRMATVLFYLNDAPNGGATVFP---------------RIDVKVTAERGKV 453
Query: 244 LLFYSLFPNG---TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +++L NG ++ +LH +CPV +G KWV WI + +Q
Sbjct: 454 LFWHNL--NGETHDVEPNTLHAACPVFQGSKWVMAAWIHEYDQ 494
>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
Length = 496
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 94/199 (47%), Gaps = 32/199 (16%)
Query: 84 ALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS--EDKT 141
ALY ++AEQ ++ L SQL ++G + + TF SAS + T
Sbjct: 312 ALYHEVVSAAEQRHLML------LSESQLQRQRGHQYDKIR-------TFASASVAANAT 358
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ--RLASF 199
+E + ++ T L E +L Y IG +Y H D P + + RLA+
Sbjct: 359 PTVEQLHRRLEDITGLDLAESEPLRILNYGIGGQYYIHVDCEQPQTHVEPYPKEYRLATV 418
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
LLYLSDV GG T FP +GL ++P RG L++++ G D +
Sbjct: 419 LLYLSDVRLGGFTSFP---------------ALGLGIRPNRGSALVWHNANNAGNCDYRA 463
Query: 260 LHGSCPVIKGEKWVATKWI 278
LH +CPV+ G +WVA+KWI
Sbjct: 464 LHAACPVLLGTRWVASKWI 482
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 95/209 (45%), Gaps = 24/209 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 269 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 327
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 328 --VAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 385
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 386 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 430
Query: 258 TSLHGSCPVIKGEKWVATKWI--RDQEQH 284
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 431 KTRHAACPVLTGSKWVANKWLHERGQEFH 459
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 83/173 (47%), Gaps = 28/173 (16%)
Query: 119 TVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
TV KG+ RTS TFI + K +L+ I+ ++A + L + E Y IG
Sbjct: 359 TVIGAKGSEVSKVRTSQFTFIPKTRHK--VLQTIDQRVADMSNLNMDYAELHQFANYGIG 416
Query: 174 QKYDSHYDAF------NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD 227
Y H D F N P+M R+A+ L YLSDV +GG T FP +
Sbjct: 417 GHYAQHNDWFGQDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQL------- 469
Query: 228 YKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
++P++ +++L +G D +LHG+CP+I G KWV +WIR+
Sbjct: 470 --------LQPKKYAAAFWHNLHASGVGDLRTLHGACPIIAGSKWVQNRWIRE 514
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 26/208 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + + + I + RLK + + G RTS TFI S K
Sbjct: 302 PLLVLYHDVIYQSEIDVIRKLTENRLKRATVT---GHNESVVSNVRTSQFTFIPVSAHK- 357
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQMSQR 195
+L I+ ++A T L + E Y IG Y H D F P+M R
Sbjct: 358 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNR 416
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ L YLSDV +GG T FP + +KP++ +++L +G
Sbjct: 417 IATVLFYLSDVSQGGGTAFP---------------QLRTLLKPKKYAAAFWHNLHASGVG 461
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 462 DVRTQHGACPIIAGSKWVQNRWIREVDQ 489
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 20/160 (12%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTS G + S+ T + + +A + L + E + Y IG Y+ H+D+F
Sbjct: 372 RTSQGASFNYSQYATT--QRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEH 429
Query: 187 EYGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
P+ RLA+ + YLSDV GG T FPF + L V P RG
Sbjct: 430 HEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGSL 474
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 475 LFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 99/219 (45%), Gaps = 21/219 (9%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-R 127
S + + + L P + N S + + A+ ++ SQ+ + E+ R
Sbjct: 316 SYEVLELEELHQDPFVVQVHNIVSQKDMNLLQKIARPNIQRSQVYAQDHNANETVAAAYR 375
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPA 186
TS G E ++ +EL+ +A + L E + Y IG Y+ H+D F +
Sbjct: 376 TSKGATFEYFEHRS--MELLSRHVADLSGLDMNSAELLQIANYGIGGHYEPHWDCFPDHH 433
Query: 187 EYGP--QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y P + R+A+ + YLS+VE GG T FPF + L V P RG +
Sbjct: 434 VYLPDDRDGNRIATGIYYLSEVEAGGGTAFPF---------------LPLLVTPERGSLV 478
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L +G D + H +CPV++G KW+A WIR Q
Sbjct: 479 FWYNLHRSGDQDYRTKHAACPVLQGSKWIANVWIRQSNQ 517
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 20/160 (12%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTS G + S+ T + + +A + L + E + Y IG Y+ H+D+F
Sbjct: 372 RTSQGASFNYSQYATT--QRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEH 429
Query: 187 EYGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
P+ RLA+ + YLSDV GG T FPF + L V P RG
Sbjct: 430 HEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGSL 474
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 475 LFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 109/246 (44%), Gaps = 24/246 (9%)
Query: 43 RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
R +L+ +++ L G +S Q + L P + + SAE+ +
Sbjct: 292 REELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQL 351
Query: 103 AKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
A+ L+ S + +L E + + R S GTF E I++ + + + L
Sbjct: 352 ARPELQRSMVYSLSNSEHI--STNFRISQGTFFEYHEHP--IMQRMSQHLENISGLDMRS 407
Query: 162 GEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM---SQRLASFLLYLSDVEEGGETMFPFE 217
E V Y IG Y+ H D+F+ YG + R+A+ + YLS+VE GG T FPF
Sbjct: 408 AEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF- 466
Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
+ L V+P RG L +Y+L +G +D + H CPV+ G KW+A W
Sbjct: 467 --------------LPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVW 512
Query: 278 IRDQEQ 283
IR Q
Sbjct: 513 IRLSNQ 518
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + + + + I A+ RLK + + + +E R S ++ ED
Sbjct: 44 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHEDV- 101
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
++ + ++ T L E V+ Y +G YD HYD E S R+A
Sbjct: 102 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 160
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDV +GG T+FP+ +G+ ++P +G ++++L+P+G D
Sbjct: 161 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 205
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV++G KWV KW+ + Q
Sbjct: 206 RTRHAACPVLQGSKWVCNKWLHEAGQ 231
>gi|228999322|ref|ZP_04158902.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
gi|229006877|ref|ZP_04164509.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228754370|gb|EEM03783.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228760519|gb|EEM09485.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
Length = 195
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 97/218 (44%), Gaps = 39/218 (17%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + + +CQ +I +KK ++P+Q GE R S T++
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
G++ + IA A LP H E RYE+G K+D+H D + N E
Sbjct: 59 YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G QRL + +LYL+ V GGET FP + L V P G L+F +
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGKLLVFEN 158
Query: 249 LFPNGTID--RTSLHGSCPVIKGEKWVATKWIRDQEQH 284
GT + SLH C V +GEKW+ T W R++ Q+
Sbjct: 159 C-KRGTNEPHPLSLHEGCAVHEGEKWIVTLWFREKPQY 195
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/221 (29%), Positives = 100/221 (45%), Gaps = 24/221 (10%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGT 126
+S Q + L P + + SAE+ + A+ L+ S + +L E + +
Sbjct: 19 SSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHISTN--F 76
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
R S GTF E I++ + + + L E V Y IG Y+ H D+F+
Sbjct: 77 RISQGTFFEYHEHP--IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSEN 134
Query: 187 E-YGPQM---SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
YG + R+A+ + YLS+VE GG T FPF + L V+P RG
Sbjct: 135 HNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGS 179
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +Y+L +G +D + H CPV+ G KW+A WIR Q
Sbjct: 180 LLFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQ 220
>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
Length = 422
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 91/198 (45%), Gaps = 30/198 (15%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE-----L 146
SA++C+ ++ A+ L+ SQ+ + T RTS G + ILE
Sbjct: 242 SADECRLLMLLARPHLRASQVVDPNDASTHRTP-IRTSRGATLDP------ILEDFAARA 294
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLASFLLYL 203
+ ++A LP TH EA +VL Y G+ Y +H D P P RL + +YL
Sbjct: 295 AQARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACVYL 354
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+DV+ GGET FP G++V+PR G + F +L +G D SLH
Sbjct: 355 NDVDAGGETEFPVA---------------GIRVQPRAGSVVCFDNLQADGCPDPDSLHAG 399
Query: 264 CPVIKGEKWVATKWIRDQ 281
PV G KW+ T W R Q
Sbjct: 400 LPVTTGSKWLGTLWFRQQ 417
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 103/207 (49%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
PR + + + S E+ + I AK RL ++ +R +T V + R S ++ +D
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTVANYRVSKSAWLEEYDDP 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ T L + E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 --VIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTEGNRLA 450
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
++L Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G D
Sbjct: 451 TYLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFRSGEGDY 495
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQH 284
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 RTRHAACPVLVGSKWVSNKWFHERGQE 522
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 101/194 (52%), Gaps = 25/194 (12%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F ++CQ++I ++ +PS + + + RTSS + +D ++ I+
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTIT-----SENPDQQFRTSSTCHLGNMQDP--VIRKID 155
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSD 205
+I + + ++ E Y++GQ++ H D F P A YG QR +F++YL++
Sbjct: 156 LQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNE 215
Query: 206 VEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 265
VE+GG+T+FP + IG K K +G +++ ++ P+G+++ +LH P
Sbjct: 216 VEQGGDTVFP-------------ELAIGFKAK--KGMAVIWNNINPDGSVNYQTLHQGMP 260
Query: 266 VIKGEKWVATKWIR 279
V KGEK + TKW R
Sbjct: 261 VQKGEKLIITKWFR 274
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 102/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + H++ T L E V Y +G +Y+ H+D + P + G + R+A+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVAT 456
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 457 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 501
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 502 TRHAACPVLVGCKWVSNKWFHERGQ 526
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + + + + I A+ RLK + + + +E R S ++ ED
Sbjct: 329 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFA-DYRISKSAWLKEHEDV- 386
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
++ + ++ T L E V+ Y +G YD HYD E S R+A
Sbjct: 387 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 445
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDV +GG T+FP+ +G+ ++P +G ++++L+P+G D
Sbjct: 446 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 490
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV++G KWV KW+ + Q
Sbjct: 491 RTRHAACPVLQGSKWVCNKWLHEAGQ 516
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 83/154 (53%), Gaps = 18/154 (11%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
R S ++ + T +E +I+R T L + E + Y IG +Y+ HYD ++
Sbjct: 367 RVSKSAWLKDEDSDT--VEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYD-YSRR 423
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
E+ ++R+A++L YL+ VE+GG T+F +GL ++ +G + +
Sbjct: 424 EWDIYNNRRIATWLSYLTTVEQGGGTVFT---------------ELGLHIRSIKGSAVFW 468
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
Y+L PNG+ D + H +CPV++G KWV+ KWI +
Sbjct: 469 YNLLPNGSGDERTRHAACPVLRGNKWVSNKWIHE 502
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 100/216 (46%), Gaps = 24/216 (11%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V RP F + + + +I A+ R K + + +E + R S
Sbjct: 322 PFKVEEAHHRPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQ-YRISKS 380
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E K + + +++ T L + E V+ Y IG Y+ H+D E
Sbjct: 381 AWLKEEEHKH--IADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAF 438
Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
S R+A+ L Y+SDVE+GG T+FP I + + P++G +Y
Sbjct: 439 KSLGTGNRIATVLFYMSDVEQGGATVFP---------------SIQVSLWPQKGSAAFWY 483
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L P+G D+ + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQ 519
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + + + + I A+ RLK + + + +E R S ++ ED
Sbjct: 347 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFA-DYRISKSAWLKEHEDV- 404
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
++ + ++ T L E V+ Y +G YD HYD E S R+A
Sbjct: 405 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 463
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDV +GG T+FP+ +G+ ++P +G ++++L+P+G D
Sbjct: 464 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 508
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV++G KWV KW+ + Q
Sbjct: 509 RTRHAACPVLQGSKWVCNKWLHEAGQ 534
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 343 PRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 401
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T L E V+ Y IG Y+ H+D E S R+A
Sbjct: 402 --VAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 459
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + PR+G +++L PNG D
Sbjct: 460 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWFNLKPNGEGDL 504
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 505 RTRHAACPVLTGSKWVANKWLHERGQ 530
>gi|156405954|ref|XP_001640996.1| predicted protein [Nematostella vectensis]
gi|156228133|gb|EDO48933.1| predicted protein [Nematostella vectensis]
Length = 182
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 89/179 (49%), Gaps = 27/179 (15%)
Query: 123 TKGTRT--SSGTFISASED-KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
TK T + SS ++ ED K IL I + + E + +Y++GQKY H
Sbjct: 1 TKETESGFSSSLYLKNKEDSKITILRDIAQLAGKLSNTQWRFAEPVALTKYKVGQKYSLH 60
Query: 180 YDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGI-------------FL 222
YD+ G M+Q R A+FL+YL+DV+ GGET+FP I L
Sbjct: 61 YDS------GFLMNQRRVKRTATFLVYLNDVKSGGETIFPLATNISSIQLKKENVDKPSL 114
Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
DS + + +KV P LLF++ +D SLHGSCPV+ GEKW+A W+ ++
Sbjct: 115 DSICGKENNM-VKVSPEAQSCLLFWNHVDGDDVDAFSLHGSCPVVSGEKWIAQIWLHNE 172
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 93/214 (43%), Gaps = 37/214 (17%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P+ N + + + I A+ RL+ ++ VES T G S K
Sbjct: 321 KPKLWVLHNILTDPEMEVIKKLAQPRLRRAR--------VESPT---TGEGELASYRISK 369
Query: 141 TGILELIEHKIAR--------ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YG 189
+ L EH++ R T L E V+ Y IG Y+ H+D E
Sbjct: 370 SAWLYDWEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALD 429
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
P R+A+ L Y+SDVE GG T+FP +G +V P +G G +Y+L
Sbjct: 430 PNEGDRIATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNL 474
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D + H CPV+ G KWV+ KWI ++ Q
Sbjct: 475 LKSGEGDMLTEHAGCPVLVGSKWVSNKWIHERGQ 508
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ + +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 77/158 (48%), Gaps = 21/158 (13%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFN 184
R S ++ +D+ I+ + +I T L T E VL Y +G +Y+ H+D
Sbjct: 126 RISQQAWLHDKDDE--IVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYMT 183
Query: 185 PAE--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
E +G + R+A+FL+YLSDV GG T+FP N + V +
Sbjct: 184 AEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVAN---------------VTVPVVKNA 228
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
GLLF L +G D SLH CPV+ G KW+A KWI +
Sbjct: 229 GLLFMDLLRSGRGDVNSLHAGCPVVIGSKWIANKWIHE 266
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 73/143 (51%), Gaps = 17/143 (11%)
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--SQR 195
++++ ++ + ++A T L T E V+ Y IG YD H+D E + R
Sbjct: 75 DEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAFEKFNGNR 134
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
+A+ L Y+SDV +GG T+F +GL V PRRG + + +L P+G
Sbjct: 135 IATVLFYMSDVAQGGATVF---------------TELGLSVFPRRGSAVFWLNLHPSGEG 179
Query: 256 DRTSLHGSCPVIKGEKWVATKWI 278
D + H +CPV++G KWV KWI
Sbjct: 180 DLATRHAACPVLRGSKWVCNKWI 202
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 99/205 (48%), Gaps = 26/205 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P NF S ++C++ + K +++ +++ + E+ +RT+ ++ S +
Sbjct: 17 PIVYVVNNFLSDDECEAFVEMGKGKMERAKV-ISDDES--EFHASRTNDFCWLEHS--AS 71
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRL 196
++ + + + +P + E F ++ Y G +Y H+DAF+ Q + QR+
Sbjct: 72 DVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGGQRM 131
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-I 255
+ L YL+DVEEGG T FP I + VKP +GD ++F++ T I
Sbjct: 132 VTALAYLNDVEEGGATDFP---------------KINVSVKPNKGDVVVFHNCIEGTTEI 176
Query: 256 DRTSLHGSCPVIKGEKWVATKWIRD 280
+ +LHG PV+ GEKW W R+
Sbjct: 177 NPQALHGGSPVVAGEKWAVNLWFRE 201
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 90/204 (44%), Gaps = 22/204 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
PR + SA++C+ ++ A+ L+ S++ + + RTS G T ED
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLA 197
+ ++A LP H E +VL Y G++Y +H D P P R
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ +YL+DV GGET FP G++V+PR G + F +L +G D
Sbjct: 345 TVCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDA 389
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
SLH PV G KW+ T W R Q
Sbjct: 390 DSLHAGLPVTAGSKWLGTLWFRQQ 413
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 90/204 (44%), Gaps = 22/204 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
PR + SA++C+ ++ A+ L+ S++ + + RTS G T ED
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLA 197
+ ++A LP H E +VL Y G++Y +H D P P R
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ +YL+DV GGET FP G++V+PR G + F +L +G D
Sbjct: 345 TVCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDA 389
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
SLH PV G KW+ T W R Q
Sbjct: 390 DSLHAGLPVTAGSKWLGTLWFRQQ 413
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 98/187 (52%), Gaps = 24/187 (12%)
Query: 95 QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
+C +++ + ++ S LA T G R SS I ED ++ IE +I+
Sbjct: 2 ECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI---EDI--VVSKIEDRISLW 47
Query: 155 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
+ LP+ +GE+ VL+Y + + + + RLA+ L+YLSDV++GGET+F
Sbjct: 48 SFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVF 102
Query: 215 PFENGIFLDSGYDY---KKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEK 271
P D+ +C G V+P +G+ +L ++L P+G D+ S + CPV++GEK
Sbjct: 103 PRSE--MKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLEGEK 160
Query: 272 WVATKWI 278
W+A K I
Sbjct: 161 WLAIKHI 167
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
++ AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T FPF N VKP+ G+ L +Y++ + +D + H CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKG 498
Query: 270 EKWVATKWIRDQEQ 283
KW+ WI + Q
Sbjct: 499 SKWIGNVWIHEATQ 512
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
++ AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T FPF N VKP+ G+ L +Y++ + +D + H CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKG 498
Query: 270 EKWVATKWIRDQEQ 283
KW+ WI + Q
Sbjct: 499 SKWIGNVWIHEATQ 512
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 16/134 (11%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-QMSQRLASFLLYLSDVEEG 209
++ AT L T+ E V Y +G Y+ H+D F +++ P + R+A+ + YLSDVE+G
Sbjct: 398 LSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFYLSDVEQG 457
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T FPF N V+P+ G+ L +Y+L + +D + H CPV+KG
Sbjct: 458 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKG 502
Query: 270 EKWVATKWIRDQEQ 283
KW+A WI + Q
Sbjct: 503 SKWIANIWIHEATQ 516
>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
Length = 210
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 94/212 (44%), Gaps = 29/212 (13%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+LS P Y N + ++C II +LKP AL G + RT + ++S
Sbjct: 4 HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKP---ALVSGNSRGFLSTGRTGTNCWLS 60
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
D+ I I KI P + E F VL Y QKY+ HYDAF P + +
Sbjct: 61 HKNDE--ITFNIALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAF-PIDNSEKAKRC 117
Query: 194 -----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
QRL + L+YL++V +GGET F K + +K+ P+ G L+F +
Sbjct: 118 LKKGGQRLLTALIYLNNVTKGGETEF---------------KNLNIKITPKIGRILVFEN 162
Query: 249 LFPNGTIDR-TSLHGSCPVIKGEKWVATKWIR 279
N SLH VI+GEK+V W R
Sbjct: 163 TLQNSLNKHPDSLHSGKQVIEGEKYVINLWFR 194
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 95/209 (45%), Gaps = 24/209 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 208 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 267 --VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + PR+G +++L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWHNLKPNGEGDF 369
Query: 258 TSLHGSCPVIKGEKWVATKWI--RDQEQH 284
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQEFH 398
>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
Length = 543
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 99/214 (46%), Gaps = 21/214 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
I ++LS P L + ++ I A++K+ L S++ + E RTS
Sbjct: 324 IKTEILSLDPFVLLLHDMVRQKESTLIRASSKEHLLQSEITNTDASSSEDNVAIFRTSKS 383
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYG 189
+ S+ + T + I ++A AT L E F V+ Y +G + +H D + +
Sbjct: 384 VWYSSDFNDTT--KKITERLADATGLDMHFTEYFQVINYGLGGFFATHLDMLLSDKTRFN 441
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
S R+A+ + YL+ V +GG T FP N L V P+ G L +Y+L
Sbjct: 442 GT-SDRIATTVFYLNGVRQGGATHFPLLN---------------LTVFPQPGSALFWYNL 485
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G R+++H CPVI G KWV TKW+ DQ Q
Sbjct: 486 DTKGNDQRSTMHTGCPVIVGSKWVMTKWVGDQGQ 519
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 103/220 (46%), Gaps = 25/220 (11%)
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
T A I + + LS P + F + + ++ + LK S + Q V ++K
Sbjct: 286 TAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQ--LKASLVGQYQYSPVRTSK 343
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
F+ ++ T +++ + ++ T L +A ++ Y +G YD HYD+ N
Sbjct: 344 EQH-----FVEYND--TAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHN 396
Query: 185 PAEYGPQ-MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
+E + R+A+ L Y+ +V+ GG T FP+ I + V P++G
Sbjct: 397 YSEANRLILGDRIATVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSA 441
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L+Y+L G ++ ++H CPVI G K+V TKWI + Q
Sbjct: 442 VLWYNLDNAGQMNPKAIHAGCPVIVGSKYVLTKWINEIPQ 481
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/222 (25%), Positives = 103/222 (46%), Gaps = 30/222 (13%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ + + +P + + N + ++ +++ A+ R K + + +E R S
Sbjct: 332 IQPVKMEEAFHKPLIVIYHNVINDDEIETVKKMAQPRFKRATVQNSVTGNLEPA-NYRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
++ SE+ + + + ++ T L E V+ Y IG Y+ H+D
Sbjct: 391 KSAWLK-SEEHDHVFK-VTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVN 448
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
AF +G R+A++L Y+S+VE GG T+FP + L + P++G
Sbjct: 449 AFKDLGWG----NRVATWLFYMSEVEAGGATVFP---------------KLNLALWPQKG 489
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L PNG + + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 490 SAAFWYNLHPNGEGNELTRHAACPVLTGSKWVSNKWIHERNQ 531
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 100/227 (44%), Gaps = 41/227 (18%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPS---------------QLALRQGETVESTKG 125
+P L F NF + + + I A RLK + +++ R+ G
Sbjct: 309 KPEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHPVTG 368
Query: 126 T------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
R S ++ ED+ +++ I +++ + L T E V+ Y IG Y+ H
Sbjct: 369 KLEFANYRISKSGWLRDEEDE--LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPH 426
Query: 180 YDAFNPAE---YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 236
YD E R+A+FL YLSDVE GG T+F +G V
Sbjct: 427 YDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFT---------------RVGATV 471
Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
P++GD +Y+L +G D ++ H +CPV+ G KWVA KWI + Q
Sbjct: 472 WPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIHEVGQ 518
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 106/213 (49%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLS +P + + NF + + + I A+ L+ S +A GE ++T R S ++
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA--SGEN-QATVEYRISKSAWLK 369
Query: 136 ASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
SE + I+ ++ +I+ T L + E V+ Y IG Y+ H+D A +P+ +
Sbjct: 370 GSE--SCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+F++YLS VE GG T F + N V + + +++L
Sbjct: 428 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVLKKAAIFWWNLH 472
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG D +LH CPV+ G+KWVA KW+ + Q
Sbjct: 473 RNGRGDAETLHAGCPVLIGDKWVANKWVHEYGQ 505
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEK-QLQVDYRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 494 FWWNLHRNGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V R L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 535
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 106/258 (41%), Gaps = 48/258 (18%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATA------KKRLKPSQLALRQGETVESTK 124
+ + + +S PR NF S E+ +I +L+ S + + +
Sbjct: 177 EPVIIESISESPRTFRLHNFFSGEEADKLIKRTLEIDDPSNKLQQSTVGANDNKNKKKKS 236
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--- 181
RTS F + SE I + + + + +LRY+ Q Y +H D
Sbjct: 237 KHRTSENAFDTVSEAAVDIRKRV-FDVLSLGEFQADMADGLQLLRYQQKQAYIAHEDYFP 295
Query: 182 -------AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF---------ENGIFLDSG 225
F+P + G S R A+ LYLSDV GG+T+FP E +S
Sbjct: 296 VGAAKDFNFDPHKGG---SNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNSA 352
Query: 226 YDY------------------KKC-IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPV 266
DY +KC L P +G +LFYS PNG +D SLHG CPV
Sbjct: 353 QDYEAIGAELFEPGSWEMDMVRKCSTKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCPV 412
Query: 267 IKGEKWVATKWIRDQEQH 284
++G KW A W+ ++ +H
Sbjct: 413 LEGTKWGANLWVWNRRRH 430
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 101/212 (47%), Gaps = 25/212 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
LS P + + N S + IA ++ +P ++ GE S K RT+ G +I
Sbjct: 322 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 377
Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
+++ I +I T L HG+ +++Y G YD+H+D N + Q +
Sbjct: 378 KNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 437
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF-P 251
R+A+ L YL+DV+ GG T+FP + LKV RG L++Y++
Sbjct: 438 GDRMATVLFYLNDVKHGGSTVFP---------------VLKLKVPSERGKVLVWYNMHGE 482
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+D +LHGSCPVI G K V + WI + +Q
Sbjct: 483 THDLDSRTLHGSCPVIDGAKTVLSCWIHEWDQ 514
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V R L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
Length = 229
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 103/220 (46%), Gaps = 25/220 (11%)
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
T A I + + LS P + F + + ++ + LK S + Q V ++K
Sbjct: 17 TAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQ--LKASLVGQYQYSPVRTSK 74
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
F+ ++ T +++ + ++ T L + ++ Y +G YD HYD+ N
Sbjct: 75 EQH-----FVEYND--TAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYDVHYDSHN 127
Query: 185 PAEYGPQ-MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
+E + R+A+ L Y+ +V+ GG T FP+ I + V P++G
Sbjct: 128 YSEANRLILGDRIATVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSA 172
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L+Y+L +G ++ ++H CPVI G K+V TKWI + Q
Sbjct: 173 VLWYNLDNSGQMNPKAIHAGCPVIVGSKYVLTKWINEIPQ 212
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 362 LQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVA--SGEKQLQVE-YRIS 418
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L H E V+ Y IG Y+ H+D A +P+
Sbjct: 419 KSAWLKDTADP--VLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPS 476
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 477 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 521
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 522 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 560
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 209
++ AT L T E V Y +G Y+ H+D F + + P R+A+ + YLSDVE+G
Sbjct: 394 VSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 453
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T FPF N V+P+ G+ L +Y+L + +D + H CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKG 498
Query: 270 EKWVATKWIRDQEQ 283
KW+A WI + Q
Sbjct: 499 SKWIANIWIHEATQ 512
>gi|302849869|ref|XP_002956463.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
nagariensis]
gi|300258161|gb|EFJ42400.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
nagariensis]
Length = 965
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 96/206 (46%), Gaps = 38/206 (18%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGT 132
+VL+ P + F SA +C I+ +A +K S + + G V+ T+ RTSS
Sbjct: 744 LRVLNIDPPVITVEGFLSAPECDGIVRSAADSGLMKQSGVGV-SGYQVKDTENVRTSSTL 802
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
+A +T A LPQ V RY+ GQ + +H DAF A G +
Sbjct: 803 AATAEPGQT------------AFELPQ-------VARYQPGQHFLTHEDAFPAAVVGSKG 843
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
QR A+ L+YL+D E+GG T F + + V+PR+G LLF+ F N
Sbjct: 844 YQRRATLLVYLNDCEQGGATKF---------------DILDIAVQPRKGTALLFFPAFAN 888
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
G DR +LH + + EKWV W+
Sbjct: 889 GMPDRRTLHTAQDAVS-EKWVTQLWL 913
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 109/225 (48%), Gaps = 31/225 (13%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
D ++ PF+V L+ P + + + + + + + +++ S L ++
Sbjct: 306 ADPFLRIAPFKVELLNRSPYVAAYYDVLNDSEIEELKLMSSPQIRRS-LLYNHTLDIDQA 364
Query: 124 KGTRTSSGTFISASEDKTGI--LELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSH 179
RTS+ F+ ++TGI LE I + A T L T E V+ Y +G +Y H
Sbjct: 365 DVDRTSNSVFM----EETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQYTPH 420
Query: 180 YDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
D F+ AE G RLA+ L YL+DV++GG T+FPF + L P
Sbjct: 421 CDYFDENAENG----DRLATVLFYLTDVQQGGATVFPF---------------LRLSYFP 461
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
++G L+F +L + D+ S H +CPV+ G KWVATKWI +Q
Sbjct: 462 KKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIYHFDQ 506
>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
Length = 173
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 72/141 (51%), Gaps = 22/141 (15%)
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYL 203
L+ IE +I+ + +P +GE FN G QR+A+ L+YL
Sbjct: 55 LQAIEKRISVYSQVPVENGELIQ----------------FNLKRGG----QRVATMLIYL 94
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
SD EGGET FP F G K GL V P +G+ +LF+S+ +G D S+HG
Sbjct: 95 SDNVEGGETYFPMAGSGFCRCGG--KSVRGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGG 152
Query: 264 CPVIKGEKWVATKWIRDQEQH 284
C V+ GEKW ATKW+R + H
Sbjct: 153 CEVLAGEKWSATKWMRQRSTH 173
>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 538
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 108/227 (47%), Gaps = 30/227 (13%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G+ G S+Q I + ++ +P + + + + ++A A+ RL+ S E V S
Sbjct: 306 GQDGFFSLQPIKLEEINLKPYIIVMHDVVQDKDIKDLMAYAEPRLERST-TYTGSEMVPS 364
Query: 123 TKGTRTSSGTFISASEDKTGIL--ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
RTSS +++ E + + + T EA+ + Y G ++ H+
Sbjct: 365 P--VRTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTNEEAEAYQLANYGTGGQFLPHH 422
Query: 181 D-------AFNP-AEYGPQ--MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
D ++N A+Y Q RLA+ ++Y++DVEEGG T+FP
Sbjct: 423 DFLQDSLHSYNSSADYYLQYGTGDRLATLMIYMTDVEEGGATVFP--------------- 467
Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
+G+++ P++GD +++L +G DR + H CPV+ G KW+A KW
Sbjct: 468 SLGIRLTPKKGDAAFWWNLKASGEGDRLTTHAGCPVLYGSKWIANKW 514
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/230 (26%), Positives = 105/230 (45%), Gaps = 46/230 (20%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPS---------------QLALRQGETVESTKG 125
RP + + + S ++ + + AK RL+ + +++ R+ + G
Sbjct: 373 RPYIVRYLDIISDKEIELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTG 432
Query: 126 TRTSSGTFISASEDKTG----ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
T++ +S S TG ++E I +I T L E V Y +G +Y+ H+D
Sbjct: 433 KLTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGGQYEPHFD 492
Query: 182 --------AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
AF G R+A++L Y+SDV GG T+FP +G
Sbjct: 493 FGRKDEPDAFKELGTG----NRIATWLFYMSDVAAGGATVFP---------------DVG 533
Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
V P++G + +Y+LF +G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 534 AAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 583
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 68/134 (50%), Gaps = 16/134 (11%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 209
+ AT L T+ E V Y +G Y+ H+D F + + P R+A+ + YLSDVE+G
Sbjct: 393 VGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 452
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T FPF N V+P+ G+ L +Y+L + D + H CPV+KG
Sbjct: 453 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKG 497
Query: 270 EKWVATKWIRDQEQ 283
KW+A WI + Q
Sbjct: 498 SKWIANIWIHEATQ 511
>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
pulchellus]
Length = 548
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 101/229 (44%), Gaps = 29/229 (12%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
+G G +Q + + + +P + F + ++A A RL S GE
Sbjct: 316 YGRNGFLRLQPVKIEEANLKPYIITFHDIIGDRDINDLLAYATPRLFRST---HYGEHGT 372
Query: 122 STKGTRTSSGTFISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
T RTSS ++ + T + +E + + + E + + Y +G +Y +H
Sbjct: 373 ETSLIRTSSTAWLGDQDAPVATRLNRFVESLLGLGSQYLKGEAEYYQLANYGVGGQYIAH 432
Query: 180 YDAFNPAEYGPQ---------MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
+D P R+A+ + YLSDVEEGG T+FP
Sbjct: 433 HDFLADIYADPNRKLDDFERSAGDRIATLMFYLSDVEEGGATVFPH-------------- 478
Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+G+++ P++G+ +++L +G ++ + HG CPV+ G KW+A KW R
Sbjct: 479 -LGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKWIANKWFR 526
>gi|451927223|gb|AGF85101.1| 4-hydroxylase [Moumouvirus goulette]
Length = 239
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 95/199 (47%), Gaps = 30/199 (15%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
NF + E+C I+ + +L S++ + K R S ++S + +++ +
Sbjct: 61 NFINKEKCGEIMNNTQSKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 203
KI++ +P + E V+RY GQ Y+ H+DA E+ + QR + L+YL
Sbjct: 112 QKISQQFNIPIQNAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLIYL 171
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-IDRTSLHG 262
++ EGG T F K +GLKVKP GD ++FY L N + SLH
Sbjct: 172 NNEFEGGHTFF---------------KNLGLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 263 SCPVIKGEKWVATKWIRDQ 281
PV GEKW+A W R++
Sbjct: 217 GMPVTNGEKWIANLWFRER 235
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 98/223 (43%), Gaps = 32/223 (14%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
I I ++L + P A+ F N S + + I A +LK + TV+++K
Sbjct: 313 IAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRA--------TVQNSKTGELE 364
Query: 130 SGTF-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
T+ IS S D +++ + +I T L Q E V Y +G YD H+D
Sbjct: 365 HATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFAR 424
Query: 185 PAEYGP----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
E R+A+ L Y+S E GG T+F +G V P +
Sbjct: 425 KEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSK 469
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D L +Y+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 470 NDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQ 512
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + N ++ ++I A+ R K + + + +E R S ++ E K
Sbjct: 208 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 267 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 369
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
++ T L T+ E V Y +G Y+ H+D F NP Y + R+A+ + YLS+VE+G
Sbjct: 396 LSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYYLSEVEQG 455
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T FPF N V+P+ G+ L +Y+L + +D + H CPV+KG
Sbjct: 456 GATAFPFLN---------------FAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKG 500
Query: 270 EKWVATKWIRDQEQ 283
KW+ WI + Q
Sbjct: 501 SKWIGNVWIHEVTQ 514
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +VL P + + +F S + Q I A L+ S +A GE + + R S
Sbjct: 349 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 405
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 406 KSAWLKDTVDP--MLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 463
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y R+A+F++YLS VE GG T F + N V + L
Sbjct: 464 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVVKNAAL 508
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 509 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 547
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 97/216 (44%), Gaps = 24/216 (11%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A++F + + E+ I A RL+ + + +E T RTS
Sbjct: 323 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 381
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + I+ I +I T L Q E V Y IG YD H+D E
Sbjct: 382 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 439
Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
S RLA+ L Y++ E GG T+F + V P + D L +Y
Sbjct: 440 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 484
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 485 NLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 520
>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 188
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 96/208 (46%), Gaps = 29/208 (13%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL+ P NF + +C+ +I A+ P+ + G+ +RTSS ++S
Sbjct: 1 VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVV---GKGAGEVSPSRTSSTCYLS- 56
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQ 191
ED ++ K++ T P H E V RY Q+Y HYDAF+ +
Sbjct: 57 REDLPDLMR----KVSSLTGKPIEHCELPQVGRYFPSQQYLQHYDAFDLGTEDGLRFAAN 112
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
QR + LLYL+DV GG T FP + L V+PR+G L+F+
Sbjct: 113 GGQRTITVLLYLNDVARGGATRFP---------------ALNLDVQPRQGMALVFFPATI 157
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
+G +DR +LH + P + K+V+ WIR
Sbjct: 158 DGMLDRMALHAAMPAVD-TKYVSQVWIR 184
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + N E+ ++I A+ R K + + + +E R S ++ E +
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 20/146 (13%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AFNPAEYGPQMSQRLA 197
+++ ++++I T L +A V Y IG YD HYD + +E + R+A
Sbjct: 393 VVDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIA 452
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+FLLY++DV+ GG T+FP I ++V P++G + +Y+L +G
Sbjct: 453 TFLLYMTDVDAGGATVFPI---------------IDVRVLPKKGTAVFWYNLRRSGKGIM 497
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KWIR + Q
Sbjct: 498 ETRHAACPVLVGTKWVSNKWIRTRGQ 523
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 97/216 (44%), Gaps = 24/216 (11%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A++F + + E+ I A RL+ + + +E T RTS
Sbjct: 322 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + I+ I +I T L Q E V Y IG YD H+D E
Sbjct: 381 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438
Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
S RLA+ L Y++ E GG T+F + V P + D L +Y
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 483
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 519
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ I ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 346
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 347 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 406
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 407 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 451
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 452 TRHAACPVLVGCKWVSNKWFHERGQ 476
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 102/218 (46%), Gaps = 24/218 (11%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE---TVESTKGT-R 127
I ++LS P + + S E+ + +K + PS+ A L E E G+ R
Sbjct: 322 IKTEILSIDPFIVLLHDMVSVEEGALLRTFSKNMISPSETAELSDSEEKSIFEFEVGSFR 381
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NP 185
TS ++ ++ + + ++ AT L +H E F V+ Y IG ++SH+D +
Sbjct: 382 TSKSVWLDNDANEATLK--LTQRLGDATGLDISHSEPFQVINYGIGGIFESHFDTSLQDE 439
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
+ RLA+ L YL+DV +GG T FP N + V P+ G L
Sbjct: 440 NRFLDGYMDRLATTLFYLNDVPQGGATHFPGLN---------------ITVFPKFGTALF 484
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 485 WYNLDTKGLLRLRTMHTGCPVIVGSKWVVSKWIDDKGQ 522
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/269 (26%), Positives = 123/269 (45%), Gaps = 45/269 (16%)
Query: 43 RPKLRTLEVVE---KENESGLPHGETGDASIQS------IPF-----QVLSWRPRALYFP 88
RPKL++ + E + + LP ++ +++ +PF ++L+ +P + F
Sbjct: 263 RPKLKSTKAYEALCRGEQLKLPDVDSEQQALKCRYKPGILPFVRYKEEMLNRKPHIVLFH 322
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+ S + +++ A +L+ + +A + + S R S +++ I +L
Sbjct: 323 DVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKRISQVSWLWDDHANKTIHQL-S 381
Query: 149 HKIARATMLPQT-------HGEAFNVLRYEIGQKYDSHYDAF-------NPAEYGPQMSQ 194
++A T L QT E F +L Y IG +Y+ H D F + E+
Sbjct: 382 RRVADITGL-QTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHSSLPEHVRASGN 440
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
RLA+F+ YL+DV GG T+FP K +G + P + +Y++ NG
Sbjct: 441 RLATFMFYLNDVHAGGATVFP-------------KLKVG--IPPTKNGAAFWYNIGLNGD 485
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+D + H CPV+ G+KWVA KWI + Q
Sbjct: 486 VDPLTEHAGCPVLLGQKWVANKWIHEHGQ 514
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + N ++ ++I A+ R K + + + +E R S ++ E K
Sbjct: 330 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 388
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 389 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 446
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 447 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 491
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 492 KTRHAACPVLTGSKWVANKWLHERGQ 517
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q+I A+ L+ S +A GE + R S
Sbjct: 268 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 325 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 382
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y R+A+F++YLS VE GG T F + N V + L
Sbjct: 383 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 427
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 428 FWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 466
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + N E+ ++I A+ R K + + + +E R S ++ E +
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 101/204 (49%), Gaps = 24/204 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK RL ++ +R +T V + R S +++ ED
Sbjct: 342 PHIVRYYDVMSDEEIEKIKQLAKPRL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 397
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 398 DPVVAKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 456
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 457 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 501
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
+ H +CPV+ G KWV+ KW ++
Sbjct: 502 RTRHAACPVLVGCKWVSNKWFHER 525
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 484 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 529 TRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q+I A+ L+ S +A GE + R S
Sbjct: 334 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y R+A+F++YLS VE GG T F + N V + L
Sbjct: 449 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 494 FWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 532
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 484 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 529 TRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 273 LQPVRKEVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 329
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 330 KSAWLKDTADP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 387
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 388 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 432
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 433 FWWNLHRSGEGDSDTLHAACPVLVGDKWVANKWIHEYGQ 471
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +VL P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 324 LQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 380
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 381 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 438
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V + L
Sbjct: 439 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAAL 483
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G KWVA KWI + Q
Sbjct: 484 FWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQ 522
>gi|359400227|ref|ZP_09193216.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
gi|357598467|gb|EHJ60196.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
Length = 193
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 95/198 (47%), Gaps = 28/198 (14%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F QC ++IA + +PS +A G+ V RTSS +S G + +
Sbjct: 16 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPD---VGAVAALA 67
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 203
K+ + + H E RYE+GQ++ +H D F P +Y QR +F++YL
Sbjct: 68 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 127
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+DV+ GG T F K I ++P RG + + + P+G+++ +LH +
Sbjct: 128 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 172
Query: 264 CPVIKGEKWVATKWIRDQ 281
V +G K+V T+W R++
Sbjct: 173 MKVRQGRKYVVTQWFRER 190
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 474
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 475 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 519
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 520 TRHAACPVLVGCKWVSNKWFHERGQ 544
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 493
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 494 FLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 538
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 539 TRHAACPVLVGCKWVSNKWFHERGQ 563
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V R L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
Length = 487
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/222 (28%), Positives = 103/222 (46%), Gaps = 15/222 (6%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +S P F ++ ++ + L PS + L+ G RTS+ ++
Sbjct: 268 METISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWL 327
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEY 188
+S +++ I+ + A +P +H E+ VLRYE Q YD H D F N A+
Sbjct: 328 ESSSHP--VVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRNSADV 385
Query: 189 GPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDG 243
++ R+ + Y+SDV +GG T F G L K C GL V P++
Sbjct: 386 LKKIEHGYKNRMITVFWYMSDVAKGGHTNFARAGG--LPPPPTNKGCTQGLSVVPKKRKV 443
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
++FYS+ PNG D SLH CPV +G K KW+ ++ + +
Sbjct: 444 VVFYSMLPNGEGDPMSLHAGCPVEEGIKMSGNKWVWNKPRSD 485
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 98/213 (46%), Gaps = 19/213 (8%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
I ++LS P L F + S ++ I +++K+ + PS E T RTS
Sbjct: 328 IKTEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKS 387
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
+ S++ + T + I ++ AT L E F V+ Y +G +++H D +
Sbjct: 388 VWYSSTSNDTT--KRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFN 445
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
RLA+ L YL++V +GG T FP N L V P+ G L +Y+L
Sbjct: 446 GTRDRLATTLFYLNEVRQGGGTHFPRLN---------------LTVFPQPGSALFWYNLD 490
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G ++LH CPVI G KWV +KW+ D Q
Sbjct: 491 TRGNDHTSTLHTGCPVIVGSKWVMSKWVEDAGQ 523
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 422
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 423 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 467
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 468 TRHAACPVLVGCKWVSNKWFHERGQ 492
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 103/210 (49%), Gaps = 30/210 (14%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + + AS ++ +++ AK RL+ + + Q + +T R S ++ + E
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKL-TTAQYRVSKSAWLGSHEHP- 381
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 193
I++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 382 -IVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTG---- 436
Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
R+A++LLY+SDV+ GG N +F D IG V P++G + +Y+L +G
Sbjct: 437 NRIATWLLYMSDVQAGG-------NTVFTD--------IGAVVWPKKGTAVFWYNLHRSG 481
Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 482 EGDYRTRHAACPVLVGNKWVSNKWIHERGQ 511
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 103/205 (50%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V +T R S +++ ++D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTTASYRVSKSSWLEETDDP 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 --VVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 307 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 363
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 364 KSAWLKDTVDPK--LVALNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 421
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V R L
Sbjct: 422 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 466
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 467 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 505
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 107/231 (46%), Gaps = 38/231 (16%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKG----TRTSS 130
+V+++ PR F + S + + + A K S + L G T G R S
Sbjct: 361 EVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQ 420
Query: 131 GTFISASEDKTGILELIEHKIARATMLP------QTHGEAFNVLRYEIGQKYDSHYDAF- 183
+++ D+ L +E++I T L ++H E F VL Y +G Y HYD
Sbjct: 421 TSWLGT--DEYPELSRLENRIKLTTGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTG 478
Query: 184 -------NPAEYGPQMS--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
NP + + +R+A+++ YL+DV+ GG T+FP +
Sbjct: 479 YMLGIPSNPLDSDDIRTSGERMATWMFYLNDVKAGGATVFP---------------EVKT 523
Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
++ +G +Y++ P+G D +LHG CPV+ G KWV+ KWIR++ Q +
Sbjct: 524 RIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKWVSNKWIREEGQMD 574
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 91/194 (46%), Gaps = 28/194 (14%)
Query: 99 IIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP 158
+I A + +K +++ + V RT+ G ++ ++ + + I +I T
Sbjct: 2 LIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE--LTKRITRRIMDMTGFD 57
Query: 159 QTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQMSQRLASFLLYLSDVEEG 209
E F V+ Y IG Y H D F+ A Y + R+A+ L YL+DVE+G
Sbjct: 58 LADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQG 117
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T +F D GY V P+ G + +Y+L +G D + H +CPVI G
Sbjct: 118 GAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVG 162
Query: 270 EKWVATKWIRDQEQ 283
KWV T+WIR++ Q
Sbjct: 163 SKWVMTEWIREKRQ 176
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 318 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 374
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 375 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 432
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V R L
Sbjct: 433 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 477
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 478 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 516
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 451
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 452 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 511
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 512 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 556
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 557 TRHAACPVLVGCKWVSNKWFHERGQ 581
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 336 PHIVRYYEVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + H++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 --VVARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFD-FSRRPFDITLKTEGNRLA 450
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 451 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 495
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
+ H +CPV+ G KWV+ KW ++
Sbjct: 496 RTRHAACPVLVGCKWVSNKWFHER 519
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
ATCC 15444]
Length = 298
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 104/207 (50%), Gaps = 29/207 (14%)
Query: 82 PRA-LY-FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
P+A LY +PNF + E C ++IA +RL+ S + RTS + I +
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRASTTT-----DAFADPKIRTSRSSDI-GTMG 153
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----Q 194
+++L E IA A + ++ +A RY++ Q+Y +HYD F P Q+ Q
Sbjct: 154 HNLVMQLDE-LIAEALGIHWSYSDATQTQRYDVNQEYKAHYDYFTPGTRDYQVHCQFTGQ 212
Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
R +F++YL+DVEEGG T F + + + P +G +++ +L P+G+
Sbjct: 213 RTWTFMIYLNDVEEGGGTRF---------------RRLEKTIMPEKGKAVIWNNLNPDGS 257
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
++ ++H V G K+V TKW R++
Sbjct: 258 VNPYTIHHGMKVRSGAKYVITKWFRER 284
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP G + P++G + +Y+LF +G D
Sbjct: 453 FLNYMSDVEAGGATVFP---------------DFGATIWPKKGTSVFWYNLFRSGEGDYR 497
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 TRHAACPVLVGSKWVSNKWFHERGQ 522
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 64/121 (52%), Gaps = 16/121 (13%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL 222
E + Y +G Y H+D F + R+AS L YLSDV +GG+T +F+
Sbjct: 5 EFLQIANYGLGGHYLPHFD-FTRDVATHKNGNRIASMLFYLSDVAKGGDT-------VFI 56
Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
D+G K+KP +G + +Y+LF NG +D + H SCPVI G KWVA W+ +
Sbjct: 57 DAG--------AKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKWVANMWMHEHG 108
Query: 283 Q 283
Q
Sbjct: 109 Q 109
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 98/207 (47%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + H++ T L E V Y +G +Y+ H+D E R+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRV 456
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 457 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 501
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 502 YRTRHAACPVLVGCKWVSNKWFHERGQ 528
>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
Length = 535
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 85/179 (47%), Gaps = 25/179 (13%)
Query: 113 ALRQGETVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
L++ E T G+ RTS GT D+ I+E + + + L E +
Sbjct: 354 VLQRSEVYSPTNGSTAATFRTSQGTVFEY--DEHPIIEKLSQHMTLISGLDMGFAEPLQI 411
Query: 168 LRYEIGQKYDSHYDAFNPA-EYGPQM--SQRLASFLLYLSDVEEGGETMFPFENGIFLDS 224
Y IG Y+ H D+F + +Y Q + R+A+ + YLS+VE GG T FPF
Sbjct: 412 ANYGIGGHYEPHMDSFPESFDYSLQRFKTNRIATGIFYLSNVEAGGATAFPF-------- 463
Query: 225 GYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ L VKP +G L +Y+L +G D + H CPV++G KW+A WIR Q
Sbjct: 464 -------LPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKWIANVWIRLSHQ 515
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|345324764|ref|XP_001505668.2| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
[Ornithorhynchus anatinus]
Length = 495
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 88/195 (45%), Gaps = 34/195 (17%)
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---HGEAFNVLRYEIGQ 174
+ V+ + R S T++ E ++ I+ ++ R T LPQ H E V+RY+ G
Sbjct: 259 QKVKMSDLVRNSQHTWLYQGEGAHQVMRSIQQRVLRLTRLPQEIVEHSEPLQVVRYDQGG 318
Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP----- 215
Y +H D+ F E P + S R + L YL++V GGET FP
Sbjct: 319 HYHAHMDSGPVFPETACSHTKFITNETAPFETSCRYVTVLFYLNNVTGGGETTFPVADNR 378
Query: 216 -------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-----IDRTSLHGS 263
+N I L + L+VKP++G + +Y+ +G +D SLHG
Sbjct: 379 TYDEMSLIQNDIDLRDTRKHCDKGNLRVKPKQGTAVFWYNYLSDGQGWVGDLDEYSLHGG 438
Query: 264 CPVIKGEKWVATKWI 278
C V +G KW+A WI
Sbjct: 439 CLVTQGTKWIANNWI 453
>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
Length = 239
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 95/199 (47%), Gaps = 30/199 (15%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
NF + E+C+ I+ + +L S++ + K R S ++S + +++ +
Sbjct: 61 NFINKEKCKEIMNNTQNKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 203
KI++ +P + E V+RY GQ Y+ H+DA E+ + QR + L+YL
Sbjct: 112 QKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVYL 171
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-IDRTSLHG 262
++ EGG T F K + LKVKP GD ++FY L N + SLH
Sbjct: 172 NNEFEGGHTFF---------------KNLNLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 263 SCPVIKGEKWVATKWIRDQ 281
PV GEKW+A W R++
Sbjct: 217 GMPVTSGEKWIANLWFRER 235
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 453 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 497
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 TRHAACPVLVGCKWVSNKWFHERGQ 522
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 256
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 257 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 314
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 315 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 359
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 360 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 392
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 105/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 304 LQPVRKEVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 360
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 361 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 418
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y R+A+F++YLS VE GG T F + N V + L
Sbjct: 419 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 463
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 464 FWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 502
>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
Length = 340
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 103/221 (46%), Gaps = 22/221 (9%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G+ G S+Q I + ++ +P + + + + ++A A+ RL+ S E + S
Sbjct: 12 GQDGFFSLQPIKLEEINLKPYVIVMHDVVQDKDIEDLMAFAEPRLERST-TYTGNEMMPS 70
Query: 123 TKGTRTSSGTFISASEDKTGIL--ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
+ RTSS +++ E + + + T EA+ + Y G + H+
Sbjct: 71 PE--RTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTDEEAEAYQLANYGTGGHFLPHH 128
Query: 181 DAFNPAEYGPQ--MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
D + RLA+ ++Y++DVEEGG T+FP +G+++ P
Sbjct: 129 DFLQDSLQADNSVTGDRLATLMIYMTDVEEGGTTVFP---------------NLGIRLTP 173
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
++GD +++L +G +R + H CPV+ G KW+A KW R
Sbjct: 174 KKGDAAFWWNLKASGDGERLTTHAGCPVLYGSKWIANKWFR 214
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 102/216 (47%), Gaps = 22/216 (10%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
++ + +++S P+ F N S + + ++ A+ RL+ +++ + +E R S
Sbjct: 314 LKPVAMEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVD-YRIS 372
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
++S S+ I+ I ++ T L GE V Y +G Y+ H+D E
Sbjct: 373 QIAWLSDSDGD--IVRRINRRVGFITGLNTNTGECLQVNNYGVGGHYEPHFDHSLDMENS 430
Query: 190 PQMS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
P S R+A+F+ YLS+VE GG T+F G+K P +G +
Sbjct: 431 PIASLGQGNRIATFMFYLSEVEAGGSTVFI---------------KTGVKTNPFKGGAVF 475
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
+Y+L +G D SLH CPV+ G KWVA KW+ +
Sbjct: 476 WYNLKKSGEGDWDSLHAGCPVLIGNKWVANKWLHEH 511
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 104/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 338 LQPVRKEVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 394
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P
Sbjct: 395 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 452
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 453 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 497
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 498 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 536
>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
Length = 263
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 96/203 (47%), Gaps = 31/203 (15%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
++++I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDNRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++YL+D +GG T F + + VKP++G L + +L P+G + S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNS 240
Query: 260 LHGSCPVIKGEKWVATKWIRDQE 282
+H + PV +G K V TKW R +
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263
>gi|323456313|gb|EGB12180.1| hypothetical protein AURANDRAFT_61447 [Aureococcus anophagefferens]
Length = 317
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 98/219 (44%), Gaps = 33/219 (15%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSSGT 132
P +VLS P A +FA+ +C IIA A RL AL G+ E +R++
Sbjct: 101 PCEVLSTAPLAFCVRDFATGAECDRIIAEATPRLS---AALVAGDGAGEQAGSSRSAQVA 157
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------- 183
++ S D + ++A +P +H E+ V++Y G +Y H+DAF
Sbjct: 158 WVPRSPDDP----WLARRVAELIDVPLSHAESLQVVKYGAGGEYKPHFDAFPLDAARGRR 213
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
QR + +LYL+DVE+GG T F E V+PRRG
Sbjct: 214 AAVRGRTYAGQRRVTAILYLNDVEKGGGTAFHSETP------------AEFVVRPRRGSL 261
Query: 244 LLFYSLFPNGTIDR--TSLHGSCPVIK-GEKWVATKWIR 279
+FY+ + + T DR SLH PV G KW+A W R
Sbjct: 262 FVFYNCYEDST-DRHPMSLHAGLPVAPGGTKWIANIWWR 299
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 325 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 381
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 382 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 439
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V + L
Sbjct: 440 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAAL 484
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 485 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 523
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 452
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 453 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 497
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
+ H +CPV+ G KWV+ KW ++
Sbjct: 498 RTRHAACPVLVGCKWVSNKWFHER 521
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 393 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 451
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 452 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 496
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
+ H +CPV+ G KWV+ KW ++
Sbjct: 497 RTRHAACPVLVGCKWVSNKWFHER 520
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 497
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 498 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 530
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 497
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 498 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 530
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 96/216 (44%), Gaps = 24/216 (11%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A+ F + + E+ I A RL+ + + +E T RTS
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + ++ I +I T L Q E V Y IG YD H+D E
Sbjct: 381 AWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438
Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
S RLA+ L Y++ E GG T+F + V P + D L +Y
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 483
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 71/141 (50%), Gaps = 15/141 (10%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
+++ I +++ + L T E V+ Y IG Y+ HYD R+A+FL Y
Sbjct: 13 LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIATFLSY 72
Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
LSDVE GG T+F +G V P++GD +Y+L +G D ++ H
Sbjct: 73 LSDVEAGGGTVFTR---------------VGATVWPQKGDAAFWYNLKRSGDGDSSTRHA 117
Query: 263 SCPVIKGEKWVATKWIRDQEQ 283
+CPV+ G KWVA KWI + Q
Sbjct: 118 ACPVLVGSKWVANKWIHEVGQ 138
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 99/218 (45%), Gaps = 21/218 (9%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
++ + +VL P + + ++ + II AK L+ + + + G+ + + R
Sbjct: 321 LKPLKMEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYA--DYRV 378
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNP 185
S T+I+ ED I I ++ T L + E V Y I +Y+ H+D P
Sbjct: 379 SKNTWIA--EDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRP 436
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
+ R+A+ LLYLSDV+ GG T+F G+ P +G G+
Sbjct: 437 KHFDRWGGNRIATMLLYLSDVDWGGRTVFT-------------NTAPGVGTDPIKGAGVF 483
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L NG + + H CPV+ G+KWVA WI + Q
Sbjct: 484 WYNLLRNGKSNPKTQHAGCPVVLGQKWVANLWIHEHGQ 521
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I ++L P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V + L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G KWVA KWI + Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQ 532
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 80/162 (49%), Gaps = 20/162 (12%)
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD- 181
T R S ++S E +++ +E +IA T L E F V Y + +YD H+D
Sbjct: 335 TAHYRISKNCWLSGREHGE-VIDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDF 393
Query: 182 ---AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
N + R+A+ L+++S VE GG T+FP+ +G ++ P
Sbjct: 394 SRDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPY---------------VGARILP 438
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
++GD + +++L +G D + H CPV+ G KWVA KWI +
Sbjct: 439 QKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWIHE 480
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 96/216 (44%), Gaps = 24/216 (11%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A+ F + + E+ I A RL+ + + +E T RTS
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + ++ I +I T L Q E V Y IG YD H+D E
Sbjct: 381 AWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438
Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
S RLA+ L Y++ E GG T+F + V P + D L +Y
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 483
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 27/215 (12%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSS 130
PF+V LS P YF + S ++ + II K ++ S++ G+T ST RTS
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSDIRTSQ 391
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
T++ + L I+ ++ T L E ++ Y IG +Y+ H+D + AE +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G + RL + L YL+DV GG T FPF + L V P +G L++Y+
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYN 493
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L + D + H CPV+KG KW+ +W + Q
Sbjct: 494 LHRSLHKDFRTKHAGCPVLKGSKWICNQWFHEAAQ 528
>gi|159474434|ref|XP_001695330.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275813|gb|EDP01588.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1887
Score = 87.4 bits (215), Expect = 6e-15, Method: Composition-based stats.
Identities = 59/211 (27%), Positives = 94/211 (44%), Gaps = 25/211 (11%)
Query: 79 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 138
S PR L F C ++ A A RL +R + + +R S TF +
Sbjct: 1687 SLSPRVLVVDGFLPPGLCDALCAVAAPRL------IRSRVSTGAETPSRVSQSTFFTGDS 1740
Query: 139 DKTGILELIEHKIARATMLPQT---------HGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
+ + +E ++ P+ EA V+ Y++G Y HYD + G
Sbjct: 1741 ARLPEVVAVEARLQALMERPEVTAGGRPTLVKSEALQVVSYDVGGFYSEHYDN----KTG 1796
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
+S R A+ ++YL D + GG T FP + + GL+V P +G L+F+S
Sbjct: 1797 GVIS-RAATIIIYLQDTQAGGSTHFPNQQLRLMRVARP-----GLRVYPAKGRALIFWSR 1850
Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
P+G+ D SLH + PV G KW+ T+W ++
Sbjct: 1851 LPDGSEDLASLHSAEPVRAGSKWICTRWFKE 1881
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +VL P + + +F S + Q I A L+ S +A GE + + R S
Sbjct: 311 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 367
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D IL ++ +IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 368 KSAWLKDTVDP--ILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 425
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y R+A+F++YLS VE GG T F + N V + L
Sbjct: 426 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVVKNAAL 470
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 471 FWWNLHRSGQGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 509
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
+ AT L T+ E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+G
Sbjct: 397 LKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAIFYLSEVEQG 456
Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
G T FPF + + VKP+ G+ L +Y+L + D + H CPV+KG
Sbjct: 457 GATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKG 501
Query: 270 EKWVATKWIRDQEQ 283
KW+ WI + Q
Sbjct: 502 SKWIGNVWIHEVTQ 515
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 406 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 451 TRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 26/209 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
PR + + N S E+ I AK +L ++ +R +T V S R S ++ ++D
Sbjct: 338 PRIVRYLNALSDEEIAKIKELAKPKL--ARATVRDPKTGVLSVANYRVSKSAWLEENDDP 395
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 396 --VIARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTDGNRLA 452
Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 453 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 497
Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQHED 286
+ H +CPV+ G KW KW Q+ H D
Sbjct: 498 RTRHAACPVLVGSKW--GKWTHTQDHHFD 524
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASEDK 140
P + + + S E+ + I +K +L S+ +R +T R S +++ ED
Sbjct: 337 PHIVRYYDVLSDEEIEKIKEISKPKL--SRATVRDPKTGHLIVVSYRISKSSWLK--EDD 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
I+ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 393 DPIIAQVNRRMQYITGLSVKTAELLQVSNYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 453 FLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTSVFWYNLFRSGECDYR 497
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 TRHAACPVLVGSKWVSNKWFHERGQ 522
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ LS +P + + +F S + + I A+ L+ S +A R + T R S ++
Sbjct: 81 ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQV---TAEYRISKSAWLK 137
Query: 136 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
S + ++ +I+ T L HGE V+ Y IG Y+ H+D A +P+ +
Sbjct: 138 GSAQSA--VSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 195
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+ ++YLS VE GG T F + N V + + +++L
Sbjct: 196 KTGNRVATVMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 240
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 241 RNGRGDPDTLHAGCPVLIGDKWVANKWIHEYGQ 273
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 406 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 451 TRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 65/118 (55%), Gaps = 18/118 (15%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEY--GPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
E NV Y +G + HYD + P Y G M L + L Y+SD+++GG T+FP
Sbjct: 409 EELNVANYGLGTIFGPHYD-YTPENYDIGWFMGGPLGTILFYVSDLQQGGATIFP----- 462
Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
I + V PR+G LL+++L+ +G D +LH SCPVI+G++W TKW+
Sbjct: 463 ----------SINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWTLTKWV 510
>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
Length = 263
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 95/203 (46%), Gaps = 31/203 (15%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
++ +I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++YL+D +GG T F + + VKP++G L + +L P+G + S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNS 240
Query: 260 LHGSCPVIKGEKWVATKWIRDQE 282
+H + PV +G K V TKW R +
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 104/213 (48%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++S +P + + +F + + + I + A L+ S +A GE ++T R S ++
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVA--AGEK-QATADYRISKSAWLK 369
Query: 136 ASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
S I+ ++ +I+ T L H GE V+ Y IG Y+ H+D A +P+ +
Sbjct: 370 GS--AQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+F++YLS VE GG T F + N V + +++L
Sbjct: 428 KTGNRVATFMIYLSPVEAGGSTAFIYAN---------------FSVPVVEKAAIFWWNLH 472
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 473 RNGEGDDDTLHAGCPVLIGDKWVANKWIHEYGQ 505
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 102/222 (45%), Gaps = 30/222 (13%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +V+ RP F + S ++ Q++I + RLK + + + +E R S
Sbjct: 314 LQPSKLEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELE-VANYRIS 372
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
++ + + ++E + + T L E V+ Y IG Y++H+D
Sbjct: 373 KSAWLKNHDHE--VVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKD 430
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
AF G R+A+++ Y+SDV+ GG T+FP +GL V P +G
Sbjct: 431 AFKQLGTG----NRIATWINYMSDVKAGGATVFPR---------------LGLTVWPEKG 471
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 472 SAAFWWNLHRSGEGDILTRHAACPVLAGSKWVSNKWFHERGQ 513
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 96/220 (43%), Gaps = 32/220 (14%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++L + P A+ F N + + I A +LK + TV+++K T
Sbjct: 316 IKVEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRA--------TVQNSKTGELEHAT 367
Query: 133 F-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
+ IS S D +++ + +I T L Q E V Y +G YD H+D E
Sbjct: 368 YRISKSAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEE 427
Query: 188 YGP----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
R+A+ L Y+S E GG T+F +G V P + D
Sbjct: 428 KNAFKTLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDA 472
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +Y+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 473 LFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQ 512
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 102/205 (49%), Gaps = 22/205 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLEEEDDP 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ + T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 --VVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSGEGDYR 496
Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
Length = 544
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + + L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVNPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N L V R L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 504
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 97/211 (45%), Gaps = 56/211 (26%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + + + S E+ + AK RL+ + + S
Sbjct: 330 KPRIVRYHDIISDEEISKVKELAKPRLRRATI------------------------SNPI 365
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
TG+LE +++I++ + + V Y +G +Y+ H+D AF G
Sbjct: 366 TGVLETAQYRISKRWAIME-----LEVANYGMGGQYEPHFDFARKDEPDAFKELGTG--- 417
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A++L Y+SDVE GG T+FP +G V P++G + +Y+LF +
Sbjct: 418 -NRVATWLFYMSDVEAGGATVFP---------------EVGAAVYPKKGTAVFWYNLFES 461
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 462 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 492
>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 169
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 71/110 (64%), Gaps = 3/110 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFS 165
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 95/210 (45%), Gaps = 26/210 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
PR + + + ++ ++I A+ R K + + + GE R S ++ E K
Sbjct: 349 PRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGEL--EIANYRISKSAWLQEHEHK 406
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRL 196
+ + ++ T + E V+ Y IG Y+ H+D E S R+
Sbjct: 407 H--VRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRI 464
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 465 ATVLYYMSDVEQGGGTVFT---------------KINISLWPKKGSAAFWYNLKPNGEGD 509
Query: 257 RTSLHGSCPVIKGEKWVATKWI--RDQEQH 284
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 510 YKTRHAACPVLTGSKWVANKWLHERGQEFH 539
>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
Length = 454
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 98/215 (45%), Gaps = 19/215 (8%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
I + ++LS P + + + + + + KRL+ ++ AL Q + RTS
Sbjct: 249 IAPVKMEILSLNPYIVLCHDVILPSEQEFLKTQSSKRLEGAR-ALDQVKNEVVFNFIRTS 307
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-Y 188
T++ + D + + H I + L G+ + ++ Y +G +++H D E
Sbjct: 308 KATWLKKNSD--NVTRRLSHWIEDVSNLDSNIGDLYQIINYGVGGLFEAHSDTMRKDEDR 365
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
+ R+A+F+ YL DV +GG T+F + L V P+ G L +++
Sbjct: 366 WKVLYDRIATFIFYLQDVPQGGATLF---------------NNLNLTVFPKAGAALFWFN 410
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L G D ++H CPVI G KW+ TKW+ D Q
Sbjct: 411 LDNAGDTDLFTVHTGCPVIVGSKWIMTKWVYDLGQ 445
>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
pv. vasculorum NCPPB 702]
Length = 301
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 89/194 (45%), Gaps = 22/194 (11%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDKTGILELIEHK 150
SA++C+ ++ A+ L+ SQ+ + + RTS G T ED + + +
Sbjct: 121 SADECRLLMLLARPHLRDSQV-IDPNDASTQRAPVRTSRGATLDPIIEDFAA--RVAQAR 177
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSDVE 207
+A L TH E +VL Y G++Y +H D P A P R + +YL+ V+
Sbjct: 178 LAACAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNVVD 237
Query: 208 EGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVI 267
GGET FP G++V+PR G + F +L +G + SLH PV
Sbjct: 238 AGGETEFPLA---------------GVRVQPRPGALVCFDNLHADGRPNADSLHAGLPVT 282
Query: 268 KGEKWVATKWIRDQ 281
G KW+ T W R Q
Sbjct: 283 AGSKWLGTLWFRQQ 296
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 27/215 (12%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSS 130
PF+V LS P YF + S ++ + II K ++ S++ G+T ST RTS
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSEIRTSQ 391
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
T++ + L I+ ++ T L E ++ Y IG +Y+ H+D + AE +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
G + RL + L YL+DV GG T FPF + L V P +G L++Y+
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYN 493
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L + D + H CPV+KG KW+ +W + Q
Sbjct: 494 LHRSLHKDFRTKHAGCPVLKGSKWICNEWFHEAAQ 528
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P + F + + + + TA+ L S + E+V S RT+ G
Sbjct: 281 PFKMELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVKKSNNESVVSK--VRTAKG 338
Query: 132 TFISA---SEDKTGILELIEHKIARATMLPQTHGEAFN---VLRYEIGQKYDSHYDAFNP 185
F+ A S + +++ ++ ++ + L E +N L Y+ G Y H D FN
Sbjct: 339 AFMHADRLSPESAQVVQRLKQRMGDLSDL-NIKREGYNEMQYLNYDFGDHYLLHMDYFNI 397
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
+ M+ R+A+FL+YL+DV GG T+FP + V P +G +L
Sbjct: 398 S-----MNDRIATFLIYLNDVTRGGGTIFP---------------QVKQAVHPEKGKLIL 437
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y++ N + SLHG+CPV+ G K WIR+ +Q
Sbjct: 438 WYNMNSNLDYELASLHGACPVLIGRKIAIVYWIREHDQ 475
>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
Length = 542
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 96/213 (45%), Gaps = 19/213 (8%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
I ++LS P L + S ++ I ++K+ + PS E+ T RTS
Sbjct: 323 IKTEILSVDPFVLLLHDMISQKESTLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKS 382
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
+ S+ + T + I ++ AT L E + V+ Y +G +++H D +
Sbjct: 383 VWYSSDFNDTT--KKITERLGDATGLDTNFTEFYQVINYGLGGFFETHLDMLLSEKNRFN 440
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
R+A+ L YL++V +GG T FP I L V P+ G L +Y+L
Sbjct: 441 GTRDRIATTLFYLNEVRQGGGTYFPR---------------INLTVFPQPGSALFWYNLD 485
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
NG SLH CPVI G KWV +KWI D Q
Sbjct: 486 TNGNDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 518
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 100/241 (41%), Gaps = 50/241 (20%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
I I ++L + P A+ F N S + + I A +LK + TV+++K
Sbjct: 329 IAPIKVEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRA--------TVQNSKTGELE 380
Query: 130 SGTF-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
T+ IS S D ++E + +I T L Q E V Y +G YD H+D
Sbjct: 381 HATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGGHYDPHFDFAR 440
Query: 185 PAEYGP----------------------QMSQRLASFLLYLSDVEEGGETMFPFENGIFL 222
A YG R+A+ L Y+S E GG T+F
Sbjct: 441 IANYGLGGHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF-------- 492
Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+G V P + D L +Y+L +G D + H +CPV+ G KWV+ KWI ++
Sbjct: 493 -------NHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERG 545
Query: 283 Q 283
Q
Sbjct: 546 Q 546
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 102/226 (45%), Gaps = 30/226 (13%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+ I + S+ P F + S E+ ++I AK L S + + G E + RTS
Sbjct: 307 LMPIKIEQHSFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVS-NVRTS 365
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP----QTHGEAFNVLRYEIGQKYDSHYDAF-- 183
++ E +L + +I T L + E V Y IG Y H+D
Sbjct: 366 KTAWLP--EGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMK 423
Query: 184 NPAEYG------PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
+ A++ Q R+A+F+ YL+DVE GG T FP G+ VK
Sbjct: 424 DKADFEYMHHRELQAGDRIATFMFYLNDVERGGSTAFP---------------RAGVAVK 468
Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
P +G +++L +G D +LHG+CPV+ G KWV+ KWIR+ Q
Sbjct: 469 PVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWIRETAQ 514
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 95/207 (45%), Gaps = 22/207 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP + + + S + + I A+ R + + + + +E R S ++ +ED+
Sbjct: 348 RPYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFA-NYRISKSAWLKDAEDE 406
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRL 196
++ I ++ T L E V+ Y IG Y+ H+D E S R+
Sbjct: 407 --MIRTISQRVEDMTGLTMETAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRI 464
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+ L Y+SDV +GG T+FP + L + PR+G +++L +G D
Sbjct: 465 ATVLFYMSDVTQGGATVFP---------------SLNLALWPRKGTAAFWFNLHASGRGD 509
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 510 YATRHAACPVLTGTKWVSNKWIHERGQ 536
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 102/225 (45%), Gaps = 35/225 (15%)
Query: 62 HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+ T ++ PF ++LS P + + + + + ++ +K +K + +
Sbjct: 215 YNTTAAPFLRLAPFKTELLSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLK 274
Query: 120 VES-TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
V RTS+ ++++ E+ ++E +E ++ T + E + ++ Y IG Y
Sbjct: 275 VRPFIDSGRTSNSVWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKP 332
Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
H D F PQ LSDV +GG T+FP N + V+P
Sbjct: 333 HTDHFE----TPQ-----------LSDVPQGGATLFPRLN---------------ISVQP 362
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
R+GD LL+Y+L G + ++H SCP+IKG KW KWI + Q
Sbjct: 363 RQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 407
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 98/219 (44%), Gaps = 41/219 (18%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
+S+P +V + + S +C+ +IA LKPS + V+ G
Sbjct: 174 KSLPIEV---------YESILSEYECRYLIAKFSALLKPSMV-------VDPVTGRGKID 217
Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS I + I ++ I++ T + +GEA N+LRY GQ+Y HYD N
Sbjct: 218 SVRTSYVAVIEPTHCD-WITRKLDKIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276
Query: 185 PAEYGPQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
QR+ + L+YL+ + EGGET+FP + +++ P+ G
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTINEGGETLFPK---------------LDIRIAPKSG 321
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
++F + NG + S H P + KW+ TKWIR+
Sbjct: 322 TMVVFSNSDENGKLLLNSYHAGAPTVSENKWLVTKWIRE 360
>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
Length = 263
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 95/203 (46%), Gaps = 31/203 (15%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
++ +I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPG--SPQYKTHCLSRGQRTWTC 195
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
++YL+D +GG T F + + V+P++G L + +L P+G + S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVRPKKGMALFWNNLLPSGDPNLNS 240
Query: 260 LHGSCPVIKGEKWVATKWIRDQE 282
+H + PV +G K V TKW R +
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263
>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
Length = 491
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 66/128 (51%), Gaps = 22/128 (17%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEY----GPQMSQ---RLASFLLYLSDVEEGGETMFP 215
EA V+ Y IG +Y+ H D + E P + R+++FL YLS V GG T+FP
Sbjct: 364 EAMQVVNYGIGGQYEPHLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLSRVHLGGATVFP 423
Query: 216 FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVAT 275
N ++V P + +Y+ PNG D+ +LH CPV+ GEKWVA
Sbjct: 424 KLN---------------VRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKWVAN 468
Query: 276 KWIRDQEQ 283
KWIR++ Q
Sbjct: 469 KWIRERGQ 476
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 20/211 (9%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ P + N + ++ I +K +L S + G + + RTS +I
Sbjct: 333 EVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRTSKSAWIE 392
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQM 192
E ++ + + + T L E F V+ Y IG Y+ H+D P E + P++
Sbjct: 393 DEEHP--MIRRVSERTSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIATFDPEV 450
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+ + + Y++ E GG T+FP +G+K+ P +G ++++L N
Sbjct: 451 GNRIITVIFYVAAPEAGGATVFP---------------DLGVKLWPEKGSCAVWWNLMRN 495
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H CP I G KW+A KW ++ Q
Sbjct: 496 GEGDYRTKHAGCPTITGSKWIANKWYHERGQ 526
>gi|397644356|gb|EJK76358.1| hypothetical protein THAOC_01879, partial [Thalassiosira oceanica]
Length = 539
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/223 (30%), Positives = 104/223 (46%), Gaps = 31/223 (13%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQG 117
HG T + + + P+ V+ F NF + E+ ++ + +R A G
Sbjct: 324 HGATWEGAEEGSPWVVV--------FDNFLTDEEVADLVKGGELEGYERSTDQGAANAYG 375
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E + TRTSS + ++ + KI T +PQ + E+F +L+Y+ GQ Y
Sbjct: 376 EQEKVVSRTRTSSNAWCMHKCERLPGVRSASKKIEAVTGIPQVNYESFQLLKYDGGQFYR 435
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
SH+D+ + + P R+ +F LYLSDVEEGGET F +G+ VK
Sbjct: 436 SHHDS-SSVDDSP-AGHRILTFFLYLSDVEEGGETYF---------------SKLGIAVK 478
Query: 238 PRRGDGLLFYSLFPNGTI--DRTSLHGSCPVIKGEKWVATKWI 278
P++G L++ S+ D+ H + VIKGEK A WI
Sbjct: 479 PKKGRALVWPSVLDEDPTYWDKRMYHEAKDVIKGEKKAANHWI 521
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
Query: 273 VATKWIRDQEQ 283
+ WI + Q
Sbjct: 505 IGNVWIHEVTQ 515
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 97/195 (49%), Gaps = 22/195 (11%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHK 150
S E+ + I AK +L ++ +R +T V + R S +++ ED ++ + +
Sbjct: 2 SDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRR 57
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLASFLLYLSDVEE 208
+ T L E V Y +G +Y+ H+D + P + G + RLA+FL Y+SDVE
Sbjct: 58 MQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEA 117
Query: 209 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
GG T+FP +G + P++G + +Y+L +G D + H +CPV+
Sbjct: 118 GGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLV 162
Query: 269 GEKWVATKWIRDQEQ 283
G KWV+ KW ++ Q
Sbjct: 163 GCKWVSNKWFHERGQ 177
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
Query: 273 VATKWIRDQEQ 283
+ WI + Q
Sbjct: 505 IGNVWIHEVTQ 515
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
Query: 273 VATKWIRDQEQ 283
+ WI + Q
Sbjct: 505 IGNVWIHEVTQ 515
>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 96/211 (45%), Gaps = 21/211 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + + + A++ S++ +R + E S RT+
Sbjct: 310 LKMELLSLDPYVVLYHDVL-ADREMSLLKLMAQRDLVRAVTYNATEKKHSEDPNRTTKAG 368
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ S + + ++ ++ L E F VL Y IG Y H D F + P++
Sbjct: 369 WLDPSHNLIRRMGILTEDMSN---LDLERSEDFQVLNYGIGGHYAVHPDFFEGS--NPEL 423
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+ L YLSDV GG T+FP + L V P++G L++Y+L
Sbjct: 424 PDRVATLLFYLSDVPLGGATVFPL---------------LDLSVFPKKGAVLMWYNLDHK 468
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G ++H +CPV+ G +WV TKW+ Q Q
Sbjct: 469 GQGMEKTIHSACPVVVGSRWVMTKWVNQQPQ 499
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 100/212 (47%), Gaps = 25/212 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
LS P + + N S + IA ++ +P ++ GE S K RT+ G +I
Sbjct: 328 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 383
Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
+++ I +I T L G+ +++Y G YD+H+D N + Q +
Sbjct: 384 ENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 443
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF-P 251
R+A+ L YL+DV+ GG T+FP + LKV RG L++Y++
Sbjct: 444 GDRMATVLFYLNDVKHGGSTVFP---------------VLQLKVPSERGKVLVWYNMHGE 488
Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+D +LHGSCPVI G K V + WI + +Q
Sbjct: 489 THDLDSRTLHGSCPVIDGAKTVLSCWIHEWDQ 520
>gi|332187533|ref|ZP_08389270.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
gi|332012462|gb|EGI54530.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
Length = 228
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 98/200 (49%), Gaps = 28/200 (14%)
Query: 86 YFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE 145
Y +F + QC ++IA +PS L + G RTS ++ + ++
Sbjct: 47 YQADFLTPAQCDALIAMIDANRRPSTLL-----SDRPDYGFRTSESCDMNRWSPE---VQ 98
Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQMS----QRLASFL 200
I+ IA+ +P GE RY GQ++ +H+D F+ +E Y ++ QR + +
Sbjct: 99 PIDESIAQLLGIPPEQGETMQGQRYAPGQQFRAHHDYFHESESYWEKVKVHGGQRTWTAM 158
Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
+YL+DV EGG T FP G++V PRRG L + ++ +G+ + +L
Sbjct: 159 IYLNDVPEGGATWFP---------------QAGIRVAPRRGLLLAWNNMLLDGSPNDATL 203
Query: 261 HGSCPVIKGEKWVATKWIRD 280
H PV++G K+V TKW R+
Sbjct: 204 HEGMPVVEGVKYVITKWFRE 223
>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
Length = 533
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 102/218 (46%), Gaps = 26/218 (11%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P F + A + +I + LK + + RT++G
Sbjct: 313 PFKLELLSKDPYIAVFHDVIYASEIAELIRIGEPMLKRTAVQNITQNVDTYISKDRTATG 372
Query: 132 TFI---SASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFNPA 186
++I + ++ + ++ I+ +I T L T + +L Y G Y SHYD FN
Sbjct: 373 SWILNGNLTKLERNMIWRIQRRIEDMTGLLITGFSEQDLQLLNYVFGGHYQSHYDFFNCP 432
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
+ R+A+ L+YL+DV GG T+FP + L V+P RG L +
Sbjct: 433 SFP---HDRIATTLIYLNDVVRGGATVFP---------------KLDLVVQPERGKVLHW 474
Query: 247 YSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
Y++ P+ DR SLHG CPV+ GEK T WI + +Q
Sbjct: 475 YNMLPDTFDYDRRSLHGGCPVLIGEKLALTNWIYEWDQ 512
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 100/207 (48%), Gaps = 28/207 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + N S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 355 PHIVRYYNVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 412
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
++ + ++ T L E V Y +G +Y+ H+D E P +RL
Sbjct: 413 --VVAKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEE--PDAFKRLGTGN 468
Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G
Sbjct: 469 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 513
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D + H +CPV+ G KWV+ KW ++
Sbjct: 514 GDYRTRHAACPVLVGCKWVSNKWFHER 540
>gi|219113765|ref|XP_002186466.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583316|gb|ACI65936.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 537
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 116/274 (42%), Gaps = 58/274 (21%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKK------RLKPSQLA 113
LPH G+ Q + + LS PR NF E+ Q ++A K RLK S
Sbjct: 264 LPHVMGGNPE-QPVYLRTLSLSPRVFDIVNFFRLEESQDLVARVLKEERELWRLKRST-- 320
Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
G + RTS F ++SE L++ + +T+ + +LRY +
Sbjct: 321 --TGPDSSNVNKRRTSESGFDTSSESALK-LKMRGFQALGFDDYIETYADGLQMLRYNLT 377
Query: 174 QKYDSHYD------------AFNPAEYGPQMSQRLASFLLYLSDVEE--GGETMF----P 215
Y++H+D FN A G + R A+ LLY+++ E GGET+F P
Sbjct: 378 TAYNAHHDFYGDISAVTGHHNFNSAGVG---ADRFATILLYMTEHGERGGGETVFTEAWP 434
Query: 216 FE--------------------NGIFLDSGYDYKKCIG-----LKVKPRRGDGLLFYSLF 250
E + L +G + + L V+P G +LFYS
Sbjct: 435 TELAPGKRVELPDAIESLRASGDASMLKTGSWEETLVAKCRSRLAVQPHPGRAVLFYSQL 494
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
PNG D TSLHG CPV+ G+K+ A W+ + +H
Sbjct: 495 PNGKPDYTSLHGGCPVLAGQKYAANLWVWNSHRH 528
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 80/161 (49%), Gaps = 22/161 (13%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AF 183
R S ++ ED ++ I + + T L T E V+ Y IG +Y+ H+D
Sbjct: 380 RISKSGWLRDEEDP--LIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFDFSRRS 437
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
P + R+ + + Y++DVE GG T +FLD+G +KV P +G
Sbjct: 438 EPTAFEKWRGNRILTVIYYMTDVEAGGAT-------VFLDAG--------VKVYPEKGSA 482
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI--RDQE 282
++++L P+G D + H +CPV+ G KWVA KW RDQE
Sbjct: 483 AVWHNLLPSGEGDMRTRHAACPVLTGSKWVANKWFHERDQE 523
>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'Black Sea 11']
gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Black Sea 11']
Length = 354
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 96/212 (45%), Gaps = 24/212 (11%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSG 131
P +VL + + S +C +I L+PS + L V++ RTS
Sbjct: 146 PTEVLDQTLPVELYVDVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNV---RTSYV 202
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
I+ S I ++ I++ T P+ +GEA N+LRY GQ+Y HYDA N G
Sbjct: 203 AIIAPSYCD-WITRKLDKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGSM 261
Query: 192 M---SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
QR+ + L+YL+ V +GGET FP + + V P G+ ++F +
Sbjct: 262 YKDGKQRIKTALVYLNTVRQGGETRFPK---------------LDISVSPTLGNMVVFSN 306
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
+G + S H P KW+ TKWIR+
Sbjct: 307 SDESGKLLLNSYHLGAPTFSENKWLVTKWIRE 338
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 77/158 (48%), Gaps = 18/158 (11%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
R S +++ T + L + + AT L T E V Y +G Y+ H+D F +P
Sbjct: 25 RVSKNAWLAYESHPTMVGMLRD--LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDP 82
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
Y + R+A+ + YLS+VE+GG T FPF + + VKP+ G+ L
Sbjct: 83 NHYPAEEGNRIATAIFYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLF 127
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L + D + H CPV+KG KW+ WI + Q
Sbjct: 128 WYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQ 165
>gi|334140935|ref|YP_004534141.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
gi|333938965|emb|CCA92323.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
Length = 209
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F QC ++IA + +PS +A G+ V RTSS +S L
Sbjct: 32 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPDVPAVAALA--- 83
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 203
K+ + + H E RYE+GQ++ +H D F P +Y QR +F++YL
Sbjct: 84 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 143
Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
+DV+ GG T F K I ++P RG + + + P+G+++ +LH +
Sbjct: 144 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 188
Query: 264 CPVIKGEKWVATKWIRDQ 281
V +G K+V T+W R++
Sbjct: 189 MKVRQGRKYVVTQWFRER 206
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 330 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 389
Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 390 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 434
Query: 273 VATKWIRDQEQ 283
+ WI + Q
Sbjct: 435 IGNVWIHEVTQ 445
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 101/207 (48%), Gaps = 28/207 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 340 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 395
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
++ + ++ + T L E V Y +G +Y+ H+D E P +RL
Sbjct: 396 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDE--PDAFKRLGTGN 453
Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G
Sbjct: 454 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 498
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D + H +CPV+ G KWV+ KW ++
Sbjct: 499 GDYRTRHAACPVLVGCKWVSNKWFHER 525
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 102/213 (47%), Gaps = 25/213 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 305 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 361
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 362 DTVDP--MLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 419
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 420 KSGNRVATFMIYLSAVEAGGATAFIYAN---------------FSVPVVKNAALFWWNLH 464
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 465 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 497
>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
Length = 446
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 19/157 (12%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
R+ FI +K +++ IE ++ + L + +++ Y IG Y H+D+F+
Sbjct: 296 RSGKNVFIEL--EKGELVKTIEMRVTDMSGLSMEGSDDLSLINYGIGGHYIPHHDSFSEE 353
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
E + R+A+ L YLSDVE GG T FP N L + P +G +L+
Sbjct: 354 E--NKTEDRIATALFYLSDVELGGATTFPLLN---------------LTISPEKGTAVLW 396
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
++L +GT ++H +CPVI G K+V TKWI + +Q
Sbjct: 397 HNLKDSGTPHPKTVHAACPVIVGSKYVMTKWIYNMDQ 433
>gi|397644755|gb|EJK76534.1| hypothetical protein THAOC_01697 [Thalassiosira oceanica]
Length = 475
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 26/206 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRL--KPSQLALRQGE-TVESTKGT-RTSSGTFISAS 137
P + F NF + ++C +I +K + + Q + + +S + T RTS + S
Sbjct: 273 PWVITFENFLTEDECTHMIEQGRKAEYERSEDVGEVQADGSYDSVRSTGRTSENAWCSFR 332
Query: 138 ED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+ I+EL+ +IA+ T + H E F +L+YE GQ Y H+D + + + R
Sbjct: 333 DGCRNDTIVELVHDRIAKVTGIGANHSEDFQILKYEPGQFYRQHHD-YIEHQRDRRCGPR 391
Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF---PN 252
+ +F LYLSDVEEGG T FP +G+ VKP+ G LL+ S+ P
Sbjct: 392 VLTFFLYLSDVEEGGATNFPK---------------LGIAVKPKVGRALLWPSVLNSEPR 436
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
RT H + VI G K+ A WI
Sbjct: 437 NKDGRTD-HEAQDVIAGVKYGANAWI 461
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 101/207 (48%), Gaps = 28/207 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 339 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
++ + ++ + T L E V Y +G +Y+ H+D E P +RL
Sbjct: 395 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDE--PDAFKRLGTGN 452
Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G
Sbjct: 453 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 497
Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
D + H +CPV+ G KWV+ KW ++
Sbjct: 498 GDYRTRHAACPVLVGCKWVSNKWFHER 524
>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 99/211 (46%), Gaps = 26/211 (12%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSS 130
+ ++ S P + + + + Q +I + ++R+ S Q +RQ E E RTS
Sbjct: 141 LKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSK 196
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH---YDAFNPAE 187
+ D +L+ I ++ T E ++L Y+ G +D H +D + E
Sbjct: 197 EAPFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYE 255
Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
Y P R AS + YL+DVE+GGET+FP + L + P +G L+++
Sbjct: 256 YHP-FGDRQASVVFYLNDVEDGGETVFP---------------KLQLVIPPTKGSALMWH 299
Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+L P G D + H SCPV+ G K VA +WI
Sbjct: 300 NLRPWGEGDPRTQHASCPVLSGYKQVAIQWI 330
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 67/138 (48%), Gaps = 19/138 (13%)
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YGPQMSQRLASFLLY 202
I +I+ T L E V Y +G +Y H+D E Q +R+A+FL+Y
Sbjct: 381 ITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQDGERIATFLIY 440
Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
LSDVE GG T F G+ KP +G + +Y++FP+G D + HG
Sbjct: 441 LSDVEVGGRTAFV---------------NAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHG 485
Query: 263 SCPVIKGEKWVATKWIRD 280
+CPV G KW KWIR+
Sbjct: 486 ACPVAFGNKWAGNKWIRE 503
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/223 (28%), Positives = 99/223 (44%), Gaps = 31/223 (13%)
Query: 72 SIPFQVL--------SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
S P+++L S P F + S + + A R++ S + R G + +
Sbjct: 313 SHPYRILAPLKLEEHSLDPYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKS 372
Query: 124 KGTRTSSGTFIS--ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
R S +++ A G+L + AT L T E V Y +G Y+ H+D
Sbjct: 373 -AFRVSKNAWLAYEAHPTMAGMLR----DLKDATGLDTTFCEQLQVANYGVGGHYEPHWD 427
Query: 182 AF-NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
F +P+ Y R+A+ + YLS+VE+GG T FPF + VKP+
Sbjct: 428 FFRDPSHYPAAEGNRIATAIFYLSEVEQGGATAFPF---------------LDFAVKPQL 472
Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G+ L +Y+L + D + H CPV+KG KW+ WI + Q
Sbjct: 473 GNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 77/158 (48%), Gaps = 18/158 (11%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
R S +++ T + L + + AT L T E V Y +G Y+ H+D F +P
Sbjct: 124 RVSKNAWLAYESHPTMVGMLRD--LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDP 181
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
Y + R+A+ + YLS+VE+GG T FPF + + VKP+ G+ L
Sbjct: 182 NHYPAEEGNRIATAIFYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLF 226
Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L + D + H CPV+KG KW+ WI + Q
Sbjct: 227 WYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQ 264
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 268 LQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+I T L + E V+ Y IG Y+ H+D A +P
Sbjct: 325 KSAWLKDTVDP--LLVNLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 382
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 383 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 427
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 428 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 466
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 97/219 (44%), Gaps = 41/219 (18%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
+S+P +V + + S +C+ +I LKPS + V+ G
Sbjct: 174 KSLPIEV---------YESILSEYECRYLITKFNALLKPSMV-------VDPVTGRGKID 217
Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS I + I ++ I++ T + +GEA N+LRY GQ+Y HYD N
Sbjct: 218 SVRTSYVAVIEPAHCD-WITRKLDKTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276
Query: 185 PAEYGPQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
QR+ + L+YL+ + EGGET+FP + +++ P+ G
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTISEGGETLFPK---------------LDIRIAPKSG 321
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
++F + NG + S H P + KW+ TKWIR+
Sbjct: 322 TMVVFSNSDENGKLLLNSYHAGAPTVSENKWLVTKWIRE 360
>gi|397615311|gb|EJK63351.1| hypothetical protein THAOC_15991 [Thalassiosira oceanica]
Length = 463
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 92/185 (49%), Gaps = 32/185 (17%)
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHK------IARATMLPQTHGE-------- 163
E + TRTS T++ +D I++ I + I A + P++ GE
Sbjct: 289 EARHDIRETRTSLNTWVYREKDL--IIDAIYRRAADLLRIDEALLRPRSAGEVPEMKNTR 346
Query: 164 ----AFNVLRYEIGQKYDSHYD-AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FE 217
A ++ YE+GQ+Y +H+D + P + Q + R A+ LLYL++ GGET FP +
Sbjct: 347 GLAEALQLVHYEVGQEYTAHHDFGYAPFDRKDQPA-RFATLLLYLNEGMVGGETQFPRWA 405
Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
N + GL V+P+ G +LFYS P+G +D S H + PV GEKW+ W
Sbjct: 406 NA---------ETRAGLDVEPKIGKAVLFYSQLPDGNMDDLSQHAARPVKIGEKWLMNLW 456
Query: 278 IRDQE 282
+ D E
Sbjct: 457 VWDPE 461
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRV 452
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP G + P++G + +Y+LF +G D
Sbjct: 453 ATFLNYMSDVEAGGATVFP---------------DFGATIWPKKGTSVFWYNLFRSGEGD 497
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWFHERGQ 524
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 98/210 (46%), Gaps = 27/210 (12%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ + LS P + + + A + + I+ AK L+ + + + + R +
Sbjct: 296 LKLEQLSLDPYMVLYHDVVQANEREHIMQLAKPHLRRALVGAARAHS------QRFAMNA 349
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPAEY 188
S ++ + G + + ++ + T+ VL Y IG +Y HYD + + A+
Sbjct: 350 GFSYNDSRQG--QRLRQRLEDMSGFDLTNSGQLAVLNYGIGGQYYMHYDCWFSQDDAAQV 407
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
R+A+ LLYL+DV+ GG T FP +GL V+P G L++++
Sbjct: 408 ASIKDNRIATILLYLTDVQLGGLTSFP---------------ALGLAVQPSPGSALIWHN 452
Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
+ DR +LH +CP++ G +WVAT+WI
Sbjct: 453 MNNAAECDRRTLHAACPLLLGTRWVATQWI 482
>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
Length = 557
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 112/241 (46%), Gaps = 46/241 (19%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
+S P F NF +C+ + A K LK S++ T RTSS F+
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRV------TDGKLSNGRTSSSCFLIG 371
Query: 137 SEDKTGILELIEHKI---ARATMLPQTH---------GEAFNVLRYEIGQKYDSHYDAFN 184
++ K +++ IE ++ R+T + T E ++RY +KY SH+D N
Sbjct: 372 AKGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYGKNEKYTSHFD--N 429
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----------YKKCI-- 232
A +R+A+F+ YLSD EGG T FP +FL+ +D KK +
Sbjct: 430 KA----GSFRRVATFMCYLSDQCEGGCTNFPKAEPLFLEPSFDEHGAFKPFGRKKKTVAS 485
Query: 233 ---GLKVKPRRGDGLLFYSL----FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
G+K+ P+ G +LF+S+ F + SLH V KGEK++ TKW+ E+ E
Sbjct: 486 EQHGVKIHPKLGRAILFFSISEEPFRENPL---SLHEGQTVRKGEKFICTKWLTRTEESE 542
Query: 286 D 286
+
Sbjct: 543 N 543
>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
Length = 521
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 85/185 (45%), Gaps = 24/185 (12%)
Query: 105 KRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE 163
KRL P Q G TK T ++ ++ + T LE + +I T
Sbjct: 344 KRLSPQMQNGYIHGYKANQTKVTDIAAR--VNWLVENTPFLERMNQRITDMTGFDLKEFP 401
Query: 164 AFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEN 218
+ V + IG +++HYD + G + RLAS + Y SDV GG T+FP
Sbjct: 402 SVQVANFGIGNNFEAHYDYIFGKRVRKEDVG-DLGDRLASIIFYSSDVPLGGATVFP--- 457
Query: 219 GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
I + V+P++G+ LL+Y+LF +GT D SLH CPV+ G +W TKW+
Sbjct: 458 ------------DIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRWTLTKWL 505
Query: 279 RDQEQ 283
Q
Sbjct: 506 HTSPQ 510
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ I ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 96/193 (49%), Gaps = 22/193 (11%)
Query: 94 EQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
E+ + I AK +L ++ +R +T V + R S +++ ED ++ + ++
Sbjct: 2 EEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRRMQ 57
Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLASFLLYLSDVEEGG 210
T L E V Y +G +Y+ H+D + P + G + RLA+FL Y+SDVE GG
Sbjct: 58 HITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGG 117
Query: 211 ETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE 270
T+FP +G + P++G + +Y+L +G D + H +CPV+ G
Sbjct: 118 ATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGC 162
Query: 271 KWVATKWIRDQEQ 283
KWV+ KW ++ Q
Sbjct: 163 KWVSNKWFHERGQ 175
>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 490
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 101/214 (47%), Gaps = 34/214 (15%)
Query: 78 LSWRPRALYFPNFASAEQCQSII-----ATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+S P + F NF + E+C +I A ++ ++ + +KG RTS
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKG-RTSENA 342
Query: 133 FISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--- 187
+ S + T +LI +I+ T +P H E F +L+YE GQ Y SH+D E
Sbjct: 343 WCSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRR 402
Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
GP R+ +F LYLSDVEEGG+T FP + + VKP++G +L+
Sbjct: 403 CGP----RVLTFFLYLSDVEEGGDTNFPK---------------LSIAVKPKKGSAVLWP 443
Query: 248 SLF---PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
S+ P+ RT H + V+ G K+ A W+
Sbjct: 444 SVLDSNPSMKDPRTD-HEAQEVVNGTKFGANAWL 476
>gi|399057802|ref|ZP_10744231.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
gi|398041550|gb|EJL34606.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
Length = 210
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 101/220 (45%), Gaps = 32/220 (14%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
A +Q P S R NF +AEQC ++A + +PS +A G+ R
Sbjct: 16 AGVQRFP----SSRLDLFILKNFVAAEQCAELMALIEDSHRPSTIADYNGD-----DAFR 66
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
TSS +S ++ + ++R + + H E RYE+GQ++ +H D F P
Sbjct: 67 TSSTCDLSTD---VPVVANLAAALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPGN 123
Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
+Y QR +F++YL++VE GG T F + I ++P G
Sbjct: 124 ADYDKYCAVPGQRTWTFMIYLNEVEAGGATRF---------------RVIDKMIQPEIGK 168
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
+ + + P+GT + +LH + V KG K+V T+W R++
Sbjct: 169 LIAWNNRRPDGTPNAATLHHAMKVRKGYKYVITQWYRERH 208
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 32/211 (15%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ + ++ T L E V Y +G +Y+ H+D AF G
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAFKHLGTG--- 448
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +
Sbjct: 449 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 492
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 493 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
melanoleuca]
Length = 539
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/219 (29%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 329 LQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 385
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+I T L + E V+ Y IG Y+ H+D A +P
Sbjct: 386 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 443
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 444 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 488
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 489 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 527
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 96/219 (43%), Gaps = 30/219 (13%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++L + P + F S + + I A +LK + + + +E R S
Sbjct: 316 IKVEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDLEYA-NYRISKSA 374
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFN 184
++ ++ ++ I +I T L Q E Y IG YD H+D AF
Sbjct: 375 WLKGTDHPA--IDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFDFARKEDINAFK 432
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
G R+A+ L+Y+SDVE GG T+F +G V P + D L
Sbjct: 433 TLNTG----NRIATILIYMSDVESGGATVF---------------NHLGNAVFPSKYDAL 473
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+Y+L +G D + H +CPV+ G KWV+ KWI D+ Q
Sbjct: 474 FWYNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQ 512
>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
Length = 545
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 105/219 (47%), Gaps = 24/219 (10%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + + +F + + Q I A+ + +++ + GE + R S
Sbjct: 334 LQPIRKEVIHLEPYVVLYHDFVTDAEAQKIRGLAEPWVT-AEILVASGEKQLPVE-YRIS 391
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 392 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 449
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 450 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 494
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 495 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 533
>gi|326928035|ref|XP_003210190.1| PREDICTED: WD repeat-containing protein 6-like [Meleagris
gallopavo]
Length = 900
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 89/195 (45%), Gaps = 34/195 (17%)
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP---QTHGEAFNVLRYEIGQ 174
+ V+++ R S T++ E ++ I ++ R T LP H E V+RY+ G
Sbjct: 134 QKVKTSDAVRNSQHTWLYQGEGAHQVMRAIRQRVMRLTRLPPEIVEHSEPLQVVRYDQGG 193
Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP----- 215
Y +H D+ E P + S R + L YL++V GGET+FP
Sbjct: 194 HYHAHMDSGPVFPETACSHTKLVANESAPFETSCRYVTVLFYLNNVTGGGETVFPIADNR 253
Query: 216 -FENGIFLDSGYDY----KKCI--GLKVKPRRGDGLLFYSLFPNGT-----IDRTSLHGS 263
+E + + D K C L+VKP++G + +Y+ +G +D +LHG
Sbjct: 254 TYEEMSLIQNDVDLRDTRKNCDKGNLRVKPQQGTAVFWYNYLSDGEGWVGELDDFALHGG 313
Query: 264 CPVIKGEKWVATKWI 278
C V +G KW+A WI
Sbjct: 314 CLVTQGTKWIANNWI 328
>gi|400602974|gb|EJP70572.1| 2OG-Fe(II) oxygenase family Oxidoreductase [Beauveria bassiana
ARSEF 2860]
Length = 269
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 102/219 (46%), Gaps = 16/219 (7%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
Q+ ++LS P A+Y NF + + + ++A + KPS++A G V +T R+S
Sbjct: 43 QNYTVEILSIDPLAIYLNNFLNDAEIRYLLALGENIYKPSEVASHSGIIVNTT--VRSSE 100
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG-QKYDSHYDAFNPAEYG 189
F+ ED LI + + H E+ +++Y G +Y H D A+
Sbjct: 101 SAFLL--EDDAVCNCLISRMKSLLGNVQHEHVESLQMVKYAAGGDRYRLHTDWSVAAKNN 158
Query: 190 P-------QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKCIGLKVKP 238
+ S+RL + +YL D GGET FP G+ D+ + K+ GL V+P
Sbjct: 159 TDEASGKLRQSRRLGTIFVYLEDSCAGGETYFPLLTGVSDDADGEKFAVAKQGGGLLVRP 218
Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
+RG+G+ + ++ NGT D +H P+ G K W
Sbjct: 219 KRGNGVFWNNIHSNGTGDDRVVHAGLPIKSGVKIGLNMW 257
>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
Length = 197
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 104/217 (47%), Gaps = 35/217 (16%)
Query: 70 IQSIPFQVLSW-------RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
I + F ++ W +PR L N S ++C+ I A K+L+ S ++ ++ +
Sbjct: 5 ILFVSFLLIIWFFIPIYEKPRVL--KNVLSEDECKHIQNIASKKLQTSTVS----KSRDI 58
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
+ R S ++ ASED +++ + K T P + E VL+Y+ G Y H D
Sbjct: 59 DESIRKSETAWLKASED--PVVDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDT 116
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
F P + ++R+ +F++ L+D EGGET FP + KK L+ +GD
Sbjct: 117 F-PDD----KNKRMYTFIIALNDEYEGGETEFP-----------NIKKSYRLE----KGD 156
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
L F +L I + +LHG PV GEKWV W+R
Sbjct: 157 ALFFNTLNNYECITKKALHGGTPVKSGEKWVCNLWVR 193
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|313768105|ref|YP_004061536.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
gi|312599712|gb|ADQ91733.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
Length = 197
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 100/217 (46%), Gaps = 35/217 (16%)
Query: 70 IQSIPFQVLSW-------RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
I + F ++ W +PR L N S ++C+ I A K+L+ S +++ + +
Sbjct: 5 ILFVSFLLIIWFFIPIYEKPRVL--KNVLSEDECKHIQDIASKKLQTSTVSMSR----DI 58
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
+ R S ++ ASED +++ + K T P + E VL+Y+ G Y H D
Sbjct: 59 DEKIRKSETAWLKASED--PVVDKLIRKCVSMTDRPLHNCEDLQVLKYKPGGFYKPHQDC 116
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
F + ++R+ +F++ L+D EGGET FP I + + +GD
Sbjct: 117 FKNDK-----NKRMYTFIIALNDEYEGGETEFPN---------------IKRRYRLEKGD 156
Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
L F +L + +LHG PV GEKWV WIR
Sbjct: 157 ALFFNTLNNYECTTKQALHGGAPVKSGEKWVCNLWIR 193
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 453
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 474
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 475 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 519
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 520 YRTRHAACPVLVGCKWVSNKWFHERGQ 546
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 483
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 484 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 528
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 529 YRTRHAACPVLVGCKWVSNKWFHERGQ 555
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 104/217 (47%), Gaps = 27/217 (12%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF +V+ P + + +F + + Q I A+ L+ S +A GE + R S
Sbjct: 336 PFRKEVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVA--SGEK-QLQVDYRISKS 392
Query: 132 TFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPAE- 187
++ S D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 393 AWLKDSVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSP 450
Query: 188 -YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
Y + R+A+F++YLS VE GG T F + N + + L +
Sbjct: 451 LYRMKSGNRVATFMIYLSAVEAGGATAFIYAN---------------FSMPVVKNAALFW 495
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 496 WNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 453
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 493
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 494 ATFLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEGD 538
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 539 YRTRHAACPVLVGCKWVSNKWFHERGQ 565
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRV 422
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 423 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
[Meleagris gallopavo]
Length = 518
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 100/204 (49%), Gaps = 34/204 (16%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
+ ++ I +I T L + E QK + DAF G R+A++
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEEL--------QKDEP--DAFKELGTG----NRIATW 437
Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
L Y+SDV GG T+FP +G V P++G + +Y+LFP+G D ++
Sbjct: 438 LFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDYST 482
Query: 260 LHGSCPVIKGEKWVATKWIRDQEQ 283
H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 483 RHAACPVLVGNKWVSNKWLHERGQ 506
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 78/162 (48%), Gaps = 29/162 (17%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTS ++I SE + ++ T + F+++ Y +G Y HYD
Sbjct: 318 VRTSKDSYIVDSES-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFH-- 368
Query: 186 AEYG----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
EY P+ R+A+ L YL +V+ GG T+FP I + V P++G
Sbjct: 369 -EYTNTTRPKQGDRIATVLFYLGEVDSGGATIFP---------------KINIAVTPKKG 412
Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ +Y+L +G ++ SLH +CPVI G K+V TKWI + Q
Sbjct: 413 SAVFWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 454
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 96/217 (44%), Gaps = 23/217 (10%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +VLS +P + + N + + + A LK + + + + R S
Sbjct: 343 LNVEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTA 402
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP------A 186
++ ED + + I I L E + Y IG Y+ H D +
Sbjct: 403 WLD-KEDHPAV-KRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALS 460
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
EY ++ R+A+ L+YLS+VE GG T+FP G++V+PR+G +
Sbjct: 461 EYTSRIGNRIATVLIYLSNVEAGGATVFP---------------KAGVRVEPRQGSAAFW 505
Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
Y++ NG ++ S+H +CPV+ G KW A W R+ Q
Sbjct: 506 YNMHRNGEGNKLSVHAACPVLIGSKWAANLWFREVGQ 542
>gi|395516293|ref|XP_003762325.1| PREDICTED: transmembrane prolyl 4-hydroxylase [Sarcophilus
harrisii]
Length = 517
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 40/208 (19%)
Query: 111 QLALRQ------GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---H 161
QL LR + V+ + R S T++ E ++ I ++ R T LP+ H
Sbjct: 267 QLNLRDFHKYMGSQKVKVSDLVRNSQHTWLYQGEGAHQVMRSIRQRVLRLTRLPRQIVEH 326
Query: 162 GEAFNVLRYEIGQKYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVE 207
E V+RY+ G Y +H D+ E P + S R + L YL++V
Sbjct: 327 SEPMQVVRYDQGGHYHAHMDSGPVFPETACSHTKLIANETSPFETSCRYVTVLFYLNNVT 386
Query: 208 EGGETMFP------------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT- 254
GGET FP +N + L + L+VKPR+G + +Y+ +G
Sbjct: 387 GGGETTFPVADNRTYDELSLIQNDVDLRDTRKHCDKGNLRVKPRQGTAVFWYNYLSDGQG 446
Query: 255 ----IDRTSLHGSCPVIKGEKWVATKWI 278
+D SLHG C V +G KW+A WI
Sbjct: 447 WVGELDEYSLHGGCLVTQGTKWIANNWI 474
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 100/220 (45%), Gaps = 32/220 (14%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A+ F + S E+ + I A RLK + + + +E T R S
Sbjct: 319 PFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELE-TAAYRISKS 377
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AF 183
++ + + +++ I +I T L Q E + Y +G YD H+D AF
Sbjct: 378 AWLKGGDHE--LIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAF 435
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
G RLA+ L YL++ E GG T+F + V P +
Sbjct: 436 ESLGTG----NRLATVLFYLTEPEIGGGTVF---------------TELRTAVMPSKNGA 476
Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
L +Y+L+ +G D + H +CPV+ G KWVA KWI ++ Q
Sbjct: 477 LFWYNLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQ 516
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 422
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 423 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRV 452
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 453 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 497
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 YRTRHAACPVLVGCKWVSNKWFHERGQ 524
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 95/193 (49%), Gaps = 29/193 (15%)
Query: 100 IATAKKRLKPSQLALRQGETVESTK----GTRTSSGTFISASEDKTGILELIEHKIARAT 155
+ +A+ L+ Q + + + ++ +RT+S +I D + I+ + + +
Sbjct: 9 LMSARPLLRLDQARVERATVITDSEHQFHDSRTNSYAWIQ--HDASEIIHEVSKRFSILV 66
Query: 156 MLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM----SQRLASFLLYLSDVEEGG 210
+P + E F ++ Y G +Y H+DAF+ + E G QR+ + L YL+DVE+GG
Sbjct: 67 KMPINNAEQFQLVHYGPGTEYKPHFDAFDKSTEEGRNNWFPGGQRMVTALAYLNDVEDGG 126
Query: 211 ETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT--IDRTSLHGSCPVIK 268
T FP I + VKP +GD ++F++ +GT I+ SLHG PVI
Sbjct: 127 ATDFP---------------DIHVSVKPNKGDVVVFHNC-KDGTSDINPNSLHGGSPVIS 170
Query: 269 GEKWVATKWIRDQ 281
GEKW W R +
Sbjct: 171 GEKWAVNLWFRQE 183
>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 531
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 91/200 (45%), Gaps = 31/200 (15%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS-ASE 138
P L F + E + I A RL+PS++ + Q T R S F A E
Sbjct: 342 PDVLVFHEMITEEVAEKIRDVANPRLRPSEVIDPIIQKHV---TASYRVSKNVFFDDAFE 398
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------EYGPQM 192
++ I + + AT L E V Y +G +Y+ H D +P E+G
Sbjct: 399 EELEISRKLRPLVEDATDLNDDFSEQLQVNNYGLGGQYEFHVDFGDPGSPLDKHEHG--- 455
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+ L+YLSDVE GG+T+F +GL +KP+ GD +++L+ N
Sbjct: 456 -NRIATLLIYLSDVERGGDTVFT---------------RLGLSLKPKLGDAAFWHNLYKN 499
Query: 253 GTIDRTSLHGSCPVIKGEKW 272
G+ + H SCPV+ G KW
Sbjct: 500 GSGIYATEHASCPVVSGSKW 519
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRV 451
Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
familiaris]
Length = 544
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F + + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPVRKEVIHLEPYVVLYHDFVNDVEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+I T L + E V+ Y IG Y+ H+D A +P
Sbjct: 391 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 493
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 494 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 32/211 (15%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ + ++ T L E V Y +G +Y+ H+D AF G
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTG--- 450
Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
R+A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +
Sbjct: 451 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 494
Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 495 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 98/213 (46%), Gaps = 23/213 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L+ P YF ++ + + II ++ S++ Q T T RTS+
Sbjct: 329 PFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQSQNAT---TSEIRTSAN 385
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
T++ +E+ L I+ ++ T L E ++ Y IG +Y+ H+D P +
Sbjct: 386 TWLWYNENPW--LSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEPHFDFVEEPQKVFG 443
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
R+ + L Y++DV GG T FPF + L V P +G L++Y+L
Sbjct: 444 WKGNRMLTALFYINDVALGGATAFPF---------------LQLAVPPVKGSLLVWYNLH 488
Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+ D + H CPVIKG KW+ +W + Q
Sbjct: 489 RSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQ 521
>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 518
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 87/188 (46%), Gaps = 25/188 (13%)
Query: 107 LKPSQLALR--QGETVESTKGT------RTSSGTFISASE-DKTGILELIEHKIARATML 157
LK LAL + TV+S G +T SG S+ D L+ ++ +I T
Sbjct: 334 LKIKTLALENMKDATVKSVDGKGDSLIEKTRSGQVYWISKVDAVEYLDALDTRIESFTGF 393
Query: 158 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
E + ++ Y +G Y H+D+F A Q RL + L YL+DV+ G T FP
Sbjct: 394 STKTAEQYQIVNYGLGGHYLPHHDSFAKAINCLQFGNRLVTVLFYLTDVQNDGYTSFPLL 453
Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL-FPNGTIDRTSLHGSCPVIKGEKWVATK 276
N I +G L++ +L NG SLHGSCP++KG KW+ T+
Sbjct: 454 NII---------------APAEKGAALVWNNLHMSNGQKFYESLHGSCPLLKGNKWIMTR 498
Query: 277 WIRDQEQH 284
W+ ++ QH
Sbjct: 499 WLYEEGQH 506
>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
Length = 533
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 64/219 (29%), Positives = 103/219 (47%), Gaps = 25/219 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + + +F + + Q I A+ L+ S +A GE + R S
Sbjct: 323 LQPIRKEVIHLEPYVVLYHDFVNDLEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 379
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+I T L + E V+ Y IG Y+ H+D A +P
Sbjct: 380 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 437
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
Y + R+A+F++YLS VE GG T F + N V + L
Sbjct: 438 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 482
Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
+++L +G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 483 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 521
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,646,853,308
Number of Sequences: 23463169
Number of extensions: 194179654
Number of successful extensions: 384605
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1423
Number of HSP's successfully gapped in prelim test: 591
Number of HSP's that attempted gapping in prelim test: 379607
Number of HSP's gapped (non-prelim): 2250
length of query: 286
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 145
effective length of database: 9,050,888,538
effective search space: 1312378838010
effective search space used: 1312378838010
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)