BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028194
(212 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 375 bits (964), Expect = e-102, Method: Compositional matrix adjust.
Identities = 171/211 (81%), Positives = 192/211 (90%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRTSSGTFI
Sbjct: 78 FQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFI 137
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASED TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EYGPQ SQ
Sbjct: 138 SASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQ 197
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFENG + +GYDYK+CIGLKVKPR+GDGLLFYS+FPNGT
Sbjct: 198 RIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFYSVFPNGT 257
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 258 IDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/211 (81%), Positives = 195/211 (92%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRALYFP FA+AEQCQSII AK L+PS LALRQGET ESTKGTRTSSGTFI
Sbjct: 74 FQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGETDESTKGTRTSSGTFI 133
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SHYDAFNPAEYGPQ SQ
Sbjct: 134 SASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQ 193
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFE+ + + +GYDYKKCIGLKVKP+RGDGLLFYS+FPNGT
Sbjct: 194 RVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGT 253
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
IDRTSLHGSCPVI GEKWVATKWIRD++Q +
Sbjct: 254 IDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 284
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/211 (81%), Positives = 195/211 (92%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRALYFP FA+AEQCQSII AK L+PS LALRQGET ESTKGTRTSSGTFI
Sbjct: 66 FQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGETDESTKGTRTSSGTFI 125
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SHYDAFNPAEYGPQ SQ
Sbjct: 126 SASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQ 185
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFE+ + + +GYDYKKCIGLKVKP+RGDGLLFYS+FPNGT
Sbjct: 186 RVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQRGDGLLFYSVFPNGT 245
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
IDRTSLHGSCPVI GEKWVATKWIRD++Q +
Sbjct: 246 IDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 276
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 171/210 (81%), Positives = 194/210 (92%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
QVLSW+PRA+YFP+FA+ EQC++II AK RLKPS LALR+GET ESTKGTRTSSGTF+S
Sbjct: 77 QVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTRTSSGTFLS 136
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
ASED TG L+ IEHKIARATM+P++HGEAFN+LRYEIGQKYDSHYD+FNPAEYGPQMSQR
Sbjct: 137 ASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQMSQR 196
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYLSDVE+GGETMFPFENG+ + S YDYKKC GLKVKPR+GDG+LFYSL PNGTI
Sbjct: 197 VASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCAGLKVKPRQGDGILFYSLLPNGTI 256
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
D+TSLHGSCPVI+GEKWVATKWIRDQ Q +
Sbjct: 257 DQTSLHGSCPVIEGEKWVATKWIRDQVQMD 286
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 372 bits (956), Expect = e-101, Method: Compositional matrix adjust.
Identities = 170/211 (80%), Positives = 191/211 (90%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRTSSGTFI
Sbjct: 78 FQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFI 137
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EYGPQ SQ
Sbjct: 138 SASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQ 197
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFENG + GYDYK+CIGLKVKPR+GDGLLFYS+FPNGT
Sbjct: 198 RIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGT 257
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 258 IDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 372 bits (955), Expect = e-101, Method: Compositional matrix adjust.
Identities = 170/211 (80%), Positives = 191/211 (90%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRTSSGTFI
Sbjct: 78 FQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFI 137
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EYGPQ SQ
Sbjct: 138 SASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQ 197
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFENG + GYDYK+CIGLKVKPR+GDGLLFYS+FPNGT
Sbjct: 198 RIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGT 257
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 258 IDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 367 bits (943), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 170/211 (80%), Positives = 190/211 (90%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRALYFPNFA+AEQCQS+I AK L PS LALR+GET E+TKG RTSSG F+
Sbjct: 80 FQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLALRKGETEENTKGIRTSSGMFL 139
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKTG+L+ IE KIARATMLP+ +GEAFN+LRYEIGQKY+SHYDAFNPAEYGPQ SQ
Sbjct: 140 SASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKYNSHYDAFNPAEYGPQKSQ 199
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFEN + +D YD++KCIGL+V+PRRGDGLLFYSLFPN T
Sbjct: 200 RVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIGLQVRPRRGDGLLFYSLFPNNT 259
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 260 IDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 290
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 365 bits (936), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 171/211 (81%), Positives = 190/211 (90%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW PRALYFPNFAS EQCQSII AK ++PS LALR GET E+TKG RTSSGTFI
Sbjct: 76 FQVLSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLALRTGETEETTKGIRTSSGTFI 135
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKTGIL+LIE KIA+ATM+P+THGEAFNVLRYEIGQ+Y SHYDAF+PA+YGPQ SQ
Sbjct: 136 SASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQ 195
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R ASFLLYLSDVEEGGET+FP+ENG +D+ YD+ KCIGLKVKPRRGDGLLFYSLFPNGT
Sbjct: 196 RAASFLLYLSDVEEGGETVFPYENGQNMDASYDFSKCIGLKVKPRRGDGLLFYSLFPNGT 255
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TSLHGSCPVI+GEKWVATKWIR+Q+Q +
Sbjct: 256 IDLTSLHGSCPVIRGEKWVATKWIRNQDQDD 286
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 360 bits (924), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 167/211 (79%), Positives = 187/211 (88%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRALYFPNFA+AEQC++II AK LKPS LALRQGET E+TKG RTSSG F+
Sbjct: 83 FQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQGETEENTKGIRTSSGVFV 142
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SAS DKTG L +IE KIARATM+P++HGEAFN+LRYE+ Q+Y+SHYDAFNPAEYGPQ SQ
Sbjct: 143 SASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYNSHYDAFNPAEYGPQKSQ 202
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ CIGLKVKPR+GDGLLFYSL NGT
Sbjct: 203 RMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVKPRQGDGLLFYSLLTNGT 262
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 263 IDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 293
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 359 bits (921), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 164/211 (77%), Positives = 189/211 (89%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRALYFP FA+AEQCQSI+ AK +L+PS LALR+GET ESTKG RTSSG F
Sbjct: 81 FQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKGVRTSSGVFF 140
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASED++G L +IE KIARATM+P+THGEA+N+LRYEIGQKY+SHYDAF P+EYGPQ SQ
Sbjct: 141 SASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQKSQ 200
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DVEEGGETMFPFENG+ +D Y+++ CIGLKVKPR+GDGLLFYS+FPNGT
Sbjct: 201 RVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGT 260
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TSLHGSCPVIKG+KWVATKWIRDQ Q +
Sbjct: 261 IDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 358 bits (920), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 165/208 (79%), Positives = 186/208 (89%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRA+YFPNFA+AEQC+SII AK LKPS LALRQGET ++TKG RTSSG F+
Sbjct: 83 FQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQGETEDNTKGIRTSSGVFV 142
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKT L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+SHYDAFNPAEYGPQ SQ
Sbjct: 143 SASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNPAEYGPQKSQ 202
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ CIGLKVKPR+GDGLLFYSL NGT
Sbjct: 203 RMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVKPRQGDGLLFYSLLTNGT 262
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
ID TSLHGSCPVIKGEKWVATKWIRDQE
Sbjct: 263 IDPTSLHGSCPVIKGEKWVATKWIRDQE 290
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 357 bits (917), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 165/208 (79%), Positives = 186/208 (89%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRA+YFPNFA+AEQC+SII AK LKPS LALRQGET ++TKG RTSSG F+
Sbjct: 83 FQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQGETEDNTKGIRTSSGVFV 142
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKT L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+SHYDAFNPAEYGPQ SQ
Sbjct: 143 SASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNPAEYGPQKSQ 202
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ CIGLKVKPR+GDGLLFYSL NGT
Sbjct: 203 RMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEGCIGLKVKPRQGDGLLFYSLLTNGT 262
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
ID TSLHGSCPVIKGEKWVATKWIRDQE
Sbjct: 263 IDPTSLHGSCPVIKGEKWVATKWIRDQE 290
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 357 bits (916), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 163/211 (77%), Positives = 188/211 (89%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRALYFP FA+AEQCQSI+ AK +L+PS LALR+GET ESTKG RTSSG F
Sbjct: 81 FQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKGVRTSSGVFF 140
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASED++G L +IE K ARATM+P+THGEA+N+LRYEIGQKY+SHYDAF P+EYGPQ SQ
Sbjct: 141 SASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQKSQ 200
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DVEEGGETMFPFENG+ +D Y+++ CIGLKVKPR+GDGLLFYS+FPNGT
Sbjct: 201 RVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPRQGDGLLFYSVFPNGT 260
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TSLHGSCPVIKG+KWVATKWIRDQ Q +
Sbjct: 261 IDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 357 bits (916), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 161/212 (75%), Positives = 189/212 (89%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRALYFPNFA+AEQC++I++ AK LKPS LALR+GET E+TKG RTSSG F+
Sbjct: 85 FQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTKGIRTSSGVFL 144
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SAS DKT LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y+SHYDAFNP EYGPQ SQ
Sbjct: 145 SASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFNPDEYGPQKSQ 204
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ C+GL+VKPR+GDGLLFYSL PNGT
Sbjct: 205 RVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDCVGLRVKPRQGDGLLFYSLLPNGT 264
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 212
ID+TSLHGSCPVIKGEKWVATKWIR+ +Q +D
Sbjct: 265 IDQTSLHGSCPVIKGEKWVATKWIRNLDQEDD 296
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 356 bits (914), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 164/211 (77%), Positives = 186/211 (88%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRALYFPNFA++EQCQSII AK L PS +ALR GE +T+G RTSSG FI
Sbjct: 75 FQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVGEIRGNTEGIRTSSGVFI 134
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKTG L+LIE KIAR M+P+THGEAFNVLRYEIGQ+Y+SHYDAF+PAEYGPQ S
Sbjct: 135 SASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSHYDAFDPAEYGPQKSH 194
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+FL+YLSDVEEGGETMFPFENG+ +D YD+++CIGLKVKP +GDGLLFYS+FPNGT
Sbjct: 195 RIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGT 254
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 255 IDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 285
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 353 bits (907), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 161/209 (77%), Positives = 186/209 (88%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSWRPRA++FPNF S E CQ II AK +L+PS+LALR+GET ESTK TRTSSGTFIS
Sbjct: 77 QILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETAESTKDTRTSSGTFIS 136
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
ASEDK+GIL+L+E KIA+ TM+P+THGE FN+L+YE+GQKYDSHYDAFNP EYG SQR
Sbjct: 137 ASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSHYDAFNPDEYGSVESQR 196
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYLS+VE GGETMFP+E G+ +D GYDY+KCIGLKVKPR+GDGLLFYSL PNG I
Sbjct: 197 IASFLLYLSNVEAGGETMFPYEGGLNIDRGYDYQKCIGLKVKPRQGDGLLFYSLLPNGKI 256
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
D+TSLHGSCPVIKGEKWVATKWI D+EQH
Sbjct: 257 DKTSLHGSCPVIKGEKWVATKWIDDREQH 285
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 352 bits (902), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 163/207 (78%), Positives = 183/207 (88%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRALYFP FA+ EQC+SII + +LKPS LALR+GET ESTK TRTSSG+F+
Sbjct: 82 FQVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTLALRKGETAESTKDTRTSSGSFV 141
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S SED+TG L+ IE KIA+ATM+PQ+HGEAFN+LRYEIGQKYDSHYDAFNP EYG Q SQ
Sbjct: 142 SGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDEYGQQSSQ 201
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R ASFLLYLS+VEEGGETMFPFENG + G+DYK+C+GLKVKPR+GDGLLFYSLFPNGT
Sbjct: 202 RTASFLLYLSNVEEGGETMFPFENGSAVIPGFDYKQCVGLKVKPRQGDGLLFYSLFPNGT 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
ID TSLHGSCPVIKG KWVATKWIRDQ
Sbjct: 262 IDPTSLHGSCPVIKGVKWVATKWIRDQ 288
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 350 bits (897), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 160/212 (75%), Positives = 187/212 (88%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRALYFPNFA+AEQC++I++ AK LKPS LALR+GET E+TKG RTSSG F+
Sbjct: 85 FQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTKGIRTSSGVFL 144
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SAS DKT LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y SHYDAFNP EYGPQ SQ
Sbjct: 145 SASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFNPDEYGPQKSQ 204
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DVEEGGETMFPFENG+ +D Y Y+ +GL+VKPR+GDGLLFYSL PNGT
Sbjct: 205 RVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDRVGLRVKPRQGDGLLFYSLLPNGT 264
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 212
ID+TSLHGSCPVIKGEKWVATKWIR+ +Q +D
Sbjct: 265 IDQTSLHGSCPVIKGEKWVATKWIRNLDQEDD 296
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 349 bits (895), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 161/211 (76%), Positives = 184/211 (87%), Gaps = 1/211 (0%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
Q+LSWRPRA+YFPNF S E CQ II AK +L+PS+LALR+GET ESTK TRTSSGTFI
Sbjct: 75 FQILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETAESTKDTRTSSGTFI 134
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDK+GIL+ +E KIA+ TM+P+THGE FN+L+YE+ QKYDSHYDAFNP EYG SQ
Sbjct: 135 SASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHYDAFNPDEYGTVESQ 194
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSG-YDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+ASFLLYLS+VE GGETMFP+E G+ +D G YDYKKCIGLKVKPR+GDGLLFYSL PNG
Sbjct: 195 RIASFLLYLSNVEAGGETMFPYEGGLNIDKGYYDYKKCIGLKVKPRQGDGLLFYSLLPNG 254
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
ID+TSLHGSCPVIKGEKWVATKWI D+EQH
Sbjct: 255 KIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 347 bits (890), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 161/206 (78%), Positives = 183/206 (88%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG RTSSGTF+S
Sbjct: 87 QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGIRTSSGTFLS 146
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
A+ED T L IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA+YGPQ +QR
Sbjct: 147 ANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQR 206
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYL+DVEEGGETMFP+EN +D GYDY+KCIGLKVKPR+GDGLLFYSL NGTI
Sbjct: 207 VASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTI 266
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
DRTSLHGSCPVIKGEKWVATKWIRD
Sbjct: 267 DRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 347 bits (890), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 161/206 (78%), Positives = 183/206 (88%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG RTSSGTF+S
Sbjct: 87 QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGIRTSSGTFLS 146
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
A+ED T L IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA+YGPQ +QR
Sbjct: 147 ANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQR 206
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYL+DVEEGGETMFP+EN +D GYDY+KCIGLKVKPR+GDGLLFYSL NGTI
Sbjct: 207 VASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTI 266
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
DRTSLHGSCPVIKGEKWVATKWIRD
Sbjct: 267 DRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 347 bits (889), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 160/211 (75%), Positives = 183/211 (86%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW PRALYFPNFASAEQC II AK L PS+L LR+GET E TKG RTSSG FI
Sbjct: 83 FQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLMLREGETEEGTKGIRTSSGMFI 142
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKTG+LE+I+ KIARA +P+THG A+N+LRY++GQKY+SHYDAFNPAEYGPQ SQ
Sbjct: 143 SASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDAFNPAEYGPQESQ 202
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DV EGGETMFPFENG +DS Y+++ CIGLK+KP +GDGLLFYSLFPNGT
Sbjct: 203 RVASFLLYLTDVPEGGETMFPFENGSNMDSSYNFEDCIGLKIKPLKGDGLLFYSLFPNGT 262
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TSLHGSCPVIKGEKWVATKWIR+Q ++
Sbjct: 263 IDPTSLHGSCPVIKGEKWVATKWIREQLHYD 293
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 347 bits (889), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 160/206 (77%), Positives = 183/206 (88%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG RTSSGTF+S
Sbjct: 87 QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGIRTSSGTFLS 146
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
A+ED T L IE KIARATM+P+ HGE FNVLRY IGQ+Y SHYDAF+P +YGPQ SQR
Sbjct: 147 ANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPVQYGPQKSQR 206
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYL++VEEGGETMFP+ENG +D GYDY+KCIGLKVKPR+GDGLLFYSL NGTI
Sbjct: 207 VASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTI 266
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
DRTSLHGSCPVIKGEKWVATKWIRD
Sbjct: 267 DRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 347 bits (889), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 164/207 (79%), Positives = 180/207 (86%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW P ALYFPNFA+AEQC+SII TAK+ LKPS L LR GET EST G RTSSG FI
Sbjct: 92 FQVLSWNPHALYFPNFATAEQCESIIETAKEGLKPSTLVLRVGETDESTTGIRTSSGVFI 151
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SA EDKTG+L++IE KIARAT +P+THGEAFNVLRY++GQKY SHYDA +P YGPQ SQ
Sbjct: 152 SAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQKYSSHYDALHPDIYGPQKSQ 211
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDV EGGETMFPFENG+ +D Y Y+KCIGLKVKPR+GDGLLFYSLFPNGT
Sbjct: 212 RMASFLLYLSDVPEGGETMFPFENGLNMDGSYYYEKCIGLKVKPRKGDGLLFYSLFPNGT 271
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
ID SLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 272 IDPMSLHGSCPVIKGEKWVATKWIRDQ 298
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 343 bits (880), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 156/206 (75%), Positives = 185/206 (89%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSW+PRALYFP FA+AEQC++++ TAK RL+PS LALR+GE+ E+TKG RTSSGTF+S
Sbjct: 140 QILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRKGESEETTKGIRTSSGTFLS 199
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
A ED TG L IE KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF+PA+YGPQ SQR
Sbjct: 200 AEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQR 259
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYL+DVEEGGETMFP+ENG ++ GYDY++CIGLKVKPR+GDGLLFYSL NGTI
Sbjct: 260 VASFLLYLTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTI 319
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D TSLHGSCPV++GEKWVATKWIRD+
Sbjct: 320 DPTSLHGSCPVVRGEKWVATKWIRDK 345
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 343 bits (879), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 157/208 (75%), Positives = 185/208 (88%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG RTSSGTF+S
Sbjct: 101 QILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGIRTSSGTFLS 160
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+ ED TG L +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA+YGPQ SQR
Sbjct: 161 SDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQR 220
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYL+DVEEGGETMFP+ENG +D GYDY+KCIGLKVKPR+GDGLLFYSL NGTI
Sbjct: 221 VASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTI 280
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D TSLHGSCPVIKGEKWVATKWIRD+ +
Sbjct: 281 DPTSLHGSCPVIKGEKWVATKWIRDKSK 308
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 342 bits (878), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 157/208 (75%), Positives = 185/208 (88%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG RTSSGTF+S
Sbjct: 71 QILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGIRTSSGTFLS 130
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+ ED TG L +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA+YGPQ SQR
Sbjct: 131 SDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQR 190
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYL+DVEEGGETMFP+ENG +D GYDY+KCIGLKVKPR+GDGLLFYSL NGTI
Sbjct: 191 VASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLFYSLMVNGTI 250
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D TSLHGSCPVIKGEKWVATKWIRD+ +
Sbjct: 251 DPTSLHGSCPVIKGEKWVATKWIRDKSK 278
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 342 bits (877), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 160/211 (75%), Positives = 187/211 (88%), Gaps = 1/211 (0%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRA YFP+FA+AEQCQSII AK L PS L LR+GET ESTKG RTSSGTFI
Sbjct: 73 FQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRKGETEESTKGIRTSSGTFI 132
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASEDKTGIL+ IE KIA+ATM+P+ HGE FN+LRYEIGQ+Y+SHYDA +PAEYG Q SQ
Sbjct: 133 SASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHYDAISPAEYGLQTSQ 192
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFE+ + +++ ++ +KCIGLKVKPRRGDGLLFYS+FPNGT
Sbjct: 193 RIASFLLYLSDVEEGGETMFPFEHDLNINT-FNSRKCIGLKVKPRRGDGLLFYSVFPNGT 251
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID TS+HGSCPVI+GEKWVATKWIRD++Q +
Sbjct: 252 IDWTSMHGSCPVIEGEKWVATKWIRDEQQED 282
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 341 bits (875), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 156/206 (75%), Positives = 184/206 (89%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LSW+PRALYFP FA++EQC++++ TAK RL+PS LALR+GET E+TKG RTSSGTF+S
Sbjct: 88 QILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEETTKGIRTSSGTFLS 147
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
A ED T L +E KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF+PA+YGPQ SQR
Sbjct: 148 ADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQR 207
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYL+DVEEGGETMFP+ENG +D GYDY++CIGLKVKPR+GDGLLFYSL NGTI
Sbjct: 208 VASFLLYLTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGDGLLFYSLMVNGTI 267
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D TSLHGSCPVIKGEKWVATKWIR++
Sbjct: 268 DLTSLHGSCPVIKGEKWVATKWIRNK 293
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 334 bits (856), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 151/208 (72%), Positives = 177/208 (85%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW PRAL FP FAS QC++II+ AK +L PS LALR+GET T+ RTS G F+
Sbjct: 21 FQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVRTSHGCFL 80
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S+ +DKTG L +E K+A+ATM+P++HGEAFNVLRYEIGQKY+SHYD FNPAEYGPQ SQ
Sbjct: 81 SSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAEYGPQKSQ 140
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFEN ++ YDYK+CIGLKVKP++GD LLFYS+FPNGT
Sbjct: 141 RMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQGDALLFYSMFPNGT 200
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
D+T+LHGSCPVIKGEKWVATKWIRD+E
Sbjct: 201 FDKTALHGSCPVIKGEKWVATKWIRDKE 228
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 333 bits (854), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 151/208 (72%), Positives = 177/208 (85%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW PRAL FP FAS QC++II+ AK +L PS LALR+GET T+ RTS G F+
Sbjct: 82 FQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVRTSHGCFL 141
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S+ +DKTG L +E K+A+ATM+P++HGEAFNVLRYEIGQKY+SHYD FNPAEYGPQ SQ
Sbjct: 142 SSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAEYGPQKSQ 201
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFEN ++ YDYK+CIGLKVKP++GD LLFYS+FPNGT
Sbjct: 202 RMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQGDALLFYSMFPNGT 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
D+T+LHGSCPVIKGEKWVATKWIRD+E
Sbjct: 262 FDKTALHGSCPVIKGEKWVATKWIRDKE 289
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 330 bits (847), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 153/207 (73%), Positives = 177/207 (85%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW PRALYFPNF SAEQC++II A+ LKPS L LR+GET ESTKG RTS G F+
Sbjct: 89 FQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRKGETEESTKGIRTSYGVFM 148
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+GQKY HYDAF+ AE+GP SQ
Sbjct: 149 SASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPHYDAFDEAEFGPLQSQ 208
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R ASFLLYL+DV EGGET+FP+ENG D YD++ CIGL+V+PR+GDGLLFYSL PNGT
Sbjct: 209 RAASFLLYLTDVPEGGETLFPYENGFNRDGSYDFEDCIGLRVRPRKGDGLLFYSLLPNGT 268
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
ID+TS+HGSCPVIKGEKWVATKWIRDQ
Sbjct: 269 IDQTSVHGSCPVIKGEKWVATKWIRDQ 295
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 330 bits (845), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 156/211 (73%), Positives = 180/211 (85%), Gaps = 2/211 (0%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW PRALYFPNFASAEQC+SII A+ LK S LALR+GET ESTKG RTSSG F+
Sbjct: 89 FQVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTLALRKGETEESTKGIRTSSGVFM 148
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+GQKY+SHYDAF+ AEYGP SQ
Sbjct: 149 SASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQKYNSHYDAFDEAEYGPLQSQ 208
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYL+DV EGGETMFP+ENG D + + CIGL+V+PR+GD LLFYSL PNGT
Sbjct: 209 RVASFLLYLTDVPEGGETMFPYENGFNRDG--NVEDCIGLRVRPRKGDALLFYSLLPNGT 266
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
ID+TS HGSCPVIKGEKWVATKWIR+Q Q +
Sbjct: 267 IDQTSAHGSCPVIKGEKWVATKWIRNQVQDD 297
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 324 bits (831), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 150/209 (71%), Positives = 177/209 (84%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRAL +PNFAS EQC++II A+ RL PS LALR+GE+ +TK RTSSGTF+
Sbjct: 75 FQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKEIRTSSGTFL 134
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
ASEDKT L +E K+ARATM+P+ +GEAFNVLRY GQKYD HYD F+PAEYGPQ SQ
Sbjct: 135 RASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEYGPQPSQ 194
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFEN +++GY+YK CIGLKVKPR+GD LLFYS+ PNGT
Sbjct: 195 RMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCIGLKVKPRQGDALLFYSMHPNGT 254
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D+T+LHGSCPVIKGEKWVATKWIR+ ++
Sbjct: 255 FDKTALHGSCPVIKGEKWVATKWIRNTDK 283
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 324 bits (830), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 152/209 (72%), Positives = 174/209 (83%), Gaps = 1/209 (0%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSWRPRALY+P F +AEQCQ II AK L+PS LALR+GET E+TKG RTSSG F+
Sbjct: 89 FQVLSWRPRALYYPGFITAEQCQHIINMAKPSLQPSTLALRKGETAETTKGIRTSSGMFV 148
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+SED+ G+L++IE KIARATM+P THGEAFNVLRYEIGQKYD+HYDAFNPAEYGPQ SQ
Sbjct: 149 FSSEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQKYDAHYDAFNPAEYGPQTSQ 208
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+FLLYLS+ EEGGET FP EN + GYD +KC GL+VKP +GD +LFYS+FPN T
Sbjct: 209 RVATFLLYLSNFEEGGETTFPIENDENFE-GYDAQKCNGLRVKPHQGDAILFYSIFPNNT 267
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
ID SLH SC VIKGEKWVATKWIRDQ Q
Sbjct: 268 IDPASLHASCHVIKGEKWVATKWIRDQVQ 296
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 320 bits (820), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 145/209 (69%), Positives = 177/209 (84%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
QVLSW+PRAL +P FAS EQC++I+ A+ RL PS LALR+GE+ +STK RTSSGTF+
Sbjct: 90 FQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALALRKGESEDSTKDIRTSSGTFL 149
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
A ED T LE +E K+A+ATM+P+ +GEAFNVL+Y +GQKYD HYD F+PAEYGPQ SQ
Sbjct: 150 RADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYDVFDPAEYGPQPSQ 209
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ASFLLYLSDVEEGGETMFPFEN ++ G+DYKKCIG+KVKPR+GD LLFYS+ PNGT
Sbjct: 210 RMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCIGMKVKPRQGDALLFYSMHPNGT 269
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D+++LHGSCPVIKGEKWVATKWIR+ ++
Sbjct: 270 FDKSALHGSCPVIKGEKWVATKWIRNTDK 298
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 317 bits (813), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 145/184 (78%), Positives = 166/184 (90%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
QVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRTSSGTFIS
Sbjct: 28 QVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFIS 87
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
ASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EYGPQ SQR
Sbjct: 88 ASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQR 147
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ASFLLYLSDVEEGGETMFPFENG + GYDYK+CIGLKVKPR+GDGLLFYS+FPNGTI
Sbjct: 148 IASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYSVFPNGTI 207
Query: 182 DRTS 185
D+ +
Sbjct: 208 DQVN 211
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 300 bits (768), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 142/179 (79%), Positives = 156/179 (87%), Gaps = 9/179 (5%)
Query: 29 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 88
AK +LKPS LALR+GET EST FI SEDKTG L+ IE KIA+ATM+PQ+HG
Sbjct: 2 AKSKLKPSTLALRKGETTEST---------FIGGSEDKTGTLDFIERKIAKATMIPQSHG 52
Query: 89 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL 148
EAFN+LRYEIGQKYDSHYDAFNP EYGPQ SQR+ASFLLYLS VEEGGETMFPFENG +
Sbjct: 53 EAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAV 112
Query: 149 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
SG++YK+C+GLKVKPR+GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 113 SSGFEYKQCVGLKVKPRQGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 171
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 293 bits (749), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 133/204 (65%), Positives = 165/204 (80%), Gaps = 3/204 (1%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LSW PR Y PNFA+ +QC+++I AK +LKPS LALR+GET E+T+ R+
Sbjct: 71 LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSLHQ---HTD 127
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
ED++G+L IE KIA AT P+ + E+FN+LRY++GQKYDSHYDAF+ AEYGP +SQR+
Sbjct: 128 EDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQRVV 187
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FLL+LS VEEGGETMFPFENG ++ YDY+KC+GLKVKPR+GD + FY+LFPNGTID+
Sbjct: 188 TFLLFLSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNLFPNGTIDQ 247
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
TSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 248 TSLHGSCPVIKGEKWVATKWIRDQ 271
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 292 bits (747), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 133/204 (65%), Positives = 164/204 (80%), Gaps = 3/204 (1%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LSW PR Y PNFA+ +QC+++I AK +LKPS LALR+GET E+T+ RT
Sbjct: 69 LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRTR---LKKTD 125
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
ED++GIL IE KIA AT +P + E+FN+LRY++GQKYDSHYDAF+PAEYGPQ+SQR+
Sbjct: 126 EDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYGPQISQRVV 185
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+F+L+LS VEEGGETMFPFENG ++ YDY+ CIGL+VKPR+GD + FY+L PN TID+
Sbjct: 186 TFILFLSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNLLPNRTIDQ 245
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
TSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 246 TSLHGSCPVIKGEKWVATKWIRDQ 269
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 268 bits (685), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 125/209 (59%), Positives = 162/209 (77%), Gaps = 3/209 (1%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LS PR++ + NFAS C +I+ A+ RL S LAL++GET+E+TK RTSSGTF++
Sbjct: 129 QLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALKRGETLETTKNIRTSSGTFLT 188
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+ +++G L+ +E K+ARAT +P THGEA+N+LRYEIGQKYDSHYD F+P++YGPQ SQR
Sbjct: 189 SKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDMFDPSQYGPQRSQR 248
Query: 122 LASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC-IGLKVKPRRGDGLLFYSLFPN 178
+ASFLLYL+ +EGGET+FP E NG++ G DY C GLKVKPR+GD LLF+S+ PN
Sbjct: 249 VASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSCEAGLKVKPRKGDALLFWSVHPN 308
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
T DR+SLHG CPVI G K+VATKWI D
Sbjct: 309 NTFDRSSLHGGCPVISGTKFVATKWIHDN 337
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 263 bits (672), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 126/192 (65%), Positives = 144/192 (75%), Gaps = 33/192 (17%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE----------------------- 89
R SG FISASEDKTG L+LIE KIAR M+P+THGE
Sbjct: 91 RLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVMK 150
Query: 90 ----------AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 139
AFN+LRYEIGQ+Y+SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETM
Sbjct: 151 RFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETM 210
Query: 140 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 199
FPFENG+ +D YD+++CIGLKVKP +GDGLLFYS+FPNGTID TSLHGSCPVIKGEKWV
Sbjct: 211 FPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKWV 270
Query: 200 ATKWIRDQEQHE 211
ATKWIRDQEQ +
Sbjct: 271 ATKWIRDQEQDD 282
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 249 bits (636), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 116/211 (54%), Positives = 146/211 (69%), Gaps = 5/211 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q++SW PR + +P F E+C+ + AK RL PS LALR E + T+ RTS GTF+S
Sbjct: 110 QLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQETENVRTSQGTFMS 169
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D G++ +E K A+ T LP +HGE FNVLRY+ GQ YDSHYD F P YGPQ SQR
Sbjct: 170 RKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIFEPESYGPQPSQR 229
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLD----SGYDYKKC-IGLKVKPRRGDGLLFYSLF 176
+A+ L YL+DVEEGGET+FP E D +G++YK C G K KPR GD L+FYS+
Sbjct: 230 MATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKPRMGDALMFYSMH 289
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
PNGT D+ +LHG CPV+ GEKWVATKWIRD+
Sbjct: 290 PNGTFDKHALHGGCPVMAGEKWVATKWIRDK 320
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 226 bits (575), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 113/210 (53%), Positives = 141/210 (67%), Gaps = 15/210 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
MQ+LS PRA P F S +QC +IA A++RL PS LA + G+T E+T+
Sbjct: 131 MQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDTAENTR---------- 180
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
ED G+L IE K+A TM+P HGE FNVLRYE Q YDSHYD+F+ EYGPQ SQ
Sbjct: 181 --DEDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSHYDSFSEEEYGPQFSQ 238
Query: 121 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC-IGLKVKPRRGDGLLFYSLFP 177
R+A+ LLYL+DVEEGGET+F E G+ DYK C G+KVKPR+GD LLF+S+
Sbjct: 239 RIATVLLYLADVEEGGETVFLLEGKGGLARLERIDYKACDTGIKVKPRQGDALLFFSVSV 298
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
NGT+D+ SLHG CPV+ G KW TKWIR++
Sbjct: 299 NGTLDKHSLHGGCPVVAGTKWAMTKWIRNR 328
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 219 bits (557), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 110/211 (52%), Positives = 144/211 (68%), Gaps = 5/211 (2%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
Q+LSW PR + FP F + + +I A K + PS LA R GETV+ ++ TRTS+GTF+
Sbjct: 17 FQILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYRPGETVDPSQQTRTSTGTFL 76
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+A+ D G+L +E +IA AT+LP +GEAFNVL YE Q YDSHYD F+P E+GPQ SQ
Sbjct: 77 AAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSHYDTFDPKEFGPQPSQ 136
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 176
R+A+ LLYLS+V EGGET+F E G+ ++ D++ C K PR GD +LF+
Sbjct: 137 RIATVLLYLSEVLEGGETVFKRE-GVDGENRVIGDWRNCDDGSFKYMPRMGDAVLFWGTK 195
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
PNG ID +LHG CPV +GEKWVATKWIR +
Sbjct: 196 PNGDIDPHALHGGCPVKRGEKWVATKWIRSR 226
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 218 bits (556), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 140/210 (66%), Gaps = 3/210 (1%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
Q+LSW PR + FP F + + I+ A K + PS LA R GE VES++ TRTS+GTF+
Sbjct: 17 FQILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQVESSQQTRTSTGTFL 76
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S+ D G+L +E +IA AT+LP +GEAFNVL YE Q YDSH D+F+P ++GPQ SQ
Sbjct: 77 SSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSHMDSFDPKDFGPQPSQ 136
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY-DYKKCI--GLKVKPRRGDGLLFYSLFP 177
R+A+ LLYLS+V EGGET+F E D D++ C K PR GD +LF+ P
Sbjct: 137 RIATVLLYLSEVLEGGETVFKKEGVDGADRPIQDWRNCDDGSFKYAPRMGDAVLFWGTRP 196
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
NG ID SLHG CPV KGEKWVATKWIR +
Sbjct: 197 NGEIDPHSLHGGCPVKKGEKWVATKWIRSR 226
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 216 bits (551), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 100/120 (83%), Positives = 112/120 (93%), Gaps = 1/120 (0%)
Query: 90 AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLD 149
A+NVLRYE+GQKY+SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+EN +D
Sbjct: 1 AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYENDN-ID 59
Query: 150 SGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
S YDY +CIGLKVKPR+GDGLLFYSLF NGTID TS+HGSCPVIKGEKWVATKWIR++EQ
Sbjct: 60 SNYDYVQCIGLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWIRNEEQ 119
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 189 bits (481), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 98/210 (46%), Positives = 136/210 (64%), Gaps = 9/210 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +SW PRA + NF S E+C +I+ AK ++ S + + G+ VE + RTSSG F
Sbjct: 39 VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVEDS--VRTSSGMF 96
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD F+ ++
Sbjct: 97 LNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEFNLKEIG 154
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G F + + KC GL VKP+ GD LLF+S+
Sbjct: 155 QRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLFWSM 214
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
P+ T+D TSLHG+CPVI+G KW TKWI
Sbjct: 215 KPDTTLDPTSLHGACPVIRGNKWSCTKWIH 244
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 189 bits (480), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 139/213 (65%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
+Q++SW PRA + NF + ++C+ +I TAK ++ S + + G++ +S+ RTSSGTF
Sbjct: 67 VQIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSS--VRTSSGTF 124
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ D+ I+ IE +IA T +P +GE+FNVLRYE+GQKYD H D F
Sbjct: 125 LDRGGDE--IVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNTVNGG 182
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G + + C GL +KP+ GD LLF+S+
Sbjct: 183 QRIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSIKPKMGDALLFWSM 242
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+GT+D +SLHG+CPVIKG+KW TKW+R E
Sbjct: 243 KPDGTLDPSSLHGACPVIKGDKWSCTKWMRINE 275
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 100/213 (46%), Positives = 133/213 (62%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF
Sbjct: 78 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTF 135
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DK I+ IE KIA T +P HGE VL YE+GQKY+ HYD F
Sbjct: 136 LARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGG 193
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YL+DVEEGGET+FP G F + + + C GL +KP+RGD LLF+S+
Sbjct: 194 QRIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSM 253
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ T+D +SLHG CPVIKG KW +TKWIR E
Sbjct: 254 KPDATLDASSLHGGCPVIKGNKWSSTKWIRVNE 286
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 186 bits (472), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 97/209 (46%), Positives = 135/209 (64%), Gaps = 9/209 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++ +SW PRA + NF S E+C +I+ AK ++ S + + GE+V+S RTSSG F
Sbjct: 74 VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSR--VRTSSGMF 131
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD F +
Sbjct: 132 LNRGQDK--IIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGG 189
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G F + + +C GL VKP+ GD LLF+S+
Sbjct: 190 QRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKMGDALLFWSM 249
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D TSLHG+CPVI+G KW TKW+
Sbjct: 250 KPDATLDPTSLHGACPVIRGNKWSCTKWM 278
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 186 bits (472), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 60
++LSW PRA + NF S E+C+ +I AK ++ K S + + G++ ES RTSSG F+
Sbjct: 80 EILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESR--VRTSSGMFL 137
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+DK I++ IE +IA T +P+ +GE +L YE+GQKY+ HYD F Q
Sbjct: 138 KRGKDK--IVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYFLDEFNTKNGGQ 195
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP N F + D +C GL VKP+ GD LLF+S+
Sbjct: 196 RIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLFWSMR 255
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ T+D +SLHG CPVIKG KW +TKW+ +E
Sbjct: 256 PDATLDPSSLHGGCPVIKGNKWSSTKWMHLRE 287
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 186 bits (472), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 97/210 (46%), Positives = 134/210 (63%), Gaps = 9/210 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +SW PRA + NF S E+C +I+ AK ++ S + + G+ VE + RTSSG F
Sbjct: 74 VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVEDS--VRTSSGMF 131
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD F +
Sbjct: 132 LNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGG 189
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G F + + KC GL VKP+ GD LLF+S+
Sbjct: 190 QRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLFWSM 249
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
P+ T+D TSLHG+CPVI+G KW TKW+
Sbjct: 250 KPDATLDPTSLHGACPVIRGNKWSCTKWMH 279
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 186 bits (472), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+LS PR FPNF + + IIA A K + PS LA R GE VE+ + RTS GTF+
Sbjct: 216 QILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEAEQQVRTSKGTFLG 275
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
D + L +E KIA T +P+ +GE +NVL Y+ Q YDSH D+F+P EYG Q SQR
Sbjct: 276 G--DSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDSFDPKEYGQQYSQR 333
Query: 122 LASFLLYLSDVE-EGGETMFPFENGIFLDSGY-DYKKCI---GLKVKPRRGDGLLFYSLF 176
+A+ ++ LSD GGET+F E +D ++ C GL+ KPR GD +LF+S F
Sbjct: 334 IATVIVVLSDEGLVGGETVFKREGKANIDKPITNWTDCDADGGLRYKPRAGDAVLFWSAF 393
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+G +D+ +LHGSCPV+ G KWVA KWIR++
Sbjct: 394 PDGRLDQHALHGSCPVVTGNKWVAVKWIRNK 424
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 185 bits (470), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 101/208 (48%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 60
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
DK I++ IE +IA T +P HGE VL YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP N F + + +C GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D TSLHG CPVI+G KW +TKWI
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWI 280
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 185 bits (470), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 131/203 (64%), Gaps = 6/203 (2%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW+PRA + F SA +C ++ AK +L+ S +A + G++V S RTSSG F+S
Sbjct: 45 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSN--IRTSSGMFLSK 102
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +IA T LP+ +GEA VLRYE G+KY+ HYD F+ R+
Sbjct: 103 GQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRI 160
Query: 123 ASFLLYLSDVEEGGETMFPF-ENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
A+ L+YLSDV +GGET+FP E+ D + G+ VKPR+GD LLFYSL P+ T
Sbjct: 161 ATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDATP 220
Query: 182 DRTSLHGSCPVIKGEKWVATKWI 204
D +SLHG CPVI+GEKW ATKWI
Sbjct: 221 DESSLHGGCPVIEGEKWSATKWI 243
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 185 bits (470), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VLSW PRA + NF S E+C +I+ AK +K S + + ++ RTSSG F+
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSR-VRTSSGMFLR 155
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+DK I++ IE +IA T +P HGE VL YE+GQKY+ H+D F+ QR
Sbjct: 156 RGQDK--IIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQR 213
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVE+GGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 214 IATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKP 273
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 274 DGSMDSTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 134/213 (62%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++V+SW PRA + NF + E+C+ +I+ AK ++ S + ET +S RTSSGTF
Sbjct: 76 VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTF 133
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ DKT + IE +++ + +P HGE VL YE+GQKY+ H+D F
Sbjct: 134 LPRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGG 191
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+RGD LLF+S+
Sbjct: 192 QRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSM 251
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPVIKG KW ATKW+R +E
Sbjct: 252 KPDASLDPSSLHGGCPVIKGNKWSATKWVRVEE 284
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 99/217 (45%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+VLSW PRA + NF S E+C+ +I+ AK +K S + V+S G RTS
Sbjct: 96 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 148
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SGTF+ +DK ++ IE +I+ T +P +GE VL YE+GQKY+ H+D F+
Sbjct: 149 SGTFLRRGQDK--VIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 206
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N + + +C G+ VKP+ GD LL
Sbjct: 207 KNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 266
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+GT+D TSLHG CPVIKG+KW +TKWIR E
Sbjct: 267 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 303
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 184 bits (468), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/208 (48%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 60
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
DK I++ IE +IA T +P HGE VL YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP N F + + +C GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D TSLHG CPVI+G KW +TKW+
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWM 280
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 134/213 (62%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++V+SW PRA + NF + E+C+ +I+ AK ++ S + ET +S RTSSGTF
Sbjct: 76 VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTF 133
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ DKT + IE +++ + +P HGE VL YE+GQKY+ H+D F
Sbjct: 134 LPRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGG 191
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+RGD LLF+S+
Sbjct: 192 QRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSM 251
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPVIKG KW ATKW+R +E
Sbjct: 252 KPDASLDPSSLHGGCPVIKGNKWSATKWMRVEE 284
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 184 bits (467), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/217 (46%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+VLSW PRA + NF S E+C+ +I+ AK +K S + V+S G RTS
Sbjct: 90 EVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 142
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SGTF+ +DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+
Sbjct: 143 SGTFLRRGQDK--IVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 200
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N + + +C G+ VKP+ GD LL
Sbjct: 201 KNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 260
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+GT+D TSLHG CPVIKG+KW +TKWIR E
Sbjct: 261 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 297
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 184 bits (467), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 133/213 (62%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF
Sbjct: 79 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTF 136
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DK I+ IE +IA + +P HGE VL YE+GQKY+ HYD F
Sbjct: 137 LARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGG 194
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YL+DVEEGGET+FP G F + + +C GL +KP+RGD LLF+S+
Sbjct: 195 QRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSM 254
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ T+D +SLHG CPVIKG KW +TKW+R E
Sbjct: 255 KPDATLDPSSLHGGCPVIKGNKWSSTKWMRVSE 287
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 184 bits (467), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 94/213 (44%), Positives = 136/213 (63%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
+Q++SW PRA + NF + E+C+ +I AK + S++ + G+++ S+ RTSSGTF
Sbjct: 66 VQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLNSS--IRTSSGTF 123
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ D+ I+ IE +IA T +P HGE+FNVL YE+GQKY+ HYD F
Sbjct: 124 LDREGDE--IVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYDYFLDTFSTRHAG 181
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G F + + C GL +KP+ G+ +LF+S+
Sbjct: 182 QRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSM 241
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ T+D +SLHG+CPVIKG+KW KW+ E
Sbjct: 242 KPDATLDPSSLHGACPVIKGDKWSCAKWMHADE 274
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 133/205 (64%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA + F S +C ++ AK RL+ S +A G++V S RTSSGTF++
Sbjct: 34 LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQ--VRTSSGTFLNK 91
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED+ I+ IE ++A T LP+ + E+ VL YE+GQKYD+H+D F+ R+
Sbjct: 92 HEDE--IISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKLGGHRV 149
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YL+DV++GGET+FP G L + + +C GL VKPR+GD LLF+SL N
Sbjct: 150 ATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINA 209
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D +SLHGSCPVI+GEKW ATKWI
Sbjct: 210 TTDPSSLHGSCPVIEGEKWSATKWI 234
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 60
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
DK I++ IE +IA T +P HGE +L YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP N F + + +C GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D TSLHG CPVI+G KW +TKW+
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWM 280
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 183 bits (465), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+ +SW+PRA F NF S+E+C +I A+ +K S + Q + ++ RTSSGTF+
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR-VRTSSGTFLR 105
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D+ I+ IE +IA+ T +P+ HGE VL YE+GQKYD+H+D F+ QR
Sbjct: 106 RGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQR 163
Query: 122 LASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP N + + +C G+ VKPR+GD LLF+S+ P
Sbjct: 164 VATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALLFWSMSP 223
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ +D SLHG CPVIKG KW ATKW+ +E
Sbjct: 224 DAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 183 bits (465), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 130/205 (63%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW+PRA + F SA +C ++ AK +L+ S +A + G++V S RTSSG F+S
Sbjct: 31 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSN--IRTSSGMFLSK 88
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +IA T LP+ +GEA VLRYE G+KY+ HYD F+ R+
Sbjct: 89 GQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRI 146
Query: 123 ASFLLYLSDVEEGGETMFPF--ENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
A+ L+YLSD +GGET+FP E+ D + G+ VKPR+GD LLFYSL P+ T
Sbjct: 147 ATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDAT 206
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D +SLHG CPVI+GEKW ATKWI
Sbjct: 207 PDESSLHGGCPVIEGEKWSATKWIH 231
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 183 bits (464), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 129/212 (60%), Gaps = 9/212 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 60
+V+SW PRA + NF + E+C+ +I AK ++ S + ET S RTSSGTF+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVV--DSETGRSKDSRVRTSSGTFL 134
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S DK + IE +IA + +P HGE VL YE+GQKY+ H+D FN Q
Sbjct: 135 SRGRDKK--IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFNDEFNTKNGGQ 192
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP G F + + +C GL VKP GD LLF+S+
Sbjct: 193 RVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMK 252
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ T+D +SLHG CPVI G KW ATKW+R E
Sbjct: 253 PDATLDPSSLHGGCPVINGNKWSATKWMRVNE 284
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 182 bits (462), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+ +SW+PRA F NF S+E+C +I A+ +K S + Q + ++ RTSSGTF+
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR-VRTSSGTFLR 105
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D+ I+ IE +IA+ T +P+ HGE VL YE+GQKYD+H+D F+ QR
Sbjct: 106 RGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQR 163
Query: 122 LASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP N + + +C G+ VKPR+GD LLF+S+ P
Sbjct: 164 VATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALLFWSMSP 223
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ +D SLHG CPVIKG KW ATKW+ +E
Sbjct: 224 DAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 182 bits (462), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 131/211 (62%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+++SW PRA + NF S E+C+ +I+ AK +K S + + + ++ RTSSG F+
Sbjct: 78 EIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSR-VRTSSGMFLR 136
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
DK I+ IE +IA T +P HGE VL YE+GQKYD+HYD F QR
Sbjct: 137 RGRDK--IIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP F + + +C GL VKP+ GD LLF+S+ P
Sbjct: 195 IATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMRP 254
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ T+D +SLHG CPVIKG KW +TKW+ +E
Sbjct: 255 DATLDPSSLHGGCPVIKGNKWSSTKWMHVEE 285
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 182 bits (461), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 228 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 287
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 288 DGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 318
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 131/211 (62%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+SW PRA + NF S ++C+ +I AK ++ S + + ++ RTSSGTF++
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSR-VRTSSGTFLT 136
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+DK I+ IE +++ T LP HGE +L YE+GQKY+ HYD F QR
Sbjct: 137 RGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNTKNGGQR 194
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+ GD LLF+S+ P
Sbjct: 195 MATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALLFWSMKP 254
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ ++D +SLHG CPVIKG KW +TKWIR E
Sbjct: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 133/217 (61%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S ++C+ +I AK ++ S + V+S+ G RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SGTF++ +DK I+ IE +++ T LP HGE +L YE+GQKY+ HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP G F + + C GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALL 248
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+ ++D +SLHG CPVIKG KW +TKWIR E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 131/205 (63%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
+SWRPRA + NF + E+C I AK +L+ S +A + G++VES RTSSG F
Sbjct: 65 ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESE--VRTSSGMFFRK 122
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ ++ +E +IA T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 123 AQDQ--VVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRV 180
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDS-GYDYKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLSDVE+GGET+FP + G D+ C G VKPR+GD LLF+SL P+
Sbjct: 181 ATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGDALLFFSLHPDA 240
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 241 TTDPLSLHGSCPVIEGEKWSATKWI 265
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 12 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 70
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 71 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 128
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 129 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 188
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 189 DGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 219
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 133/209 (63%), Gaps = 9/209 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
+Q++SW PRA + NF + E+C+ +I AK + S + + G V+S + RTSSG F
Sbjct: 115 VQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRE--RTSSGAF 172
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ D+ I++ IE +IA T +P HGE FNVL YE+GQKY+ HYD F
Sbjct: 173 LKRGSDR--IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAG 230
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G F + + C GL +KP+ G+ +LF+S+
Sbjct: 231 QRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSM 290
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D +SLHG+CPVIKG+KW+ KW+
Sbjct: 291 KPDATLDPSSLHGACPVIKGDKWLCAKWM 319
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 181 bits (458), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VLSW PRA + NF S E+C +I+ AK +K S + + ++ RTSSG F+
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSR-VRTSSGMFLR 155
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+ QR
Sbjct: 156 RGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQR 213
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVE+GGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 214 IATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKP 273
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 274 DGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 181 bits (458), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 133/213 (62%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF
Sbjct: 78 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTF 135
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DK I+ IE KI+ T +P HGE VL YE+GQKY+ HYD F
Sbjct: 136 LARGRDK--IVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGG 193
Query: 120 QRLASFLLYLSDVEEGGETMFPFENG--IFLDSGYDYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YL+DVEEGGET+FP G F+ + +C GL +KP+RGD LLF+S+
Sbjct: 194 QRIATVLMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSM 253
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPVIKG KW +TKW+R E
Sbjct: 254 KPDASLDPSSLHGGCPVIKGNKWSSTKWMRVSE 286
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 180 bits (457), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA + F S +C +I AK ++ S +A G+++ S RTSSG F++
Sbjct: 38 LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQ--VRTSSGAFLAK 95
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED+ I+ IE ++A T LP+ + E+ VLRYEIGQKYD+H+D F+ QR
Sbjct: 96 HEDE--IVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKHGGQRF 153
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF+ L N
Sbjct: 154 ATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFGLHLNA 213
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D +SLHGSCPVI+GEKW ATKWI
Sbjct: 214 TTDTSSLHGSCPVIEGEKWSATKWIH 239
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 180 bits (457), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 131/205 (63%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF+SL N
Sbjct: 156 ATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNA 215
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWI 240
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 180 bits (456), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 131/217 (60%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 154 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 206
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 207 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 264
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVE+GGET+FP S + + +C GL VKP+ GD LL
Sbjct: 265 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 324
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+G++D TSLHG CPVIKG KW +TKW+R E
Sbjct: 325 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 361
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 180 bits (456), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 134/213 (62%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++++SW PRA + NF + E+C+ +I AK ++ S + + G++ +S RTSSGTF
Sbjct: 78 VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTF 135
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F
Sbjct: 136 LARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 193
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G + + + +C GL VKP+ GD LLF+S+
Sbjct: 194 QRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSM 253
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ T+D +SLHG C VIKG KW +TKW+R E
Sbjct: 254 TPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHE 286
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 179 bits (455), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 131/205 (63%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF+SL N
Sbjct: 156 ATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNA 215
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWI 240
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 129/217 (59%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S E+C+ +I AK R++ S + V+ST G RTS
Sbjct: 108 EVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV-------VDSTTGKSKDSRVRTS 160
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 161 SGMFLRRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 218
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 219 KNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALL 278
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 279 FWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGE 315
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 130/210 (61%), Gaps = 9/210 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTF 59
++++SW PR + NF + E+C+ +I AK ++ S + G++V S+ RTSSGTF
Sbjct: 70 VEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSS--ARTSSGTF 127
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
I DK IL IE +IA T +P HGE N+L YE+GQKYD H D F
Sbjct: 128 IDRGYDK--ILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHTDYFEDEVNTKHGG 185
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
+R+A+ L+YLSDVEEGGET+FP G F + + C GL +KP+ G+ +LF+ +
Sbjct: 186 ERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGLSIKPKMGNAILFWGM 245
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
P+ T+D S+HG+CPVIKG+KW TKW+R
Sbjct: 246 KPDATVDPLSVHGACPVIKGDKWSCTKWMR 275
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 177 bits (448), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 128/205 (62%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PRA + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ +E K+A T +P+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPV++GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWI 259
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 177 bits (448), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 130/206 (63%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PRA + F S E+C +I A+ +L+ S +A + G+++ES RTSSG FI+
Sbjct: 52 LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESE--VRTSSGMFIAK 109
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ I+ IE +IA T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 110 AQDE--IVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRV 167
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 168 ATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDA 227
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 228 TTDSDSLHGSCPVIEGEKWSATKWIH 253
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 177 bits (448), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 126/207 (60%), Gaps = 9/207 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW+PRA + F + E+C +I+ AK LK S +A + ++++ RTSSG FI +
Sbjct: 36 ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSE-VRTSSGMFIPKA 94
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I+ IE KIA T LP+ +GE VLRYE GQKY+ HYD F + RLA
Sbjct: 95 KDP--IVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDYFVDKVNIARGGHRLA 152
Query: 124 SFLLYLSDVEEGGETMFP------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ L+YL++VE+GGET+FP + D G+ VKPR+GD LLFYSL P
Sbjct: 153 TVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGDALLFYSLHP 212
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWI 204
N T D SLHG CPVI+GEKW ATKWI
Sbjct: 213 NATPDPLSLHGGCPVIQGEKWSATKWI 239
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 176 bits (447), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 75 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 132
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 133 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 190
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 191 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 250
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPV++GEKW AT+WI
Sbjct: 251 TTDSNSLHGSCPVVEGEKWSATRWI 275
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 176 bits (447), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPV++GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWI 259
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 176 bits (446), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPV++GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWI 259
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 176 bits (446), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 129/207 (62%), Gaps = 9/207 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFIS 61
LSW PRA + F S E+C +I AK +L+ S + GE+++S + RTSSG F++
Sbjct: 35 LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEE--RTSSGVFLT 92
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D + E R
Sbjct: 93 KRQDD--IVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHR 150
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 178
+A+ L+YLS+V +GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 151 IATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECAKQGYAVKPRKGDALLFFNLHPN 210
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIR 205
T D TSLHGSCPVI+GEKW AT+WI
Sbjct: 211 ATTDPTSLHGSCPVIEGEKWSATRWIH 237
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 130/205 (63%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LS RPRA + F S +C I++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YL+DV++GGET+FP G L D + GL VKP++GD LLF++L N
Sbjct: 154 ATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNA 213
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 214 TADTGSLHGSCPVIEGEKWSATKWI 238
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+++SW PRA NF + ++C +I A ++ S + Q ++ RTSSG F++
Sbjct: 3 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 61
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D+ ++ IE KIA+ T +P+ HGE VL YE GQKYD+H+D F QR
Sbjct: 62 RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 119
Query: 122 LASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 177
+A+ L+YL+DVEEGGET+FP +N L +C G+ V+P+RGD LLF+S+ P
Sbjct: 120 IATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSP 179
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ +D +SLHG CPVIKG+KW ATKW+R E
Sbjct: 180 DAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 210
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+SW+PRA + NF + +C+ +I AK R++ S + + +K RTSSGTF+
Sbjct: 81 EVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSK-VRTSSGTFLP 139
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
DK I+ IE +IA + +P HGE +L YE+GQ+Y+ H+D F QR
Sbjct: 140 RGRDK--IVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYFMDEYNTKNGGQR 197
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP G + + +C GL VKP+ GD LLF+S+ P
Sbjct: 198 IATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVKPKMGDALLFWSMNP 257
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+G+ D +SLHG CPVI+G KW +TKW+R E
Sbjct: 258 DGSPDPSSLHGGCPVIRGNKWSSTKWMRVNE 288
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 131/212 (61%), Gaps = 9/212 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 60
++LSW PRA + NF S E+C+ +I AK L K S + + G++ ES RTSSG F+
Sbjct: 70 EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESR--VRTSSGMFL 127
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+DK I++ IE +IA T +P +GE VL Y +G+KY+ HYD F Q
Sbjct: 128 KRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 185
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP F + D +C GL +KP+ GD LLF+S+
Sbjct: 186 RVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMR 245
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ T+D +SLHG CPVI G KW +TKW+ +E
Sbjct: 246 PDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 277
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 176 bits (445), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 130/211 (61%), Gaps = 7/211 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+++SW PRA NF + ++C +I A ++ S + Q ++ RTSSG F++
Sbjct: 15 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 73
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D+ ++ IE KIA+ T +P+ HGE VL YE GQKYD+H+D F QR
Sbjct: 74 RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 131
Query: 122 LASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 177
+A+ L+YL+DVEEGGET+FP +N L +C G+ V+P+RGD LLF+S+ P
Sbjct: 132 IATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSP 191
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ +D +SLHG CPVIKG+KW ATKW+R E
Sbjct: 192 DAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 222
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 176 bits (445), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 131/207 (63%), Gaps = 10/207 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PRA + F S E+C ++ A+ +L+ S +A + G+++ES RTSSG FI
Sbjct: 49 LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESE--VRTSSGMFIGK 106
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
S+D+ I++ IE +IA T LPQ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 107 SQDE--IVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRV 164
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL----DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
+ L+YLS+V +GGET+FP G + DS D K G VKP++GD LLF+SL P+
Sbjct: 165 VTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAK-NGYAVKPQKGDALLFFSLHPD 223
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIR 205
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 224 ATTDTNSLHGSCPVIEGEKWSATKWIH 250
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 176 bits (445), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 127/217 (58%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+ T+D SLHG CPVIKG KW +TKW+ +E
Sbjct: 271 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVRE 307
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 176 bits (445), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 126/208 (60%), Gaps = 9/208 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 60
+V+SW PRA + NF S E+C+ +I AK R+ S + ET +S RTSSG F+
Sbjct: 105 EVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVV--DSETGKSKDSRVRTSSGMFL 162
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F Q
Sbjct: 163 QRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQ 220
Query: 121 RLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSD+EEGGET+FP N L + +C GL VKP+ GD LLF+S+
Sbjct: 221 RMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMK 280
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D SLHG CPVIKG KW +TKW+
Sbjct: 281 PDATLDPLSLHGGCPVIKGNKWSSTKWL 308
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 175 bits (444), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 129/208 (62%), Gaps = 9/208 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 60
+++SW PRA + NF S E+C+ +IA AK + K + + + G + +S RTSSG F+
Sbjct: 79 EIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSR--VRTSSGMFL 136
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
DK I+ IE +IA + +P HGE VL YE+GQKY++HYD F Q
Sbjct: 137 RRGRDK--IIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQ 194
Query: 121 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 176
R A+ L+YLSDVEEGGET+FP N + S + +C GL VKP+ G+ LLF+S
Sbjct: 195 RTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTR 254
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D SLHGSCPVI+G KW ATKW+
Sbjct: 255 PDATLDPASLHGSCPVIRGNKWSATKWM 282
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 175 bits (444), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 127/217 (58%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 208 KNGGQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALL 267
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 304
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 175 bits (444), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 128/217 (58%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ +IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 209 KNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALL 268
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 269 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 305
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 129/211 (61%), Gaps = 9/211 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PRA + F + +C +I+ AK LK S +A + ++ RTSSG FI
Sbjct: 41 VKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE-VRTSSGAFI 99
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++D I+ IE KIA T LP+ +GE VLRYE GQKYD+H+D F +
Sbjct: 100 HKAKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH 157
Query: 121 RLASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYS 174
R+A+ L+YLSDVE+GGET+FP + ++ D C G+ VKPR+GD LLF+S
Sbjct: 158 RMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFS 217
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
L PN D +SLHG CPVI+GEKW ATKWIR
Sbjct: 218 LHPNAIPDTSSLHGGCPVIEGEKWSATKWIR 248
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 175 bits (443), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 129/205 (62%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LS RPRA + F S +C +++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YL+DV +GGET+FP G L D + GL VKP++GD LLF++L N
Sbjct: 154 ATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNA 213
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 214 TADTGSLHGSCPVIEGEKWSATKWI 238
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 174 bits (442), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 97/205 (47%), Positives = 124/205 (60%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW+PRA + F S E+C +I+ AK LK S +A ++ RTSSG FI
Sbjct: 40 ISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSE-VRTSSGMFIGKG 98
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I+ IE KIA T LP+ +GE VLRYE GQKYD+HYD F + R+A
Sbjct: 99 KDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIA 156
Query: 124 SFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNG 179
+ L+YLSDV +GGET+FP + L + D +C G+ VKPR+GD LLF+SL P
Sbjct: 157 TVLMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTA 216
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
D SLHG CPVI+GEKW ATKWI
Sbjct: 217 IPDPMSLHGGCPVIEGEKWSATKWI 241
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 134/208 (64%), Gaps = 11/208 (5%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQ-GETVESTKGTRTSSGTFIS 61
+SWRPRA + F + +C ++A A++ L+ S + RQ G++V S RTSSGTF++
Sbjct: 41 VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSE--VRTSSGTFLA 98
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG--PQMS 119
+D+ ++ IE +IA T+LPQ +GE+ VLRYE GQKY+ H D A G +
Sbjct: 99 KKQDQ--VVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHHSRGG 156
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-KCI--GLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDV+ GGET+FP + L D + +C G VKP +GD +LF+SL
Sbjct: 157 HRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLFFSLH 216
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
PNGT DR SLHG CPVI+GEKW ATKWI
Sbjct: 217 PNGTTDRDSLHGGCPVIEGEKWSATKWI 244
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 130/208 (62%), Gaps = 10/208 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW+PRA + F + E+C +I+ AK LK S +A + + ++ RTSSG FIS +
Sbjct: 40 ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSE-VRTSSGAFISKA 98
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I++ IE K+A T LP +GE VLRYE GQKY++H+D F+ + R A
Sbjct: 99 KD--AIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIARGGHRYA 156
Query: 124 SFLLYLSDVEEGGETMFPF-----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 176
+ L+YLS+VE+GG+T+FP + + D +C G+ VKPR+GD LLF+SL
Sbjct: 157 TVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECAKRGISVKPRKGDALLFFSLT 216
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P T D+ SLHG CPVI+GEKW ATKWI
Sbjct: 217 PTATPDQLSLHGGCPVIEGEKWSATKWI 244
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 90/213 (42%), Positives = 137/213 (64%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S RTSSGTF
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSR--VRTSSGTF 137
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ D+ I+E IE++I+ T +P +GE VL YE+GQKY+ H+D F +
Sbjct: 138 LKRGHDE--IVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVRKGG 195
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDV+EGGET+FP G D + + +C GL V P++ D LLF+S+
Sbjct: 196 QRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSVLPKKRDALLFWSM 255
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPVIKG KW +TKW E
Sbjct: 256 KPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/213 (41%), Positives = 138/213 (64%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S RTSSGTF
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSR--VRTSSGTF 137
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ D+ I+E IE++I+ T +P +GE VL YE+GQ+Y+ H+D F +
Sbjct: 138 LNRGHDE--IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGG 195
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDV+EGGET+FP G D + + +C GL V P++ D LLF+S+
Sbjct: 196 QRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSM 255
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPVIKG KW +TKW E
Sbjct: 256 KPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 126/205 (61%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 62
+SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG FI
Sbjct: 43 VSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE--VRTSSGMFIPK 100
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D I+ +E KI+ T+LP+ +GE VLRYE GQKYD HYD F + R+
Sbjct: 101 NKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRV 158
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGT 180
A+ L+YL+DV +GGET+FP ++ D +C G+ VKPRRGD LLF+SL+PN
Sbjct: 159 ATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDALLFFSLYPNAI 218
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLH CPVI+GEKW ATKWI
Sbjct: 219 PDTMSLHAGCPVIEGEKWSATKWIH 243
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 126/217 (58%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+ T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 173 bits (439), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 127/205 (61%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 62
LSW PRA F S E+C+ IIA AK R+ K S + G++V+S RTS+G +++
Sbjct: 57 LSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSE--IRTSTGAWLAK 114
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 120
ED+ I+ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P P+ Q
Sbjct: 115 GEDE--IISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+ VEEGGET+ P + G+ GL VKP +GD L+FYSL P+G+
Sbjct: 173 RVVTVLMYLTTVEEGGETVLPHADQKVSGEGWSECAKRGLAVKPVKGDALMFYSLKPDGS 232
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLHGSCP +KG+KW ATKWI
Sbjct: 233 NDPASLHGSCPTLKGDKWSATKWIH 257
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 173 bits (438), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 125/204 (61%), Gaps = 5/204 (2%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW PRA +P+F S ++ +++ A+ LK S +A + ++ RTSSGTFIS
Sbjct: 54 ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSE-VRTSSGTFISKG 112
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R+A
Sbjct: 113 KDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHRVA 170
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 181
+ LLYL+DV EGGET+FP G +C G+ VKPR+GD LLF++L P+
Sbjct: 171 TVLLYLTDVAEGGETVFPLAKGRKGSHHKGLSECAQKGIAVKPRKGDALLFFNLRPDAAT 230
Query: 182 DRTSLHGSCPVIKGEKWVATKWIR 205
D TSLHG C VIKGEKW ATKWIR
Sbjct: 231 DPTSLHGGCEVIKGEKWSATKWIR 254
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW+PRA + F + E+C +I AK +L+ S +A + G+++ S RTSSG F+
Sbjct: 58 LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSE--VRTSSGMFLQK 115
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D ++ IE +IA T LP +GEA +L YE GQKY+ H+D F+ R+
Sbjct: 116 AQDD--VVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQLGGHRI 173
Query: 123 ASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VEEGGET+FP E + L + C G VKP++GD LLF+SL P+
Sbjct: 174 ATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGGYSVKPKKGDALLFFSLHPDA 233
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
+ D SLHGSCPVI+GEKW ATKWI
Sbjct: 234 STDSLSLHGSCPVIEGEKWSATKWIH 259
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/207 (44%), Positives = 131/207 (63%), Gaps = 7/207 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ LSW+PRA + NF S +C +I+ AK +L+ S +A + G++V+S RTSSG F
Sbjct: 6 VKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSE--IRTSSGMF 63
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D I+ IE +IA T LP+ +GEA VLRY+ G+KY+ H+D F+
Sbjct: 64 LMKGQDD--IISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGG 121
Query: 120 QRLASFLLYLSDVEEGGETMFPF--ENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A+ L+YLSDV +GGET+FP + G D + G+ VKPR+GD LLF+SL P
Sbjct: 122 HRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFSLHP 181
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWI 204
+ D +SLH CPVI+GEKW ATKWI
Sbjct: 182 SAVPDESSLHTGCPVIEGEKWSATKWI 208
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 125/217 (57%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 268 FWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 90/213 (42%), Positives = 128/213 (60%), Gaps = 9/213 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
M+V+SW+PRA + NF + E+C+ +I A ++ S +A Q G++V R S+G F
Sbjct: 74 MEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSV--VHDVRKSTGAF 131
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D+ I+ IE +IA T +P +GE V+ YE+GQ YD HYD F
Sbjct: 132 LDRGQDE--IVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHYDYFIDDFNIENGG 189
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLS+VEEGGETMFP F + + C +GL +KP+ GD LLF+S+
Sbjct: 190 QRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCGKMGLSIKPKMGDALLFWSM 249
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
PN T+D +LH +CPVIKG KW TKW+ E
Sbjct: 250 KPNATLDALTLHSACPVIKGNKWSCTKWMHPTE 282
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 134/210 (63%), Gaps = 9/210 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++VLSW PRA + NF + ++C+ +I AK + S + + G +++S RTSSG F
Sbjct: 91 VEVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSN--VRTSSGWF 148
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ +DK I+ IE +IA + +P HGE +VL YE+ QKYD+HYD F+
Sbjct: 149 LNRGQDK--IIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGG 206
Query: 120 QRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSL 175
QR A+ L+YLSDVE+GGET+FP N + + +C GL V+P+ GD LLF+S+
Sbjct: 207 QRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSV 266
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
P+ ++D +SLHGSCPVI+G KW ATKW+R
Sbjct: 267 KPDASLDPSSLHGSCPVIQGNKWSATKWMR 296
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 172 bits (437), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA F S +C +I AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 43 LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 100
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 101 KQDE--VVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 158
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 159 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDA 218
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 219 TTDSESLHGSCPVIEGQKWSATKWI 243
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 120
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 141 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 200
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLHGSCP +KG+KW ATKWI
Sbjct: 201 NDPASLHGSCPTLKGDKWSATKWIH 225
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 16 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 73
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 120
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 74 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 132 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 191
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLHGSCP +KG+KW ATKWI
Sbjct: 192 NDPASLHGSCPTLKGDKWSATKWIH 216
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 45 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 102
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 120
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 103 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 161 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 220
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLHGSCP +KG+KW ATKWI
Sbjct: 221 NDPASLHGSCPTLKGDKWSATKWIH 245
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 17 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 74
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 120
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 75 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+ VEEGGET+ P G+ GL VKP +GD L+FYSL P+G+
Sbjct: 133 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 192
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLHGSCP +KG+KW ATKWI
Sbjct: 193 NDPASLHGSCPTLKGDKWSATKWIH 217
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 172 bits (436), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PRA + F S E+C +I AK +L+ S +A + G+++ S RTSSG F++
Sbjct: 59 LSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLNK 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ I+ IE +IA T LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 AQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRI 174
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLSDVE+GGET+FP L D + G VKPR+GD LLF+SL +
Sbjct: 175 ATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDA 234
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
+ D SLHGSCPVI+GEKW ATKWI
Sbjct: 235 STDNKSLHGSCPVIEGEKWSATKWIH 260
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 172 bits (436), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 97/207 (46%), Positives = 123/207 (59%), Gaps = 9/207 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW+PRA + F S E+C +I+ AK LK S +A ++ RTSSG FI
Sbjct: 40 ISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSE-VRTSSGMFIGKG 98
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I+ IE KIA T LP+ +GE VLRYE GQKYD+HYD F + R+A
Sbjct: 99 KDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIA 156
Query: 124 SFLLYLSDVEEGGETMFPFENGIF----LDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 177
+ L+YLSDV +GGET+FP L + D +C G+ VKPR+GD LLF+SL P
Sbjct: 157 TVLMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSECARKGIAVKPRKGDALLFFSLHP 216
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWI 204
D SLHG CPVI+GEKW ATKWI
Sbjct: 217 TAIPDPMSLHGGCPVIEGEKWSATKWI 243
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 88/212 (41%), Positives = 127/212 (59%), Gaps = 7/212 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PRA + NF + +C +I AK ++ S + + ++ RTSSGTF+
Sbjct: 55 VEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKSMVVDSSSGKSKDSR-VRTSSGTFL 113
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
DK I+ IE +IA + +P HGE +L YE+GQKY+ H+D F Q
Sbjct: 114 PRGRDK--IIRDIEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFDYFMDDYNTENGGQ 171
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP G + + +C GL VKP+ GD LLF+S+
Sbjct: 172 RIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLFWSMK 231
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPVI+G KW +TKW+R E
Sbjct: 232 PDASLDPSSLHGGCPVIRGNKWSSTKWMRVNE 263
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 127/212 (59%), Gaps = 10/212 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PRA + F + +C +I+ AK LK S +A + ++ RTSSG FI
Sbjct: 41 VKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE-VRTSSGAFI 99
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++D I+ IE KIA T LP+ +GE VLRYE GQKYD+H+D F +
Sbjct: 100 HKAKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGH 157
Query: 121 RLASFLLYLSDVEEGGETMFPFENG-----IFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ L+YLSDVE+GGET+F ++ D C G+ VKPR+GD LLF+
Sbjct: 158 RMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFF 217
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
SL PN D +SLHG CPVI+GEKW ATKWIR
Sbjct: 218 SLHPNAIPDTSSLHGGCPVIEGEKWSATKWIR 249
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 62
LSWRPRA F +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPVIEGQKWSATKWIH 239
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 172 bits (436), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 126/217 (58%), Gaps = 19/217 (8%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ +K ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRNK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S+ P+ T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 128/206 (62%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA + F S E+C +I AK +L+ S +A + G+++ S RTSSG F+
Sbjct: 57 LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLLK 114
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ I+ IE +IA T LP +GE+ +L YE G+KY+ H+D F+ R+
Sbjct: 115 AQDE--IVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLLGGHRI 172
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YL+ VEEGGET+FP G F D + C G V P++GD LLF+SL P+
Sbjct: 173 ATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDALLFFSLHPDA 232
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D +SLHGSCPVI GEKW ATKWI
Sbjct: 233 TTDPSSLHGSCPVIAGEKWSATKWIH 258
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/205 (47%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW+PRA + F S E+C +I AK +L S +A + GE++ES + RTSSG FI
Sbjct: 21 LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQE--RTSSGMFIFK 78
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+ED+ I+ IE +IA T LP+ +GE +LRYE GQKY++H D F + R
Sbjct: 79 TEDE--IVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRA 136
Query: 123 ASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLSDV++GGET+FP E D + G VKP +GD LLF+SL P+
Sbjct: 137 ATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCAKKGYAVKPNKGDALLFFSLHPDA 196
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLH SCPVI+GEKW ATKWI
Sbjct: 197 TPDPGSLHASCPVIEGEKWSATKWI 221
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/210 (44%), Positives = 130/210 (61%), Gaps = 9/210 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW+PRA + F + +C +I+ AK LK S +A + + ++ RTSSG FI
Sbjct: 39 VKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE-VRTSSGMFI 97
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ ++D I+ IE KIA T LP+ +GE VLRYE GQKYD HYD F+ +
Sbjct: 98 TKAKDP--IVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFSDKVNIARGGH 155
Query: 121 RLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFYS 174
R+A+ L+YL+DVE+GGET+FP + S D +C G+ VKPRRGD LLF+S
Sbjct: 156 RVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGDALLFFS 215
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
L+P D +S+H CPVI+GEKW ATKWI
Sbjct: 216 LYPTAVPDTSSIHAGCPVIEGEKWSATKWI 245
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 129/205 (62%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA F S +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G L D+ + G VKP +GD LLF+SL P+
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 214
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 215 TTDSDSLHGSCPVIEGQKWSATKWI 239
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 132/206 (64%), Gaps = 9/206 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LS +PRA + F SAE+CQ +I +AK +L S +A G++V S + RTS+G F+ +
Sbjct: 63 LSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKE--RTSTGMFLHKA 120
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D+ I+ IE +IA T LP +GE +LRYE GQKY+ H+D F R+A
Sbjct: 121 QDE--IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIA 178
Query: 124 SFLLYLSDVEEGGETMFPFENGIFL--DSGYDYKKC--IGLKVKPRRGDGLLFYSLFPNG 179
+ L+YLS+VE+GGET+FP + + L + D +C +G V+P+ GD LLF+S+ PN
Sbjct: 179 TILMYLSNVEKGGETVFP-NSPVKLSEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNV 237
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D TS HGSCPVI+GEKW ATKWI
Sbjct: 238 TPDTTSYHGSCPVIEGEKWSATKWIH 263
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 132/208 (63%), Gaps = 9/208 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++ +SWRPRA + NF S E+C+ + A+KRL S + + G++++ST RTSSGTF
Sbjct: 38 VEQVSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDST--VRTSSGTF 95
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM- 118
++ ED+ ++ IE +I+ TM+P+ +GEA +L+Y GQKY+ H D F+ +Y +
Sbjct: 96 LARGEDE--VVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH-DKYNSRTE 152
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+A+ L+YLS EEGGET+FP+ G+ GL VK +G LLFYSL
Sbjct: 153 NGGQRVATILMYLSTPEEGGETVFPYAEKKVEGEGWSECARKGLAVKAVKGSALLFYSLK 212
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
PNG D+ S HGSCP + GEKW AT+WI
Sbjct: 213 PNGEEDQASTHGSCPTLAGEKWSATRWI 240
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLSDV +GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDA 223
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 224 TTDSDSLHGSCPVIEGQKWSATKWIH 249
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLSDV +GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDA 223
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 224 TTDSDSLHGSCPVIEGQKWSATKWI 248
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 127/207 (61%), Gaps = 9/207 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS 61
LSW PRA + F S E+C +I AK +L+ S + + GE+ +S RTSSG F++
Sbjct: 35 LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLT 92
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D F + R
Sbjct: 93 KRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHR 150
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 178
+A+ L+YLS+V +GGET+FP G D + KC G VKPR+GD LLF++L N
Sbjct: 151 IATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLN 210
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIR 205
GT D SLHGSCPVI+GEKW AT+WI
Sbjct: 211 GTTDPNSLHGSCPVIEGEKWSATRWIH 237
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 123/204 (60%), Gaps = 5/204 (2%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW+PR + +F S ++ +I+ A+ LK S +A + RTSSGTF+
Sbjct: 54 ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGK-STLSDVRTSSGTFLRKG 112
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R A
Sbjct: 113 QDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIRGGHRYA 170
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 181
+ LLYL+DV EGGET+FP + + +C G+ VKPR+GD LLF++L P+GT
Sbjct: 171 TVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQKGIAVKPRKGDALLFFNLKPDGTT 230
Query: 182 DRTSLHGSCPVIKGEKWVATKWIR 205
D SLHG C VI+GEKW ATKWIR
Sbjct: 231 DPVSLHGGCAVIRGEKWSATKWIR 254
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 62
LSWRPRA F +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D SLHGSCP I+G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPAIEGQKWSATKWIH 239
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 170 bits (430), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 128/208 (61%), Gaps = 9/208 (4%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 60
++LSW PRA + NF S E+C+ +I AK + K + + + G + +S RTSSG F+
Sbjct: 78 EILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSR--VRTSSGMFL 135
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D+ ++ IE +IA + +P HGE VL YE+GQKY++H+D F Q
Sbjct: 136 RRGRDR--VIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQ 193
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 176
R A+ L+YLSDVEEGGET+FP N + + +C GL +KP+ G+ LLF+S
Sbjct: 194 RTATLLMYLSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTR 253
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D +SLHGSCPVI+G KW ATKW+
Sbjct: 254 PDATLDPSSLHGSCPVIRGNKWSATKWM 281
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 170 bits (430), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 95/210 (45%), Positives = 132/210 (62%), Gaps = 17/210 (8%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK-GTRTSSGTFISASED 65
RP+A F NF + E+C+ +IA AK +L PS + G+ +STK G RTS+G F++ +
Sbjct: 235 RPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGK--KSTKSGIRTSAGMFLT--KG 290
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPAEYGPQMSQR 121
+T + ++E ++A A LP+ +GE +LRYE GQKYD HYD F NP+ + QR
Sbjct: 291 QTPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINPSPN--RGGQR 348
Query: 122 LASFLLYLSDVEEGGETMFPFENGI--FLDSGYD--YKKCI--GLKVKPRRGDGLLFYSL 175
+A+ L+YL D EEGGET+FP F D D + C GL VK +RGD +LF+SL
Sbjct: 349 MATMLIYLKDTEEGGETIFPNAKKPEGFHDGEKDGAFSDCAKRGLPVKSKRGDAVLFWSL 408
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +D SLHG+CPV++GEKW A KWIR
Sbjct: 409 TSDYKLDEGSLHGACPVLRGEKWTAVKWIR 438
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 169 bits (429), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 96/208 (46%), Positives = 125/208 (60%), Gaps = 11/208 (5%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 62
+SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG FI
Sbjct: 43 VSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE--VRTSSGMFIPK 100
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D I+ +E KI+ T+LP+ +GE VLRYE GQKYD HYD F + R+
Sbjct: 101 NKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRV 158
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGY----DYKKCI--GLKVKPRRGDGLLFYSLF 176
A+ L+YL+DV +GGET+FP G D +C G+ VKPRRGD LLF+SL+
Sbjct: 159 ATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPRRGDALLFFSLY 218
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
PN D SLH CPVI+GEKW ATKWI
Sbjct: 219 PNAIPDTMSLHAGCPVIEGEKWSATKWI 246
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PRA + F S E+C +I AK +L+ S +A G+++ S RTSSG F++
Sbjct: 60 LSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSD--IRTSSGMFLNK 117
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ I+ IE +IA T LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 118 AQDE--IVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRI 175
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLSDVE+GGET+FP L D + G VKP++GD LLF+SL +
Sbjct: 176 ATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKPQKGDALLFFSLHLDA 235
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
+ D SLHGSCPVI+GEKW ATKWI
Sbjct: 236 STDTKSLHGSCPVIEGEKWSATKWIH 261
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 169 bits (428), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PRA + F S E+C +I AK +L+ S +A + G+++ S RTSSG F++
Sbjct: 59 LSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLNK 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ I+ IE +IA T LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 AQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRI 174
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLSDVE+GGET+F L D + G VKPR+GD LLF+SL +
Sbjct: 175 ATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDA 234
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
+ D SLHGSCPVI+GEKW ATKWI
Sbjct: 235 STDNKSLHGSCPVIEGEKWSATKWIH 260
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 169 bits (428), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 91/214 (42%), Positives = 130/214 (60%), Gaps = 9/214 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++++SW PR + NF + E+C+ +I AK ++ S + + G ++ES RTSSGTF
Sbjct: 85 VEIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESR--VRTSSGTF 142
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DK I+ IE++IA T +P +GE VL Y++G+KY H+D F
Sbjct: 143 LARGRDK--IVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGG 200
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKC--IGLKVKPRRGDGLLFYSL 175
R+A+ L+YLSDVEEGGET+FP G F + + C GL +KP+ + LLF+S+
Sbjct: 201 DRIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCGKKGLSIKPKMRNALLFWSI 260
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P+ T D SLHGSCPVIKG KW +TKWIR E
Sbjct: 261 KPDATYDPLSLHGSCPVIKGNKWSSTKWIRIGEH 294
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 169 bits (428), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 126/206 (61%), Gaps = 9/206 (4%)
Query: 5 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFISA 62
SW PRA + F S E+C +I AK +L+ S + + GE+ +S RTSSG F++
Sbjct: 1 SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLTK 58
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D F + R+
Sbjct: 59 RQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 116
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+V +GGET+FP G D + KC G VKPR+GD LLF++L NG
Sbjct: 117 ATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNG 176
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D SLHGSCPVI+GEKW AT+WI
Sbjct: 177 TTDPNSLHGSCPVIEGEKWSATRWIH 202
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 169 bits (428), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 93/210 (44%), Positives = 126/210 (60%), Gaps = 19/210 (9%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
+SW PRA F +C+ +I+ AK + S + G++++ST RTS+GTF
Sbjct: 86 VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDST--VRTSTGTFFGR 143
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN------PAEYGP 116
ED+ +++ IE +I+ T LP+ +GE +L YE GQKY++H+D F+ P G
Sbjct: 144 EEDE--VIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGG- 200
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYS 174
QR+A+ L+YL+ EEGGET+FP +G + +C G VK RRGD LLFYS
Sbjct: 201 ---QRIATVLMYLTTAEEGGETVFPMAANKV--TGPQWSECARGGAAVKSRRGDALLFYS 255
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
L PNG D TSLHGSCP KGEKW ATKWI
Sbjct: 256 LLPNGETDPTSLHGSCPTTKGEKWSATKWI 285
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 169 bits (427), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 129/211 (61%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +++ A+ LK S++A G++ ST RTSSG F
Sbjct: 39 VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLST--VRTSSGMF 96
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE KI+ T LP+ +GE VLRYE GQKY+SHYD F
Sbjct: 97 ISKNKDP--IVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGG 154
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY----DYKKCI--GLKVKPRRGDGLLFY 173
RLA+ L+YLS+V +GGET+FP Y D +C G+ VKP++GD LLF+
Sbjct: 155 HRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFF 214
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
SL PN D SLHG CPV++GEKW ATKWI
Sbjct: 215 SLEPNAIPDTNSLHGGCPVLEGEKWSATKWI 245
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 128/206 (62%), Gaps = 9/206 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 123 ASFLLYLSDVEEGGETMFP-FENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 178
A+ L+YLSDV +GGET+FP E G L D + C G VKP +GD LLF+SL P+
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPD 223
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 224 ATTDSDSLHGSCPVIEGQKWSATKWI 249
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/213 (45%), Positives = 128/213 (60%), Gaps = 13/213 (6%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +I+ AK LK S +A G++ S RTSSG F
Sbjct: 41 VKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSD--VRTSSGMF 98
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F +
Sbjct: 99 ISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDFFADKVNIARGG 156
Query: 120 QRLASFLLYLSDVEEGGETMFP------FENGIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
R+A+ L+YL++V GGET+FP F ++ D +C G+ VKPRRGD LL
Sbjct: 157 HRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSECAKKGIAVKPRRGDALL 216
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
F+SL+PN D SLH CPVI+GEKW ATKWI
Sbjct: 217 FFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWI 249
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA F S +C +I AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 95
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 RQDE--VVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G + +C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLTQHKDETASECAKNGYAVKPMKGDALLFFSLHPDA 213
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIR 205
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 214 TTDPDSLHGSCPVIEGQKWSATKWIH 239
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 129/209 (61%), Gaps = 11/209 (5%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW+PR + +F S ++ +++ A+ LK S +A E + RTSSGTFI S
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F+ + R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLF 176
+ L+YL+DV EGGET+FP F +SG + + +C G+ VKPR+GD LLF++L
Sbjct: 178 TVLMYLTDVAEGGETVFPLAEE-FTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLS 236
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
P+ + D SLH CPVIKGEKW ATKWIR
Sbjct: 237 PDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 129/209 (61%), Gaps = 11/209 (5%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW+PR + +F S ++ +++ A+ LK S +A E + RTSSGTFI S
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F+ + R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLF 176
+ L+YL+DV EGGET+FP F +SG + + +C G+ VKPR+GD LLF++L
Sbjct: 178 TVLMYLTDVAEGGETVFPLAEE-FTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLS 236
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
P+ + D SLH CPVIKGEKW ATKWIR
Sbjct: 237 PDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/212 (45%), Positives = 127/212 (59%), Gaps = 11/212 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG F
Sbjct: 40 VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--VRTSSGMF 97
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F +
Sbjct: 98 ISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGG 155
Query: 120 QRLASFLLYLSDVEEGGETMFPFEN----GIFLDSGYDYKKC--IGLKVKPRRGDGLLFY 173
R+A+ L+YL++V +GGET+FP ++ D +C G+ VKPRRGD LLF+
Sbjct: 156 HRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGKKGVAVKPRRGDALLFF 215
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
SL PN D SLH CPVI+GEKW ATKWI
Sbjct: 216 SLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 247
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 124/208 (59%), Gaps = 11/208 (5%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 62
+SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG FI
Sbjct: 581 VSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE--VRTSSGMFIPK 638
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F + R+
Sbjct: 639 NKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRV 696
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI------GLKVKPRRGDGLLFYSLF 176
A+ L+YL+DV +GGET+FP G + + + G+ VKPRRGD LLF+SL+
Sbjct: 697 ATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDALLFFSLY 756
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
PN D SLH CPVI+GEKW ATKWI
Sbjct: 757 PNAIPDTLSLHAGCPVIEGEKWSATKWI 784
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 129/209 (61%), Gaps = 9/209 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTF 59
++VLSW PRA + +F + E+C +I A+ L K + + G++ +S RTSSGTF
Sbjct: 3 VEVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSR--LRTSSGTF 60
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D +++ IE +IA T +P GE VL+Y+ +KY+ HYD F+ A
Sbjct: 61 LMRGQDP--VIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGG 118
Query: 120 QRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSL 175
QR+A+ L+YLS+VEEGGET+FP N + +C GL V+PR GD LLF+S+
Sbjct: 119 QRIATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSM 178
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ T+D TSLHG CPVIKG KW ATKW+
Sbjct: 179 KPDATLDSTSLHGGCPVIKGTKWSATKWL 207
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 127/205 (61%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PRA + NF + E+C +I +K +L+ S +A + G++++S RTSSG F++
Sbjct: 54 LSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSE--VRTSSGMFLNK 111
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ I+ IE +IA T LP +GE+ VL Y G+KY+ H+D F+ R+
Sbjct: 112 QQDE--IVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKANQRLGGHRV 169
Query: 123 ASFLLYLSDVEEGGETMFPFENGIF---LDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP G D + G VKPR+GD LLF+SL +
Sbjct: 170 ATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDA 229
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+GEKW ATKWI
Sbjct: 230 TTDSKSLHGSCPVIEGEKWSATKWI 254
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 167 bits (423), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 132/209 (63%), Gaps = 14/209 (6%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P+A F NF SAE+C ++ AK L PS + G +V ST RTS+G F+ + DK
Sbjct: 285 KPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAGGTSVPST--IRTSAGMFLRKAADK 342
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMS-QRLAS 124
T LE IE++IA A+ P+ +GE +LRY++GQKYD H+D F+ A P+ QR+A+
Sbjct: 343 T--LENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKRGGQRMAT 400
Query: 125 FLLYLSDVEEGGETMFP----FENGIFLDSG--YDYKKCI--GLKVKPRRGDGLLFYSLF 176
L+YL + +EGGET+FP E + G +++ +C GL VK +GD LLF+SL
Sbjct: 401 MLIYLENTKEGGETIFPRGTRAETFDLTEEGNPHEWSECTKHGLPVKSVKGDALLFWSLT 460
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +D SLHG+CPV+KG+KW A KWIR
Sbjct: 461 DDYKLDMGSLHGACPVVKGQKWTAVKWIR 489
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 167 bits (423), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/214 (44%), Positives = 127/214 (59%), Gaps = 13/214 (6%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG F
Sbjct: 40 VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--VRTSSGMF 97
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F +
Sbjct: 98 ISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGG 155
Query: 120 QRLASFLLYLSDVEEGGETMFPFEN------GIFLDSGYDYKKC--IGLKVKPRRGDGLL 171
R+A+ L+YL++V +GGET+FP ++ D +C G+ VKPRRGD LL
Sbjct: 156 HRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRRGDALL 215
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
F+SL PN D SLH CPVI+GEKW ATKWI
Sbjct: 216 FFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 249
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 127/211 (60%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG F
Sbjct: 36 VKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSD--VRTSSGMF 93
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE KI+ T LP+ +GE V RYE GQKYD HYD F +
Sbjct: 94 ISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDYFTDKVNIARGG 151
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ L+YL+DV +GGET+FP ++ D +C G+ VKPRRGD LLF+
Sbjct: 152 HRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGDALLFF 211
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
SL N T D +SLH CPVI+GEKW ATKWI
Sbjct: 212 SLHTNATPDTSSLHAGCPVIEGEKWSATKWI 242
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 167 bits (422), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/217 (41%), Positives = 129/217 (59%), Gaps = 16/217 (7%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVES---------TKG 51
+VLSW PRA+ + NF + E+C+ +I AK + S + G++ +S
Sbjct: 82 EVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSR 141
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 111
RTSSG F++ +DKT + IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 142 VRTSSGMFLNRGQDKT--IRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLD 199
Query: 112 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRG 167
QR+A+ L+YLSDVE+GGET+FP N + + +C G+ V+PR G
Sbjct: 200 EFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMG 259
Query: 168 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
D LLF+S+ P+ +D +SLH CPVI+G+KW ATKWI
Sbjct: 260 DALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWI 296
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 85/212 (40%), Positives = 130/212 (61%), Gaps = 7/212 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++V+SW PRA+ + NF S E+C+ +I AK + S + + + ++ RTSSGTF+
Sbjct: 80 VEVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFL 138
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDV++GGET+FP G + + KC GL V P++ D LLF+++
Sbjct: 197 RIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMR 256
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPV+KG KW +TKW E
Sbjct: 257 PDASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 131/212 (61%), Gaps = 7/212 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++V+SW PRA+ + NF + E+C+ +I+ AK + S + + + ++ RTSSGTF+
Sbjct: 80 VEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFL 138
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDV++GGET+FP G + + KC GL V P++ D LLF+++
Sbjct: 197 RIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMR 256
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPV+KG KW +TKW E
Sbjct: 257 PDASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 166 bits (421), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 94/210 (44%), Positives = 125/210 (59%), Gaps = 9/210 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW+PRA + F + +C +I+ AK LK S +A + + ++ RTSSG FI
Sbjct: 36 VKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE-VRTSSGMFI 94
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ +D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F +
Sbjct: 95 AKGKDP--IIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIARGGH 152
Query: 121 RLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFYS 174
R+A+ L+YLSDV +GGET+FP +S D +C G+ VKPRRGD LLF+S
Sbjct: 153 RMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDALLFFS 212
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
L P D SLH CPVI+GEKW ATKWI
Sbjct: 213 LHPTAIPDPNSLHAGCPVIEGEKWSATKWI 242
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 122/205 (59%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 120
ED ++ IE ++A+ T +P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L YL+ VEEGGET+ P G+ GL VKP +GD L FYSL P+G+
Sbjct: 141 RVVTXLXYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALXFYSLKPDGS 200
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLHGSCP +KG+KW ATKWI
Sbjct: 201 NDPASLHGSCPTLKGDKWSATKWIH 225
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 166 bits (420), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/193 (47%), Positives = 120/193 (62%), Gaps = 8/193 (4%)
Query: 16 FASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIE 74
F S E+C +IA K +L+ S +A + G++V S RTSSG F+ +D+T + IE
Sbjct: 3 FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSE--IRTSSGMFLERRQDET--ITRIE 58
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 134
+IA T LP+ +GE +L YE GQKYD+HYD F+ R+A+ L+YLSDV++
Sbjct: 59 KRIAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKK 118
Query: 135 GGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 191
GGET+FP G L D + C G VKPR+GD LLF+S PN T D SLH SCP
Sbjct: 119 GGETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCP 178
Query: 192 VIKGEKWVATKWI 204
VI+GEKW AT+WI
Sbjct: 179 VIEGEKWSATRWI 191
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 166 bits (420), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 127/212 (59%), Gaps = 11/212 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG F
Sbjct: 37 VKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSD--VRTSSGMF 94
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F +
Sbjct: 95 ISKNKDP--IISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFTDKVNIARGG 152
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ L+YL++V +GGET+FP ++ D +C G+ VKP RGD LLF+
Sbjct: 153 HRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSECAKKGIAVKPHRGDALLFF 212
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
SL N T D +SLH CPVI+GEKW ATKWI
Sbjct: 213 SLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 244
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 166 bits (419), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 124/205 (60%), Gaps = 7/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 62
+S +PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGT 180
A+ LLYL+DV EGGET+FP +C G+ V+PR+GD LLF++L P+GT
Sbjct: 170 ATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGT 229
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLHG CPVIKGEKW ATKWIR
Sbjct: 230 TDSVSLHGGCPVIKGEKWSATKWIR 254
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 166 bits (419), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 94/212 (44%), Positives = 126/212 (59%), Gaps = 11/212 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +I+ AK LK S +A GE+ S RTSSG F
Sbjct: 40 VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE--VRTSSGMF 97
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
I ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F +
Sbjct: 98 IPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGG 155
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI------GLKVKPRRGDGLLFY 173
R+A+ L+YL+DV +GGET+FP G + + + G+ VKPRRGD LLF+
Sbjct: 156 HRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDALLFF 215
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
SL+PN D SLH CPVI+GEKW AT+WI
Sbjct: 216 SLYPNAIPDTLSLHAGCPVIEGEKWSATEWIH 247
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 166 bits (419), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 130/212 (61%), Gaps = 7/212 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++V+SW PRA+ + NF + E+C+ +I+ AK + S + + + ++ RTSSGTF+
Sbjct: 80 VEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFL 138
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDV++GGET+FP G + + KC GL V P+ D LLF+++
Sbjct: 197 RIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMR 256
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
P+ ++D +SLHG CPV+KG KW +TKW E
Sbjct: 257 PDASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 165 bits (418), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 129/209 (61%), Gaps = 10/209 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
+ ++SW+PR ++ F S ++C ++ K++LK S +A + G++V S RTSSG F
Sbjct: 48 VTIISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSE--VRTSSGMF 105
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T+LPQ + E +LRYE GQKYD H+D F Q
Sbjct: 106 LDKQQDP--VVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQGG 163
Query: 120 QRLASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
R A+ L YLS VE+GGET+FP +E+ DS D K GL VK +GD +LF++L
Sbjct: 164 HRYATVLTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCAK-KGLAVKAVKGDSVLFFNL 222
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+GT D SLHGSCPVI+GEKW A KWI
Sbjct: 223 QPDGTPDPLSLHGSCPVIEGEKWSAPKWI 251
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 165 bits (417), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 127/210 (60%), Gaps = 12/210 (5%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 55
+VL+W PR + F SAE+C +IA A RL S + R G +ES RTS
Sbjct: 61 EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 116
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
+G F+S + + +++ IE +IA +M+P +GE VLRYE Q Y H+D F+
Sbjct: 117 TGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 176
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
+ QR+A+ L+YLSDVEEGGET+FP + G + +K GL VKPR+GD +LF+S
Sbjct: 177 KRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILFWSA 234
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G +D SLHG C V++GEKW ATKW+R
Sbjct: 235 ALDGNVDSNSLHGGCSVLRGEKWSATKWLR 264
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 165 bits (417), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 126/210 (60%), Gaps = 12/210 (5%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 55
+VL+W PR F SAE+C +IA A RL S + R G +ES RTS
Sbjct: 60 EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 115
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
+G F+S + + ++E IE +IA +M+P +GE VLRYE Q Y H+D F+
Sbjct: 116 TGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 175
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
+ QR+A+ L+YLSDVEEGGET+FP + G + +K GL VKPR+GD +LF+S
Sbjct: 176 KRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILFWSA 233
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G +D SLHG C V++GEKW ATKW+R
Sbjct: 234 ALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/184 (46%), Positives = 120/184 (65%), Gaps = 7/184 (3%)
Query: 33 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 92
+ PS LA R GE E+ + RTS GTF+ D + L +E KIA T+LP+T+GE +N
Sbjct: 1 MYPSGLAYRPGEKAEAEQQVRTSKGTFLGG--DSSPALRWLEDKIAAVTLLPRTNGEFWN 58
Query: 93 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFENGIFLDSG 151
VL Y+ Q YDSH D+F+P EYGPQ SQR+A+ ++ LSD GGET+F E ++
Sbjct: 59 VLNYKHSQHYDSHMDSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKP 118
Query: 152 Y-DYKKCI---GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
++ C GLK KPR GD +LF+S P+G +D +LHGSCPV+ G KWVA KW+R++
Sbjct: 119 ISNWTDCDADGGLKYKPRAGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLRNK 178
Query: 208 EQHE 211
+++
Sbjct: 179 GEYD 182
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 128/215 (59%), Gaps = 22/215 (10%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P+A NF SAE+C ++ AK+ L PS + G++V S RTS+G F+ +DK
Sbjct: 48 QPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSD--IRTSAGMFLRKGQDK 105
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPAEYGPQMSQRL 122
I++ IE +IAR + P +GE +LRY++GQKYD H+D F NPA + QRL
Sbjct: 106 --IVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPAPK--RGGQRL 161
Query: 123 ASFLLYLSDVEEGGETMFPF----------ENGIFLDSGYDYKKCI--GLKVKPRRGDGL 170
A+ L+YL D ++GGET FP E S ++ C G+ VK RGD +
Sbjct: 162 ATMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGIPVKSVRGDAI 221
Query: 171 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
LF+S+ +G +DR SLHG+CPVI+G+KW A KWIR
Sbjct: 222 LFFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIR 256
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 163 bits (413), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 125/208 (60%), Gaps = 8/208 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +SW PR + F S +C ++ AKK+++ S +A + G++V+S RTSSG F
Sbjct: 45 VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMF 102
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T LPQ + E VLRYE GQKY+ H+D F+ +
Sbjct: 103 LDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGG 160
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGI---FLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R A+ L+YLS V EGGET+FP G D+ + GL VKP +GD +LF+SL
Sbjct: 161 HRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLFFSLH 220
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+GT D SLHGSCPVI+GEKW A KWI
Sbjct: 221 ADGTPDPLSLHGSCPVIRGEKWSAPKWI 248
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 163 bits (413), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 124/211 (58%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW+PRA + F + +C +I+ AK LK S +A G++ S RTSSG F
Sbjct: 36 VKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSE--VRTSSGMF 93
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS +D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F +
Sbjct: 94 ISKKKDP--IVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVRGG 151
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ LLYL++V GGET+FP L++ D +C G+ VKPRRGD LLF+
Sbjct: 152 HRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGDALLFF 211
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
SL D SLH CPVI+GEKW ATKWI
Sbjct: 212 SLHTTAIPDTDSLHAGCPVIEGEKWSATKWI 242
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 163 bits (412), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 124/208 (59%), Gaps = 7/208 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
M+VLSW PRA + NF + + ++ K ++ S++ ET +S RTSSG F
Sbjct: 1 MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVV--DNETGKSAPSKVRTSSGMF 58
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ ED ++E IE +IA+ T +P+ +GE +L Y+ ++Y H+D F+
Sbjct: 59 LNRGEDD--VIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGG 116
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 177
QR+A+ L+YLSDVE+GGET+FP + + +C G KP++GD L FYSL P
Sbjct: 117 QRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYSLTP 176
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G +D SLH CPV+KG+KW ATKW+R
Sbjct: 177 DGRMDEKSLHAGCPVMKGDKWSATKWLR 204
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 163 bits (412), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 90/209 (43%), Positives = 127/209 (60%), Gaps = 9/209 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTF 59
++VLSW PRA + +F + +C +I AK L K + + G++ +S RTSSGTF
Sbjct: 2 VEVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSR--VRTSSGTF 59
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D I++ IE +IA T +P GE VL+Y +KY+ HYD F+ A
Sbjct: 60 LVRGQDH--IIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGG 117
Query: 120 QRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVE+GGET+FP N + +C GL V+PR GD LLF+S+
Sbjct: 118 QRIATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSM 177
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+ +D TSLHG+CPVI+G KW ATKW+
Sbjct: 178 KPDAKLDPTSLHGACPVIQGTKWSATKWL 206
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 163 bits (412), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 129/211 (61%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTF
Sbjct: 37 VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTF 94
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS +D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ +
Sbjct: 95 ISKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGG 152
Query: 120 QRLASFLLYLSDVEEGGETMFPF----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ L+YLS+V +GGET+FP + ++ D C G+ VKPR+GD LLF+
Sbjct: 153 HRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFF 212
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+L P+ D SLHG CPVI+GEKW ATKWI
Sbjct: 213 NLHPDAIPDPLSLHGGCPVIEGEKWSATKWI 243
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 163 bits (412), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 135/222 (60%), Gaps = 23/222 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG----ETVESTKGTRTSS 56
++++SW+PRAL F + +C +I+ A+ RL+PS++ R G ++V + +G +SS
Sbjct: 15 IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGL-SSS 73
Query: 57 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-- 114
GTF++ +D ++ +E +I AT LP +H E VL+YE+GQKY +HYD E
Sbjct: 74 GTFLTKRQDS--VVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQ 131
Query: 115 -----GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKC--IGLKVK 163
G Q R A+ L+YLSDVEEGGET FP +G ++D G Y +C G+ VK
Sbjct: 132 LAIRRGEQGGSRYATMLMYLSDVEEGGETSFP--HGRWIDEGAQAQPPYSECGSRGVAVK 189
Query: 164 PRRGDGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWI 204
PR+GD +LFYSL +G + D SLH CPV KG K+ AT WI
Sbjct: 190 PRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWI 231
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 162 bits (411), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C + + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S+ E K +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL+D EGGET FP G K GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVK--GLCVKPNKGDAVLFWSMGLDGE 237
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
D S+HG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 162 bits (411), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C + + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S+ E K +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL+D EGGET FP G K GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVK--GLCVKPNKGDAVLFWSMGLDGE 237
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
D S+HG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 162 bits (411), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 129/211 (61%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTF
Sbjct: 37 VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTF 94
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS +D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ +
Sbjct: 95 ISKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGG 152
Query: 120 QRLASFLLYLSDVEEGGETMFPF----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ L+YLS+V +GGET+FP + ++ D C G+ VKPR+GD LLF+
Sbjct: 153 HRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALLFF 212
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+L P+ D SLHG CPVI+GEKW ATKWI
Sbjct: 213 NLHPDAIPDPLSLHGGCPVIEGEKWSATKWI 243
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 125/205 (60%), Gaps = 8/205 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW+PRA + F S +C +I AK +L+ S +A G++V S RTSSG F+
Sbjct: 56 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 113
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ ++ +E +IA T+LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 114 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 171
Query: 123 ASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ L+YLS+VE+GGET+FP F+ D + G VK ++GD LLF+SL +
Sbjct: 172 ATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDA 231
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI GEKW ATKWI
Sbjct: 232 TTDERSLHGSCPVIAGEKWSATKWI 256
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 161 bits (408), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/207 (40%), Positives = 130/207 (62%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 60
+V+SW+PR + NF SA++C +I A+ RL K + + G+ +ES RTS+G F+
Sbjct: 79 EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESK--VRTSTGMFL 136
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ ++ + ++ IE +IA +M+P +GE VLRYE Q Y +H+D F+ + Q
Sbjct: 137 NGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNLKRGGQ 196
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL++ EGGET+FP G + K IG+ VKP+RGD +LF+S+ +G
Sbjct: 197 RVATMLMYLTEGVEGGETIFPQAGDKECSCGGEMK--IGVCVKPKRGDAVLFWSIKLDGQ 254
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+D TSLHG C V+ GEKW +TKW+R +
Sbjct: 255 VDPTSLHGGCKVLSGEKWSSTKWMRQR 281
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 125/210 (59%), Gaps = 8/210 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES--TKGTRTSSGTFI 60
LSW PR + F S E+C I AK +L+ S +A GE+VES + S +FI
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFI 118
Query: 61 SA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
+ S + I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 178
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSL 175
R+A+ L+YLS+VE+GGET+FP G D + +C G VKPR+GD LLF++L
Sbjct: 179 GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNL 238
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
PN T D SLHGSCPV++GEKW AT+WI
Sbjct: 239 HPNATTDSNSLHGSCPVVEGEKWSATRWIH 268
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/207 (41%), Positives = 128/207 (61%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD--VRTSSGMFV 115
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL+D GGET FP G + K GL VKP +GD +LF+S+ +G
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQAGDGECSCGGNVVK--GLCVKPNKGDAVLFWSMGLDGN 233
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D S+H CPV+KGEKW ATKW+R +
Sbjct: 234 TDPNSIHSGCPVLKGEKWSATKWMRQK 260
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/208 (42%), Positives = 127/208 (61%), Gaps = 8/208 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +SW+PR + F S +C ++ AKK+++ S +A Q G++V S RTSSG F
Sbjct: 44 VKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSE--VRTSSGMF 101
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ +D ++ IE +IA T LPQ + E +LRYE GQKY+ H+D F+ +
Sbjct: 102 LNKRQDP--VVSRIEERIAAWTFLPQENAENMQILRYEHGQKYEPHFDYFHDKINQVRGG 159
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLF 176
R A+ L+YLS V++GGET+FP G D + +C GL VKP +GD +LF+SL
Sbjct: 160 HRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFSECAHQGLAVKPVKGDAVLFFSLH 219
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+G D SLHGSCPVI+GEKW A KWI
Sbjct: 220 VDGVPDPLSLHGSCPVIQGEKWSAPKWI 247
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 126/209 (60%), Gaps = 8/209 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++ +SWRPR + F S ++C ++ K++++ S +A + G++V S RTSSG F
Sbjct: 57 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSE--VRTSSGMF 114
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T LP+ + E +LRYE GQKY+ H+D F+
Sbjct: 115 LDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGG 172
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLF 176
R A+ L+YLS VE+GGET+FP G D + +C GL VKP +GD +LF+SL
Sbjct: 173 HRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDAVLFFSLH 232
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G D SLHGSCPVI+GEKW A KWIR
Sbjct: 233 IDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 160 bits (406), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 124/211 (58%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG F
Sbjct: 37 VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMF 94
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD F Q
Sbjct: 95 ISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGG 152
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLKVKPRRGDGLLFY 173
RLA+ L+YL++V +GGET+FP G D +C G+ VKPRRGD LLF+
Sbjct: 153 HRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFF 212
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
SL N D SLH CPV++GEKW ATKWI
Sbjct: 213 SLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 128/219 (58%), Gaps = 21/219 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTF 59
++ +SW PRA + NF + E+C ++ AK + L++ ++ T GT SG F
Sbjct: 2 IEQISWEPRAFVYHNFLTPEECAHLVNLAKA----TDGGLKRATVADARTGGTFPGSGAF 57
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM- 118
+ + D I+ IE +I+ M+P HGE +LRY G+KYD H+D F+ + +
Sbjct: 58 LLRNHDP--IVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFY 115
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLD----------SGYDYKKCI--GLKVKPRR 166
QR+A+ L+YLSDVE GGET+FP ++G +++ S D KC L VKPRR
Sbjct: 116 GQRVATVLMYLSDVESGGETVFP-KHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRR 174
Query: 167 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
GD LLF++ NG D TSLH CPV++GEKW ATKW+R
Sbjct: 175 GDALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWMR 213
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 124/211 (58%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG F
Sbjct: 37 VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMF 94
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD F Q
Sbjct: 95 ISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGG 152
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLKVKPRRGDGLLFY 173
RLA+ L+YL++V +GGET+FP G D +C G+ VKPRRGD LLF+
Sbjct: 153 HRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFF 212
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
SL N D SLH CPV++GEKW ATKWI
Sbjct: 213 SLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 160 bits (405), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 128/212 (60%), Gaps = 11/212 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTF
Sbjct: 37 VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTF 94
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
I +D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ +
Sbjct: 95 IPKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGG 152
Query: 120 QRLASFLLYLSDVEEGGETMFPFEN----GIFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ L+YLS+V +GGET+FP + ++ D C G+ VKPR+GD LLF+
Sbjct: 153 HRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALLFF 212
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+L P+ D SLHG CPVI+GEKW ATKWI
Sbjct: 213 NLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 126/209 (60%), Gaps = 8/209 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++ +SWRPR + F S ++C ++ K++++ S +A + G++V S RTSSG F
Sbjct: 57 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSE--VRTSSGMF 114
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T LP+ + E +LRYE GQKY+ H+D F+
Sbjct: 115 LDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGG 172
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLF 176
R A+ L+YLS VE+GGET+FP G D + +C GL VKP +GD +LF+SL
Sbjct: 173 HRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLH 232
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G D SLHGSCPVI+GEKW A KWIR
Sbjct: 233 IDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 160 bits (404), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 126/209 (60%), Gaps = 8/209 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++ +SWRPR + F S ++C ++ K++++ S +A + G++V S RTSSG F
Sbjct: 57 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSE--VRTSSGMF 114
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T LP+ + E +LRYE GQKY+ H+D F+
Sbjct: 115 LDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGG 172
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLF 176
R A+ L+YLS VE+GGET+FP G D + +C GL VKP +GD +LF+SL
Sbjct: 173 HRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLH 232
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G D SLHGSCPVI+GEKW A KWIR
Sbjct: 233 IDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 160 bits (404), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 9/214 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 61
+SW PR + F S +C+ +IA AK+ R++ S + + GE+V S TRTSSG F+
Sbjct: 40 VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D+ ++ IE +IA TM P +GE+ +LRY G+KY+ H+D + + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 122 LASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 178
+A+ L+YLS+V+ GGET+FP E + + C G VKP +G +LF+SL+PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 212
T D SLHGSCPVI+GEKW ATKWI + E+
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDEN 249
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 160 bits (404), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 126/209 (60%), Gaps = 8/209 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++ +SWRPR + F S ++C ++ K++++ S +A + G++V S RTSSG F
Sbjct: 51 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSE--VRTSSGMF 108
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T LP+ + E +LRYE GQKY+ H+D F+
Sbjct: 109 LDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGG 166
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLF 176
R A+ L+YLS VE+GGET+FP G D + +C GL VKP +GD +LF+SL
Sbjct: 167 HRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLH 226
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G D SLHGSCPVI+GEKW A KWIR
Sbjct: 227 IDGVPDPLSLHGSCPVIEGEKWSAPKWIR 255
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 160 bits (404), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 85/207 (41%), Positives = 128/207 (61%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKSD--VRTSSGMFV 115
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 116 NSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL+D EGGET F G + K GL VKP +GD +LF+S+ +G
Sbjct: 176 RVATMLMYLTDGVEGGETHFLQAGDGECSCGGNVVK--GLCVKPNKGDAVLFWSMGLDGN 233
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D S+H CPV+KGEKW ATKW+R +
Sbjct: 234 TDPNSIHSGCPVLKGEKWSATKWMRQK 260
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 159 bits (403), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 127/204 (62%), Gaps = 12/204 (5%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
P + NF + +C +I A+ +L+ S +A + G++V S RTSSG F++ ++D+
Sbjct: 28 PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSE--IRTSSGMFLNKAQDE 85
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 126
I+ +E +IA T LP +GEA VL YE+GQKY+ H+D F+ R+A+ L
Sbjct: 86 --IVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVL 143
Query: 127 LYLSDVEEGGETMFPFENGIFLDS---GYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 181
+YLSDV +GGET+FP N DS + +C G VKP +GD LLF+SL P+ T
Sbjct: 144 MYLSDVVKGGETVFP--NAETKDSQPKDDSWSECAKGGYSVKPNKGDALLFFSLRPDATT 201
Query: 182 DRTSLHGSCPVIKGEKWVATKWIR 205
D++SLHGSCPVI+GEKW ATKWI
Sbjct: 202 DQSSLHGSCPVIEGEKWSATKWIH 225
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 9/214 (4%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 61
+SW PR + F S +C+ +IA AK+ R++ S + + GE+V S TRTSSG F+
Sbjct: 40 VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+D+ ++ IE +IA TM P +GE+ +LRY G+KY+ H+D + + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 122 LASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 178
+A+ L+YLS+V+ GGET+FP E + + C G VKP +G +LF+SL+PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 212
T D SLHGSCPVI+GEKW ATKWI + E+
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDEN 249
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 159 bits (401), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 127/210 (60%), Gaps = 11/210 (5%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFI 60
++LS PRA + NF S E+C+ +I AK + S + GE ES+ +RTSSG F+
Sbjct: 113 EILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESS--SRTSSGMFL 170
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+DK I++ IE +IA T +P +GE +V+ Y +GQK + HYD +
Sbjct: 171 DRGKDK--IVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGP 228
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG--LKVKPRRGDGLLFYSLFPN 178
R+A+ L+YLSDVEEGGET+FP F KC G L VKP+ GD LLF+S+ P+
Sbjct: 229 RVATVLMYLSDVEEGGETVFPDAQPNFTS----VSKCSGDGLSVKPKMGDALLFWSMKPD 284
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
GT+D +SLHG PVI+G KW +TKW+ +E
Sbjct: 285 GTLDTSSLHGGSPVIRGNKWASTKWLHLRE 314
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 109/195 (55%), Gaps = 24/195 (12%)
Query: 16 FASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
F S E+C+ +I AK + S + G+ ES+ RTSSG F+ +DK I++ IE
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESS--ARTSSGRFLERGKDK--IVQNIE 427
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 134
+IA T +P+ A + + + G GP R+A+ L+YLSDVEE
Sbjct: 428 QRIADITSIPRM---ARDFMLFTAG--------GVVTKNGGP----RVATVLMYLSDVEE 472
Query: 135 GGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVI 193
GGET+FP + I S Y K GL VKP+ GD LLF S+ P+GT+D +SLHG PVI
Sbjct: 473 GGETVFPNAKPNINSVSKYPEK---GLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVI 529
Query: 194 KGEKWVATKWIRDQE 208
+G KW +TKW+ E
Sbjct: 530 RGNKWASTKWLHLTE 544
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 123/211 (58%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG
Sbjct: 37 VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGML 94
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS ++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD F Q
Sbjct: 95 ISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGG 152
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLKVKPRRGDGLLFY 173
RLA+ L+YL++V +GGET+FP G D +C G+ VKPRRGD LLF+
Sbjct: 153 HRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFF 212
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
SL N D SLH CPV++GEKW ATKWI
Sbjct: 213 SLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 158 bits (399), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 125/210 (59%), Gaps = 17/210 (8%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+A F NF + +C+ ++ AKK+L PS + +G +K RTS+G F+ +D T
Sbjct: 177 PKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSK-IRTSAGMFLGRGQDPT 235
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-QRLASF 125
+ IE +IA A+ LP+ +GE +LRYE GQKYD H+D F + P+ QR+A+
Sbjct: 236 --VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRRGGQRMATM 293
Query: 126 LLYLSDVEEGGETMFPFENGIFLD--------SGYDYKKCI--GLKVKPRRGDGLLFYSL 175
L+YL D EGGET+FP NG+ + + + C G+ VK RGD +LF+SL
Sbjct: 294 LIYLEDTTEGGETIFP--NGVRPEDWDADEPGNHNSWSDCAKKGIPVKSHRGDAVLFWSL 351
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ T+D SLHG+CPVI GEKW A KWIR
Sbjct: 352 KEDYTLDNGSLHGACPVIAGEKWTAVKWIR 381
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 124/208 (59%), Gaps = 11/208 (5%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW+PRA + F S +C +I AK +L+ S +A G++V S RTSSG F+
Sbjct: 35 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 92
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D+ ++ +E +IA T+LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 93 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 150
Query: 123 ASFLLYLSDVEEGGETMFPFENGIF------LDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
A+ L+YLS+VE+GGET+FP + D + G VK ++GD LLF+SL
Sbjct: 151 ATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKGDALLFFSLN 210
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+ T D SLHGSCPVI GEKW ATKWI
Sbjct: 211 LDATTDERSLHGSCPVIAGEKWSATKWI 238
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 128/211 (60%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTF
Sbjct: 38 VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTF 95
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS +D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ +
Sbjct: 96 ISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGG 153
Query: 120 QRLASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ LLYLS+V +GGET+FP F ++ D C G+ VKP++G+ LLF+
Sbjct: 154 HRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFF 213
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+L + D SLHG CPVI+GEKW ATKWI
Sbjct: 214 NLQQDAIPDPFSLHGGCPVIEGEKWSATKWI 244
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 157 bits (397), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 125/210 (59%), Gaps = 13/210 (6%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 62
+SW+PR + +F S ++ +++ A+ LK S +A G++ S RTS GTFIS
Sbjct: 55 ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSE--VRTSYGTFISK 112
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ IE KIA T LP+ +GE VLRY+ G+K + +D F + R+
Sbjct: 113 GKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVRGGHRV 170
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSL 175
A+ LLYL+DV EGGET+FP F D+G K +C G+ VKPR+GD LLF++L
Sbjct: 171 ATVLLYLTDVAEGGETVFPLAKD-FTDTGLHDKDTTLSECAQKGIAVKPRKGDALLFFNL 229
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
P+ D SLHG C VIKGEKW ATKWIR
Sbjct: 230 RPDAATDPLSLHGGCTVIKGEKWTATKWIR 259
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/207 (41%), Positives = 123/207 (59%), Gaps = 9/207 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTF 59
++V+SW PRA + NF + E+C+ +I TA LK + GE +E++ RTS+
Sbjct: 83 VEVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETS--YRTSTEYV 140
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +DK I+ IE +IA T +P HGE +V+RY +GQ Y+ H D F
Sbjct: 141 VERGKDK--IVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGG 198
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLS+VE GGET+FP N F + + +C GL +KP+ GD LLF+S+
Sbjct: 199 QRIATMLMYLSNVEGGGETVFPIANANFSSVPWWNELSECGQTGLSIKPKMGDALLFWSM 258
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATK 202
P+ T+D +LH +CPVIKG KW TK
Sbjct: 259 KPDATLDPLTLHRACPVIKGNKWSCTK 285
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 128/211 (60%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTF
Sbjct: 36 VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTF 93
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS +D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ +
Sbjct: 94 ISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGG 151
Query: 120 QRLASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ LLYLS+V +GGET+FP F ++ D C G+ VKP++G+ LLF+
Sbjct: 152 HRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFF 211
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+L + D SLHG CPVI+GEKW ATKWI
Sbjct: 212 NLQQDAIPDPFSLHGGCPVIEGEKWSATKWI 242
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 129/211 (61%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +S +PRA + F + +C +I+ AK+ L+ S +A GE+ S RTSSGTF
Sbjct: 38 VKQVSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTF 95
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS +D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ +
Sbjct: 96 ISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGG 153
Query: 120 QRLASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ LLYLS+V +GGET+FP F ++ D C G+ VKP++G+ LLF+
Sbjct: 154 HRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFF 213
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+L + D SLHG CPVI+GEKW ATKWI
Sbjct: 214 NLQQDAIPDPFSLHGGCPVIEGEKWSATKWI 244
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 121/210 (57%), Gaps = 10/210 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++ +SWRP A + F + E+C + A A L S + G +V S RTSSG F
Sbjct: 56 IERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSD--IRTSSGMF 113
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA--EYGPQ 117
+ ED ++ IE +IA T +P++HGE F VLRYE GQ+Y H+D F + +
Sbjct: 114 LLRGEDD--VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQDEFNQKREK 171
Query: 118 MSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIG--LKVKPRRGDGLLFYS 174
QR+A+ L+YL+DVEEGGET+FP E G G D C L VKPR+GD L F S
Sbjct: 172 GGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGDALFFRS 231
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
L NGT D S H CPV+KG K+ ATKW+
Sbjct: 232 LHHNGTSDAMSSHAGCPVVKGVKFSATKWM 261
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 156 bits (395), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 126/218 (57%), Gaps = 20/218 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++V++W+PRAL F S +C II A L+ S + +G ++ RTSSG FI
Sbjct: 42 VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLDE--IRTSSGMFI 99
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D ++ +E ++A T LP +H E VLRYE+GQKY +H+D + E QM
Sbjct: 100 LKGHD--AVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRA 157
Query: 121 -------RLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKCI--GLKVKPRRG 167
R A+ L+YLSDVEEGGET FP +G +LD G Y +C G+ VKPR+G
Sbjct: 158 KGVLGGLRTATLLMYLSDVEEGGETAFP--HGRWLDEGVQAAPPYTECASKGVVVKPRKG 215
Query: 168 DGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWI 204
D +LF+SL NG D SLH CPV++G K+ ATKW+
Sbjct: 216 DAILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWV 253
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 156 bits (394), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 89/211 (42%), Positives = 128/211 (60%), Gaps = 11/211 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTF 59
++ +S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTF
Sbjct: 38 VKQVSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTF 95
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
IS +D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ +
Sbjct: 96 ISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGG 153
Query: 120 QRLASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFY 173
R+A+ LLYLS+V +GGET+FP + ++ D C G+ VKP++G+ LLF+
Sbjct: 154 HRIATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFF 213
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+L + D SLHG CPVI+GEKW ATKWI
Sbjct: 214 NLQQDAIPDPFSLHGGCPVIEGEKWSATKWI 244
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 156 bits (394), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 127/218 (58%), Gaps = 15/218 (6%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 62
+SW+PRA + S E+C+ I+ AK +K S + GE T RTS TF++
Sbjct: 83 ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEI--KTDPIRTSKQTFLAR 140
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQMS-- 119
K ++ +E +++R TMLP +GE +L Y +G+KY +H+D + G Q+S
Sbjct: 141 G--KYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSAD 198
Query: 120 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY---DYKKCI--GLKVKPRRGDGLLF 172
QR+A+ LLYL D EEGGET FP I +S Y + +C G+ KP+RGDGLLF
Sbjct: 199 GGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGLLF 258
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
+S+ P G ID+ S+H CPV+KG KW ATKWI + H
Sbjct: 259 FSITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPFH 296
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 155 bits (393), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 123/205 (60%), Gaps = 11/205 (5%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 177
+A+ L+YLSDVEEGGET+FP S + + +C GL VKP+ GD LLF+S+ P
Sbjct: 228 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 287
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATK 202
+G++D TSLHG P++ W+ T
Sbjct: 288 DGSLDATSLHGEIPIL----WLLTN 308
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 155 bits (393), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 118/202 (58%), Gaps = 11/202 (5%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQL--ALRQGETVESTKGTRTSSGTFISASED 65
PRA + F + E+C I+A +K L S + A G T T RTS+GTFIS + D
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGST---TSDIRTSTGTFISRAHD 57
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 125
T + IE +I + +P HGEA VLRYE GQ+Y +H+D F G + + R+A+
Sbjct: 58 PT--ITAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYF--FHKGGKRNNRIATV 113
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDR 183
LLYLSDVEEGGET+FP + Y +C G VK R+GD LLF+S+ P G +D
Sbjct: 114 LLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDP 173
Query: 184 TSLHGSCPVIKGEKWVATKWIR 205
S H CPVIKG KW ATKW+
Sbjct: 174 GSSHAGCPVIKGVKWTATKWMH 195
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 7/199 (3%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PRA + NF S ++C+ ++A +K +L S + Q S RTS+GTFIS D
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQ-TGGSSLSEVRTSTGTFISRKYDD- 322
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
I+ +E +I + +PQ+H EAF +LRYE GQ+Y +H+D F + R+A+ LL
Sbjct: 323 -IIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYF--FHKSGMRNNRIATVLL 379
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 185
YLSDVEEGGET+FP + + Y +C G +K R+GD LLF+S+ P G +D S
Sbjct: 380 YLSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKPGGELDAGS 439
Query: 186 LHGSCPVIKGEKWVATKWI 204
H CPVIKGEKW ATKW+
Sbjct: 440 SHAGCPVIKGEKWTATKWM 458
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/207 (43%), Positives = 123/207 (59%), Gaps = 11/207 (5%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 62
LS +P+A + F +C I AK +L+ S + + G++V S RTS G F
Sbjct: 8 LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSN--IRTSDGMFFDR 65
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 119
ED I+E IE +IA T +P +GE VLRYE+GQKY+ H DAF+ ++ + S
Sbjct: 66 HEDD--IIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSD-KFNTEESKGG 122
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 177
QR+A+ L+YLSDVEEGGET+FP + +C G+ VK R+GD LLF+SL
Sbjct: 123 QRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECAQRGVAVKARKGDALLFWSLDI 182
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWI 204
+ +D SLHG CPVIKG KW ATKW+
Sbjct: 183 DSNVDELSLHGGCPVIKGTKWSATKWM 209
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 119/206 (57%), Gaps = 5/206 (2%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M VLSW PR + + E+C ++ ++ RL+ S ++ + RTSSG F
Sbjct: 67 MVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVS-DATTGAGAVSDIRTSSGMFY 125
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
E T +++ IE+++A TMLP +GE VLRYE QKYD H+D F+
Sbjct: 126 ERGE--TELVKRIENRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFDGADDNGGN 183
Query: 121 RLASFLLYLSDVEEGGETMFPFENG--IFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L+YL+ EEGGET+FP G + L + GL VKP +GD +LF+S+ P+
Sbjct: 184 RMATVLMYLATPEEGGETVFPKVVGWVVQLTTTASAPCRQGLAVKPAKGDAVLFWSIRPD 243
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWI 204
G D SLHGSCPVIKG KW ATKWI
Sbjct: 244 GRFDPGSLHGSCPVIKGVKWSATKWI 269
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 118/214 (55%), Gaps = 14/214 (6%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS P L F S ++C I TA+ ++ S++ L + RTS FI
Sbjct: 7 LETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQSAFI 66
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ--- 117
A +D IL I+++ A +P+ H E VLRY++ +KYDSH D F+PA Y
Sbjct: 67 RAHDD--AILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTKDKRT 124
Query: 118 -------MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 170
R+A+ YLSDVE+GGET+FP NG S D K GLKVKP +G +
Sbjct: 125 LALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMKDCK--TGLKVKPEKGKVI 182
Query: 171 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+FYS+ P+G +D SLHG+CPV KG KW A KW+
Sbjct: 183 IFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/207 (42%), Positives = 122/207 (58%), Gaps = 7/207 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
M VLSW+PR + + E+C +I A+ RL+ S ++ GE RTSSG F
Sbjct: 50 MVVLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGEG--GVSDIRTSSGMF 107
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ E+ +++ IE ++A TMLP +GE VLRYE QKYD H+D F+
Sbjct: 108 YTRGEND--VVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFEGRDANGG 165
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 177
R+A+ L+YL+ EEGGET+FP + ++ +C GL VKP +GD +LF+S+ P
Sbjct: 166 NRMATVLMYLATPEEGGETVFPKIPVPAGQTRANFSECGMKGLAVKPVKGDAVLFWSIRP 225
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWI 204
+G + SLHGSCPVI+G KW ATKWI
Sbjct: 226 DGRFEPGSLHGSCPVIRGVKWSATKWI 252
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 125/219 (57%), Gaps = 26/219 (11%)
Query: 5 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 64
S +P+A NF S ++C ++ AK+ L PS + G +V S RTS+G F+ S+
Sbjct: 87 SKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGDGGSSVASE--IRTSAGMFLRKSQ 144
Query: 65 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPAEYGPQMSQ 120
D T + IE +IAR + +P +GE +LRY+ GQKYD H+D F NPA + Q
Sbjct: 145 DDT--VREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHDKVNPAPK--RGGQ 200
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLD------------SGYDYKKCI--GLKVKPRR 166
R+A+ L+YL D EEGGET FP NG + + + C G+ VK R
Sbjct: 201 RVATVLIYLVDTEEGGETTFP--NGRLPENFEEDEPDNPFAAHIKHTDCAKNGIPVKSVR 258
Query: 167 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
GD +LF+S+ +G +D SLHG+CPVI G+KW A KW+R
Sbjct: 259 GDAILFFSMTKDGELDHGSLHGACPVIAGQKWTAVKWLR 297
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 122/219 (55%), Gaps = 27/219 (12%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 57
+SW PRA + NF S E+C I+ A+ R+ R+ ++S G RTS
Sbjct: 1 VSWYPRAFHLHNFMSHEECDRILEIARPRV-------RRSTVIDSVTGQSKVDPIRTSEQ 53
Query: 58 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 116
TF++ I+ +E ++A T LP HGE +L+Y +GQKYD+H+D + G
Sbjct: 54 TFLN--RGTWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASGK 111
Query: 117 QMS----QRLASFLLYLSDVEEGGETMFPFENGIFLD-----SGYDYKKCI--GLKVKPR 165
Q++ R+A+ LLYLSDVEEGGET FP + + G + C + VKPR
Sbjct: 112 QLAAEGGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPR 171
Query: 166 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+GDGLLF+S+ ID S+H CPVI+GEKW ATKWI
Sbjct: 172 KGDGLLFWSVNNENAIDPHSMHAGCPVIRGEKWTATKWI 210
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 115/189 (60%), Gaps = 9/189 (4%)
Query: 25 IIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATML 83
+I AK + S + Q G++V S RTSSG F+ +DK +++ IE +IA +
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSR--VRTSSGMFLKRGKDK--VIQTIEKRIADFAFI 56
Query: 84 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 143
P +GE VL YE+GQKY+ HYD F QR+A+ L+YLSDVEEGGET+FP
Sbjct: 57 PVENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAA 116
Query: 144 NGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 199
F + D C GL VKP+RGD LLF+S+ P+ T+D +SLHG CPVI+G KW
Sbjct: 117 KANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWS 176
Query: 200 ATKWIRDQE 208
+TKW+ +E
Sbjct: 177 STKWMHLEE 185
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 126/208 (60%), Gaps = 14/208 (6%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSSGTFI 60
+VLS PRA + NF S E+C+ +I AK ++ S + G T + RTSSGTF+
Sbjct: 86 EVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVV--DGVTGQGILNSVRTSSGTFL 143
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFN---PAEYGP 116
+DK I++ +E +IA T +P +GE ++ YE+GQK++ HYD FN GP
Sbjct: 144 ERGKDK--IVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRITNNGGP 201
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ L+YLSDVEEGGET+FP F +S Y GL VKP+ GD LLF+S+
Sbjct: 202 ----RVATVLMYLSDVEEGGETVFPNAKPNF-NSVSKYHPGKGLVVKPKMGDALLFWSVK 256
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+G++D SLHG PVI+G KW + K +
Sbjct: 257 PDGSLDTASLHGGSPVIRGSKWASNKLL 284
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 153 bits (386), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 126/208 (60%), Gaps = 7/208 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD--VRTSSGMFV 118
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 119 NSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSDTFNLKRGGQ 178
Query: 121 RLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L+YL+D EGGET FP +G + G + GL VKP +GD +LF+S+ +G
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGDGECICGG---RLVRGLCVKPNKGDAVLFWSMGLDG 235
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D SLH C V+KGEKW ATKW+R +
Sbjct: 236 NTDSNSLHSGCAVVKGEKWSATKWMRQK 263
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 152 bits (385), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 126/222 (56%), Gaps = 24/222 (10%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 62
+SWRPRA + F S +C +I+ AK+ K + + GE+ ES T RTSSG F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103
Query: 63 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 105
+D+ ++ IE +IA TMLP +GE+ +LRY G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161
Query: 106 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKV 162
+D + + + R+A+ L+YLS+V+ GGET+FP E + + C G V
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAV 221
Query: 163 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
KP +G +LF+SL PN T+D SLHGSCPVI+GEKW ATKWI
Sbjct: 222 KPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWI 263
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 152 bits (385), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 117/204 (57%), Gaps = 18/204 (8%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+S +PR + +F S ++ +I+ A+ LK S +A ++ G T S
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKST-------LS 99
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
ED I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R A
Sbjct: 100 EDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYA 157
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 181
+ LLYL+DV EGGET+FP +C G+ V+PR+GD LLF++L P+GT
Sbjct: 158 TVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGTT 217
Query: 182 DRTSLHGSCPVIKGEKWVATKWIR 205
D SLHG CPVIKGEKW ATKWIR
Sbjct: 218 DSVSLHGGCPVIKGEKWSATKWIR 241
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 152 bits (384), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 90/212 (42%), Positives = 126/212 (59%), Gaps = 16/212 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LSW P A + F + +C+ I A LKPS + + +++ RTSSG F+
Sbjct: 26 IERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTV-VDASTGGDASSEIRTSSGMFL 84
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYG 115
+ED ++E IE +IA T +P++HGE F VLRYE Q+Y +HYD F+ E G
Sbjct: 85 GRAEDD--VIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKFNVKREKG 142
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLF 172
QR+ + L+YLSDVEEGGET+FP FE+G +G + +C L V+PR+GD L F
Sbjct: 143 ---GQRMGTVLMYLSDVEEGGETVFPKFEDGT--PAGSEASECARNKLAVRPRKGDALFF 197
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
SL +G D S H CPVI+G K+ ATKW+
Sbjct: 198 RSLRHDGVPDTFSEHAGCPVIRGVKFSATKWM 229
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/193 (43%), Positives = 120/193 (62%), Gaps = 9/193 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++++SW PRA + NF + E+C+ +I AK ++ S + + G++ +S RTSSGTF
Sbjct: 77 VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTF 134
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F
Sbjct: 135 LARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 192
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G + + + +C GL VKP+ GD LLF+S+
Sbjct: 193 QRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSM 252
Query: 176 FPNGTIDRTSLHG 188
P+ T+D +SLHG
Sbjct: 253 TPDATLDPSSLHG 265
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/193 (43%), Positives = 120/193 (62%), Gaps = 9/193 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++++SW PRA + NF + E+C+ +I AK ++ S + + G++ +S RTSSGTF
Sbjct: 78 VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTF 135
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F
Sbjct: 136 LARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 193
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 175
QR+A+ L+YLSDVEEGGET+FP G + + + +C GL VKP+ GD LLF+S+
Sbjct: 194 QRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSM 253
Query: 176 FPNGTIDRTSLHG 188
P+ T+D +SLHG
Sbjct: 254 TPDATLDPSSLHG 266
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 122/203 (60%), Gaps = 20/203 (9%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 62
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ RTSSGTFIS
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 98
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ LLYLS+V +GGET+FP D + C+ KP++G+ LLF++L + D
Sbjct: 157 ATVLLYLSNVTKGGETVFP-----------DAQVCL----KPKKGNALLFFNLQQDAIPD 201
Query: 183 RTSLHGSCPVIKGEKWVATKWIR 205
SLHG CPVI+GEKW ATKWI
Sbjct: 202 PFSLHGGCPVIEGEKWSATKWIH 224
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 123/207 (59%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSD--VRTSSGMFV 118
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 119 NSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQ 178
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL+D EGGET FP G + GL VKP +GD +LF+S+ +G
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGRIVR--GLCVKPNKGDAVLFWSMGLDGN 236
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D S+H C V+KGEKW ATKW+R +
Sbjct: 237 TDSNSIHSGCAVLKGEKWSATKWMRQK 263
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 116/206 (56%), Gaps = 5/206 (2%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++VLSW PR + + E+C ++ A RL S + ES RTS G F
Sbjct: 16 LKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGMFF 75
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
ED+ ++ +E +++ +++P HGE VLRYE G++Y H+D F
Sbjct: 76 DRGEDE--VVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNGGN 133
Query: 121 RLASFLLYLSDVEEGGETMFPFENGI---FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
RLA+ L+YL++ E GGET+FP L++GY GL VKPR+GD +LF+SL
Sbjct: 134 RLATILMYLAEPEFGGETVFPNVKAPPEQTLEAGYSECATQGLAVKPRKGDAVLFFSLRT 193
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKW 203
GT+D+ SLHGSCP +KG K+ ATKW
Sbjct: 194 EGTLDKGSLHGSCPTLKGFKFAATKW 219
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 123/207 (59%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+++SW PR + +F S+E+C + A AK RL+ S + ++ G+ +ES RTSSG F+
Sbjct: 82 EIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESK--VRTSSGMFL 139
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S+ E +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 140 SSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YLSD EGGET FP G K GL VKP +G+ +LF+S+ +G
Sbjct: 200 RVATMLMYLSDNVEGGETYFPMAGSGKCSCG--GKVVDGLSVKPIKGNAVLFWSMGLDGQ 257
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +S+HG C V+ G KW ATKW+R +
Sbjct: 258 SDPSSIHGGCEVLSGVKWSATKWMRQR 284
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 85/216 (39%), Positives = 124/216 (57%), Gaps = 16/216 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS RP + F S E+C I A ++K S ++L+ + + + RTS F+
Sbjct: 261 IETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSSEWRTSQSAFL 320
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
SA +D+ +L I+H++A T +P+ H E VLRY G+KYDSH+D F+P+ Y S
Sbjct: 321 SARDDE--VLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFDPSAYRSDKST 378
Query: 120 ---------QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-IGLKVKPRRGDG 169
R A+ YL+DV +GGET+FP G + +K C IGLKVKP++G
Sbjct: 379 LRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGA--PAPRSHKDCSIGLKVKPQKGKV 436
Query: 170 LLFYSLFPNGTIDRTSLHGSCPVIKGE-KWVATKWI 204
++FYSL +G +D SLHG+CPV + KW A KWI
Sbjct: 437 VIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 89/215 (41%), Positives = 120/215 (55%), Gaps = 16/215 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+Q + PRA YF NF + + ++ A +LK S + GE V RTS G FI
Sbjct: 1 VQQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGV--VDNIRTSYGMFI 58
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMS 119
+D ++ IE +I+ T LP H E VLRY GQ Y +HYD+ + + E GP+
Sbjct: 59 RRLQDP--VVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW- 115
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDS------GYDYKKCI--GLKVKPRRGDGLL 171
RLA+FL+YLSDVEEGGET FP N ++ D G + C + KP+ GD +L
Sbjct: 116 -RLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVL 173
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
FYS +PN T+D ++H CPVIKG KW A W+ D
Sbjct: 174 FYSFYPNMTMDPAAMHTGCPVIKGVKWAAPVWMHD 208
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 149 bits (377), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 121/207 (58%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + +F S E+C+ + A A+ RL+ S + ++ G+ V+S RTSSG F+
Sbjct: 78 EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRTSSGMFL 135
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ E I++ IE +IA + +P +GE VLRYE Q Y H+D F + Q
Sbjct: 136 THVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFNLKRGGQ 195
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL+D EGGET FP G K G+ VKP +GD +LF+S+ +G
Sbjct: 196 RVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMK--GISVKPTKGDAVLFWSMGLDGQ 253
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D S+HG C V+ GEKW ATKW+R +
Sbjct: 254 SDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 149 bits (376), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 81/174 (46%), Positives = 105/174 (60%), Gaps = 12/174 (6%)
Query: 45 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 98
V+ST G RTSSG F+ DK ++ +IE +IA T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEV 63
Query: 99 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKK 156
GQKY+ H+D F QR+A+ L+YLSDVEEGGET+FP N L + +
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSE 123
Query: 157 CI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
C GL VKP+ GD LLF+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 124 CAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 149 bits (376), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 82/179 (45%), Positives = 105/179 (58%), Gaps = 12/179 (6%)
Query: 40 LRQGETVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 93
+ + V+ST G RTSSG F+ DK ++ IE +IA T +P HGE V
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQV 58
Query: 94 LRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSG 151
L YE+GQKY+ H+D F QR+A+ L+YLSDVEEGGET+FP N L
Sbjct: 59 LHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWY 118
Query: 152 YDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ C GL VKP+ GD LLF+S+ P+ T+D SLHG CPVIKG KW +TKW+ E
Sbjct: 119 NELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 177
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 121/207 (58%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + +F S E+C+ + A A+ RL+ S + ++ G+ V+S RTSSG F+
Sbjct: 78 EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRTSSGMFL 135
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ E I++ IE +IA + +P +GE VLRYE Q Y H+D F + Q
Sbjct: 136 THVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFNLKRGGQ 195
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL+D EGGET FP G K G+ VKP +GD +LF+S+ +G
Sbjct: 196 RVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMK--GISVKPTKGDAVLFWSMGLDGQ 253
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D S+HG C V+ GEKW ATKW+R +
Sbjct: 254 SDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 118/205 (57%), Gaps = 5/205 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YLSD EGGET FP G K GL VKP +G+ +LF+S+ +G
Sbjct: 200 RIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVK--GLSVKPIKGNAVLFWSMGLDGQ 257
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D S+HG C VI GEKW ATKW+R
Sbjct: 258 SDPNSVHGGCEVISGEKWSATKWMR 282
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 116/205 (56%), Gaps = 5/205 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+VLSW PR + NF S E+C + A RLK S + G+ ++S RTSSG F+
Sbjct: 76 EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSD--VRTSSGMFL 133
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S E K ++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YL D EGGET FP G K GL VKP +G+ +LF+S+ +G
Sbjct: 194 RIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTK--GLCVKPVKGNAVLFWSMGLDGQ 251
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D S+HG CPV+ GEKW ATKW+R
Sbjct: 252 SDPDSVHGGCPVLAGEKWSATKWMR 276
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PRA + NF S ++C +I AK R++ S ++ + RTSSG F++ +++
Sbjct: 83 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 141
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
++ IE +IA T +P +GE ++L YE+GQK++ H+D +P + + QR A+ +
Sbjct: 142 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 200
Query: 127 LYLSDVEEGGETMFPFENGI------FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
+YLS V+EGG T+FP + +Y K GL VKP+ GD LLF+S+ P+GT
Sbjct: 201 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 260
Query: 181 IDRTSLHGSCPVIKGEKWVATK 202
+D TSLH S PV+KG+KWV K
Sbjct: 261 LDPTSLHASSPVVKGDKWVGVK 282
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/73 (47%), Positives = 49/73 (67%), Gaps = 6/73 (8%)
Query: 131 DVEEGGETMFPFENGIFLDSGYDYKKCI-----GLKVKPRRGDGLLFYSLFPNGTIDRTS 185
++EEGGET+FP N + +KK GL +KP+ GD L F+S+ P+GT+D TS
Sbjct: 11 NIEEGGETVFPAANQCVSSVPW-WKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTS 69
Query: 186 LHGSCPVIKGEKW 198
LHGS PVI+G++W
Sbjct: 70 LHGSYPVIRGDEW 82
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PRA + NF S ++C +I AK R++ S ++ + RTSSG F++ +++
Sbjct: 74 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 132
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
++ IE +IA T +P +GE ++L YE+GQK++ H+D +P + + QR A+ +
Sbjct: 133 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 191
Query: 127 LYLSDVEEGGETMFPFENGI------FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
+YLS V+EGG T+FP + +Y K GL VKP+ GD LLF+S+ P+GT
Sbjct: 192 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 251
Query: 181 IDRTSLHGSCPVIKGEKWVATK 202
+D TSLH S PV+KG+KWV K
Sbjct: 252 LDPTSLHASSPVVKGDKWVGVK 273
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 6/63 (9%)
Query: 131 DVEEGGETMFPFENGIFLDSGYDYKKCI-----GLKVKPRRGDGLLFYSLFPNGTIDRTS 185
++EEGGET+FP N + +KK GL +KP+ GD L F+S+ P+GT+D TS
Sbjct: 11 NIEEGGETVFPAANKCVSSVPW-WKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTS 69
Query: 186 LHG 188
LH
Sbjct: 70 LHA 72
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 146 bits (369), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 80/176 (45%), Positives = 105/176 (59%), Gaps = 16/176 (9%)
Query: 45 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 98
V+ST G RTSSG F+ DK ++ +IE +I T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRITDYTFIPVDHGEGLQVLHYEV 63
Query: 99 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDY---- 154
GQKY+ H+D F QR+A+ L++LSDVEEGGET+FP N DS +
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDAN--VNDSSLPWYNEL 121
Query: 155 KKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+C GL VKP+ GD LLF+S+ P+ T+D SLHG CPVI+G KW +TKW+ E
Sbjct: 122 SECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 146 bits (369), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/212 (39%), Positives = 119/212 (56%), Gaps = 21/212 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTS 55
++ LSW+PR + F S ++C ++ AKK G V + TRTS
Sbjct: 48 VKALSWQPRIFVYKGFLSDDECDHLVTLAKK-----------GTMVAHNRSSYYRQTRTS 96
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ +D ++ IE +IA T+LP+ + E + RY+ GQKYD H+D F+ +
Sbjct: 97 SGMFLRKRQDP--VVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHH 154
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLF 172
+ R A+ L+YLS V++GGET+FP G D + +C GL VKP +GD +LF
Sbjct: 155 TRGGPRYATVLMYLSTVDKGGETVFPKAKGWESQPKDDTFSECAHKGLAVKPVKGDAVLF 214
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+SL +G D +LHGSCPVI+GEKW A WI
Sbjct: 215 FSLHVDGGPDPLTLHGSCPVIQGEKWSAPNWI 246
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 146 bits (368), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 85/201 (42%), Positives = 113/201 (56%), Gaps = 19/201 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 172 FYSLFPNGTIDRTSLHGSCPV 192
F+S+ P+ T+D SLH + V
Sbjct: 271 FWSMKPDATLDPLSLHDTLRV 291
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 116/207 (56%), Gaps = 5/207 (2%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+QVL R + NF + E+C II A+ + S + + RTS GTF+
Sbjct: 63 VQVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGV-VETDSGKSKIDNVRTSKGTFL 121
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ D ++ IE +IA+ T++P +GE VL+YE GQ+Y+ HYD F
Sbjct: 122 NRGHDS--VIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANGGN 179
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG--LKVKPRRGDGLLFYSLFPN 178
R + L+YL+DVEEGGET FP D+G ++ +C L KP++G+ +LF+S+ P
Sbjct: 180 RYLTVLMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKPT 239
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIR 205
G ++R SLH +CPVIKG KW A KW+
Sbjct: 240 GELERRSLHTACPVIKGVKWSAPKWVH 266
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 145 bits (367), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 84/196 (42%), Positives = 111/196 (56%), Gaps = 19/196 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 171
QR+A+ L+YLSDVEEGGET+FP N L + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 172 FYSLFPNGTIDRTSLH 187
F+S+ P+ T+D SLH
Sbjct: 271 FWSMKPDATLDPLSLH 286
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 145 bits (367), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 121/210 (57%), Gaps = 5/210 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 60
++L+W PR + +F S+E+C + A A+ L+ S + Q G+ ++S RTSSG F+
Sbjct: 82 EILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQSD--VRTSSGMFL 139
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S + I+ IE +I+ + +P +GE VLRY+ Q Y H+D F+ + + Q
Sbjct: 140 SPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDYFSDSFNLKRGGQ 199
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YLSD EGGET FP F G K GL V P +G+ +LF+S+ +G
Sbjct: 200 RVATMLIYLSDNVEGGETYFPMAGSGFCRCG--GKSVRGLSVAPVKGNAVLFWSMGLDGQ 257
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
D S+HG C V+ GEKW ATKW+R + H
Sbjct: 258 SDPNSIHGGCEVLAGEKWSATKWMRQRSTH 287
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 145 bits (366), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 26/206 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW+PRA F NF + E+ I+A AK +K S + G +VE RTS GTF+
Sbjct: 32 VEPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVEDQ--IRTSYGTFL 89
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+D I+ +E ++A T L +H E +LRY IGQKY +HYD+ + S
Sbjct: 90 KRLQDP--IVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYDSLD------NDSP 141
Query: 121 RLASFLLYLSDV--EEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+ + LLYLSDV + GGET FP G+ + Y P++GD LLFYSL P+
Sbjct: 142 RVCTVLLYLSDVPADGGGETAFP---GVRRQALY-----------PKKGDALLFYSLKPD 187
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWI 204
GT D SLH CP+I G KW ATKWI
Sbjct: 188 GTSDAYSLHTGCPIISGVKWTATKWI 213
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 114/205 (55%), Gaps = 8/205 (3%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
VL R + F + E+C I A+KRL+ S + + G RTS G F
Sbjct: 38 VLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGV-VDTGSGGSVVSDIRTSDGMFFER 96
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED I+E +E ++A TM P GE+ VLRY QKYDSH+D F + R
Sbjct: 97 GED--AIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRW 154
Query: 123 ASFLLYLSDVEEGGETMF---PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ LLYL++ EEGGET+F P NGI + G+ L VKP +GD LLF+S+ P G
Sbjct: 155 ATVLLYLTETEEGGETVFPKIPAPNGI--NVGFSECAKYNLAVKPHKGDALLFHSMKPTG 212
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWI 204
++ S+HG+CPVI+GEK+ TKWI
Sbjct: 213 ELEERSMHGACPVIRGEKFSMTKWI 237
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 98/157 (62%), Gaps = 5/157 (3%)
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 111
RTSSG F+ +D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+
Sbjct: 15 VRTSSGMFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHD 72
Query: 112 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGD 168
R+A+ L+YLSDV +GGET+FP G L D + C G VKP +GD
Sbjct: 73 KNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGD 132
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
LLF+SL P+ T D SLHGSCPVI+G+KW ATKWI
Sbjct: 133 ALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIH 169
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/165 (48%), Positives = 105/165 (63%), Gaps = 7/165 (4%)
Query: 43 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 102
GE+ +S RTSSG F++ +D I+ +E K+A T LP+ +GEA +L YE GQKY
Sbjct: 9 GESEDSE--VRTSSGMFLTKRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKY 64
Query: 103 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--G 159
D H+D F + R+A+ L+YLS+V +GGET+FP G D + KC G
Sbjct: 65 DPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQG 124
Query: 160 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
VKPR+GD LLF++L NGT D SLHGSCPVI+GEKW AT+WI
Sbjct: 125 YAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWI 169
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 88/212 (41%), Positives = 118/212 (55%), Gaps = 21/212 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTS 55
MQVL R F NF + E+C I+A AK L+ R G +T G+ RTS
Sbjct: 34 MQVLDAEAR--IFINFLTEEECDHIVALAKPHLE------RSGVVDTATGGSEISDIRTS 85
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY-DAFNPAEY 114
G F+ D T + IE +IAR T+LP +GE VL Y G+KYD ++ D N
Sbjct: 86 KGMFLERGHDDT--VAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYDDYFFDKVNGESN 143
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLF 172
G R A+ L+YL+ VEEGGET+FP D+G + +C L KP +G +LF
Sbjct: 144 G---GNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLF 200
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+S+ P+G ++R SLH +CPV+KGEKW A KWI
Sbjct: 201 HSIKPSGDLERRSLHTACPVVKGEKWSAPKWI 232
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 116/194 (59%), Gaps = 21/194 (10%)
Query: 16 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 75
+ E+C+ +I AK + S + G++V+++ RTSSGTFI+ DK IL IE
Sbjct: 12 ITTKEECEHLINIAKPSMHKSTVDDETGKSVDNS--ARTSSGTFINRGHDK--ILRNIEQ 67
Query: 76 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 135
+IA T +P +GE+ N+L YE+GQKY+ H D F +++ + E+G
Sbjct: 68 RIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTD-----EINTKNGG--------EQG 114
Query: 136 GETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 191
GET+FPF G F + + C GL +KP+ GD LLF+S+ P+GT+D S+HG+CP
Sbjct: 115 GETVFPFAEGNFSSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACP 174
Query: 192 VIKGEKWVATKWIR 205
VIKG+KW TKW+R
Sbjct: 175 VIKGDKWSCTKWMR 188
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 117/216 (54%), Gaps = 16/216 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS +P L F + E+C I+ A +K S ++L+ + RTS TF+
Sbjct: 267 IETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKGRPASDWRTSQSTFV 326
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--GPQM 118
+A D IL IE + A T +P TH E VLRY + +KYD+H+D F+P+ Y P
Sbjct: 327 AAMGDP--ILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHHDFFDPSSYRSDPGT 384
Query: 119 SQ--------RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 170
Q R A+ YL+DV GGET FP G D+ C GLKVKP++G +
Sbjct: 385 LQLIENGKKNRYATVFWYLTDVARGGETCFPRHGGA--PPPRDFSMCTGLKVKPQKGKVI 442
Query: 171 LFYSLFPNGTIDRTSLHGSCPVIKGE--KWVATKWI 204
+FYSL +G +D SLHG+CPV+ E KW A KW+
Sbjct: 443 IFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 81/219 (36%), Positives = 118/219 (53%), Gaps = 25/219 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRT 54
++ LSW PRA + + QC++++ + R+ R+ V+S G RT
Sbjct: 3 VEPLSWYPRAFALRDALTEAQCEAVLRATRARV-------RRSTVVDSVTGESKVDPIRT 55
Query: 55 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AF 109
S TF++ E+ ++ I ++ TMLP TH E VL Y +G+KYD+H D +
Sbjct: 56 SKQTFLNRDEE---VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSL 112
Query: 110 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI--FLDSGYDYKKCIGLKV--KPR 165
+ E +R+A+ LLYL + E GGET FP I + G + KC +V KPR
Sbjct: 113 SGRELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTSWSKCAEHRVAMKPR 172
Query: 166 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
RGDGL+F+S+ PNG ID +LH CPV+ G KW AT W+
Sbjct: 173 RGDGLIFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWV 211
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 85/215 (39%), Positives = 117/215 (54%), Gaps = 16/215 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+Q + PRA YF NF + + ++ A +LK S + + RTS G FI
Sbjct: 19 VQQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGK--GEGVVDDIRTSYGMFI 76
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMS 119
D ++ IE +I+ T LP H E +LRY GQ Y +HYD+ +++ GP+
Sbjct: 77 RRLSDP--VVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVGPKW- 133
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDS------GYDYKKCIG--LKVKPRRGDGLL 171
RLA+FL+YLSDVEEGGET FP N ++ D G + C + KP+ GD +L
Sbjct: 134 -RLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVL 191
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
FYS +PN T+D S+H CPVIKG KW A W+ D
Sbjct: 192 FYSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMHD 226
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 112/200 (56%), Gaps = 10/200 (5%)
Query: 9 RALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKT 67
R + F + E+C I A+KRL+ S + G +V S RTS G F ED
Sbjct: 44 RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSD--IRTSDGMFFERGED-- 99
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
ILE +E ++A TM P GEA VLRY QKYDSH + F E R A+ L
Sbjct: 100 AILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLT 159
Query: 128 YLSDVEEGGETMF---PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
YL+D EEGGET+F P G+ + G+ L VKPR+GD +LF+S+ NG ++
Sbjct: 160 YLTDTEEGGETVFPKIPAPGGV--NVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLEER 217
Query: 185 SLHGSCPVIKGEKWVATKWI 204
SLHG+CPVIKGEK+ TKWI
Sbjct: 218 SLHGACPVIKGEKFSMTKWI 237
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 142 bits (358), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 86/207 (41%), Positives = 116/207 (56%), Gaps = 15/207 (7%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PRA F NF + + ++ A +LK S + +GE V RTS G FI D
Sbjct: 55 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSKGEGV--VDNIRTSFGMFIRRLSDP- 111
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQRLASFL 126
I+ IE +I+ T LP H E VLRY GQ Y +HYD+ +++ GP+ RLA+FL
Sbjct: 112 -IIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVGPKW--RLATFL 168
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCIG--LKVKPRRGDGLLFYSLFPNG 179
+YLSDVEEGGET FP +N ++ D + +C + KP+ GD +LFYS PN
Sbjct: 169 MYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFLPNN 227
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRD 206
T+D ++H CPVIKG KW A W+ D
Sbjct: 228 TMDPAAMHTGCPVIKGIKWAAPVWMHD 254
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/216 (37%), Positives = 116/216 (53%), Gaps = 12/216 (5%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PRA + NF ++ +C ++ +R+ S L + RTS G
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRS-LVVDSQTGQSKLDDIRTSYGAAF 551
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM- 118
ED ++ IE +IA T LP HGE +LRY GQKYD+H+D F +P + +
Sbjct: 552 GRGEDP--VIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLV 609
Query: 119 -SQRLASFLLYLSDVEEGGETMFPFENGI-----FLDSGYDYKKCIGLKVKPRRGDGLLF 172
R A+ LLYLS+VE GGET P + I +++ +GL ++PR+GD LLF
Sbjct: 610 DGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENPSPCAAKMGLSIRPRKGDALLF 669
Query: 173 YSLFPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQ 207
Y + G DR +LH SCP +KG KW ATKWI +
Sbjct: 670 YDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSK 705
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 95/149 (63%), Gaps = 3/149 (2%)
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
+S + K I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+
Sbjct: 10 MLSPPQPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLG 69
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSL 175
QR+A+ L+YL+DV +GGET+FP G L D + GL VKP++GD LLF++L
Sbjct: 70 GQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNL 129
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
N T D SLHGSCPVI+GEKW ATKWI
Sbjct: 130 HVNATADTGSLHGSCPVIEGEKWSATKWI 158
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 118/207 (57%), Gaps = 5/207 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + NF S ++C + A RL+ S + + G+ V+S RTSSG F+
Sbjct: 83 EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSD--FRTSSGMFL 140
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S E +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 141 SHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 200
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L+YLS+ EGGET FP G K GL VKP +GD +LF+S+ +G
Sbjct: 201 RIATMLMYLSENIEGGETYFPKAGSGECSCG--GKTVPGLSVKPAKGDAVLFWSMGLDGQ 258
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D S+HG C V+ GEKW ATKW+R +
Sbjct: 259 SDPKSIHGGCEVLSGEKWSATKWMRQK 285
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 118/209 (56%), Gaps = 13/209 (6%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY----DSHYDAFNPAEYGP 116
++ E K +++ IE +I+ + +P +GE VLRYE Q Y D +D FN G
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGG- 198
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
Q +A+ L+YLSD EGGET FP G K GL VKP +G+ +LF+S+
Sbjct: 199 ---QGIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVK--GLSVKPIKGNAVLFWSMG 253
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G D S+HG C VI GEKW ATKW+R
Sbjct: 254 LDGQSDPNSVHGGCEVISGEKWSATKWLR 282
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/201 (37%), Positives = 116/201 (57%), Gaps = 6/201 (2%)
Query: 11 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 70
L F S ++C +I A RL+ S + + + ++ RTS G F+ D I+
Sbjct: 1 LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSR-NRTSWGAFLRRDHDN--IV 57
Query: 71 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 130
IE +I+ T +P+ +GE+ V+RY+ GQK++ H D + E R+ + LLYL+
Sbjct: 58 SGIEDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLT 117
Query: 131 DVEEGGETMFPFE-NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
+VE GGET+FP + D + +C G+ ++PRRGDGLLF+ P+G ID S H
Sbjct: 118 NVENGGETVFPRALANVINDYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSFH 177
Query: 188 GSCPVIKGEKWVATKWIRDQE 208
G CPV+KGEKW+ATK++ + E
Sbjct: 178 GGCPVVKGEKWLATKFLHEHE 198
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 139 bits (350), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 123/218 (56%), Gaps = 20/218 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +S PRA + NF + E+ ++ IA A++ ++ S++ + + + T RTSSG ++
Sbjct: 78 IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEV-VNEADGTSKTSDERTSSGGWV 136
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S + + ++ IE ++A TMLP+ GE V+RYE GQ+Y +H D F+ Q
Sbjct: 137 SGEDSE--VMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQ 194
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG---------------LKVKPR 165
R A+ L+YLSDVEEGGET+FP G L K + L VKPR
Sbjct: 195 RAATVLMYLSDVEEGGETVFP--RGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPR 252
Query: 166 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 203
RGD LLF+++ NG +D + H CPV++G KW AT+W
Sbjct: 253 RGDALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRW 290
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 113/208 (54%), Gaps = 8/208 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+ LS P NF E+C+ I A +KPS ++L + + RTS+ F+
Sbjct: 193 MKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPDTNWRTSTTYFM 252
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---EYGPQ 117
++ D +L+ I+ ++ T +P++H E VL+Y+ GQ+Y +H+D +
Sbjct: 253 PSTRDP--LLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDERTMRNMDGG 310
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDGLLFYSLF 176
R+ + YLSDVEEGGET+FP G D+ C GLKVKP G +FYSL
Sbjct: 311 RKNRMITVFWYLSDVEEGGETIFPRYGG--RTGRVDFSDCTTGLKVKPVEGKVAMFYSLK 368
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+G D SLHG+CPVI G+KW A KW+
Sbjct: 369 PDGQFDDFSLHGACPVITGQKWAANKWV 396
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/204 (38%), Positives = 112/204 (54%), Gaps = 7/204 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 62
+SW PRA + S ++C II A+ + K + L + + V + R + +I
Sbjct: 56 ISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNK--LRNNKEAYIDG 113
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
S D +++ IE +IAR T LP HGE F++++Y GQ Y H D + + ++R+
Sbjct: 114 SADD--VIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPRLGNERI 171
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGT 180
A+ ++YLSDV EGGET+FP Y KC G+ VKP +GD LL Y+L NG
Sbjct: 172 ATMIIYLSDVVEGGETVFPNSTMQPHVGDAAYSKCAQQGIAVKPVKGDALLLYNLLENGR 231
Query: 181 IDRTSLHGSCPVIKGEKWVATKWI 204
D SLH CPVI+G KW ATK I
Sbjct: 232 NDGESLHQGCPVIRGVKWTATKRI 255
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 120/217 (55%), Gaps = 16/217 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PRA + F S +C +I A +L+ S + + + V+ RTS I
Sbjct: 38 IETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDPI---RTSYSASI 94
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQMS 119
+E T ++ IE +IAR T LP++H E VLRY GQKYD+H+D F+ G
Sbjct: 95 GYNE--TDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWFDETETGGTGGG 152
Query: 120 QRLASFLLYLSDVE--EGGETMFPFENGIFLD----SGYDYKKC---IGLKVKPRRGDGL 170
R+A+ L+YLSD+E GGET P + + G Y +C +G+ V+P++GD L
Sbjct: 153 NRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGRGYSECASKMGISVRPKKGDVL 212
Query: 171 LFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWIRD 206
LF+ + P G DR +LH SCP G KW ATKWI +
Sbjct: 213 LFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIHN 249
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 117/219 (53%), Gaps = 18/219 (8%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PR + NF S +C+ I TA +K S + G +V T RTS GTFI
Sbjct: 2 IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTNGSSVLDT--IRTSYGTFI 59
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D ++E + ++A T P + E VLRY GQKY +H D+ S
Sbjct: 60 RRRHDP--VVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLI------DDSP 111
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLD-----SGYDYKKCIGLKV--KPRRGDGLLFY 173
R+A+ LLYL D E GGET FP ++G +LD S + +C V +P++GD L+F+
Sbjct: 112 RMATVLLYLHDTEYGGETAFP-DSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFW 170
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 212
S+ P+GT D SLH CPV+ G KW AT W+ + D
Sbjct: 171 SIKPDGTHDPLSLHTGCPVVTGVKWTATSWVHSMPYNYD 209
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 124/226 (54%), Gaps = 32/226 (14%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRT 54
++ +SW PRA + N + +C ++ A+ R+ R+ V+ST G RT
Sbjct: 1 VEPISWHPRAFHLHNIMTDAECDEVLELARTRV-------RRSTVVDSTTGESKVDPIRT 53
Query: 55 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN-----VLRYEIGQKYDSHYDAF 109
S F++ I+ +IE ++ R TMLP +GE VL+Y GQKYD+H+D
Sbjct: 54 SEQCFLNRGH--FPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVG 111
Query: 110 N-PAEYGPQMS----QRLASFLLYLSDVEE--GGETMFPFENGI--FLDSGYDYKKCI-- 158
G Q++ R+A+ LLYLSDV++ GGET FP I D G + +C
Sbjct: 112 ELDTASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAED 171
Query: 159 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+ VKP++GDGLLF+S+ P G ID+ S+H CPV+ G+ W ATKWI
Sbjct: 172 HVAVKPKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWI 216
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 114/207 (55%), Gaps = 27/207 (13%)
Query: 1 MQVLSW--RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C+++I +K ++K S++ + + T RTSSG
Sbjct: 33 IQIISRLEEPLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR-----KTNDIRTSSGA 87
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
F+ SE I IE +IA +P HGE +L+Y +GQ+Y +HYD F
Sbjct: 88 FLEESE----ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAAS 142
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
+ R+++ ++YL+ VEEGGET FP + L V P++G + F + +
Sbjct: 143 NNRMSTLVMYLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQD 187
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIR 205
+I++ +LHG PVIKGEKWVAT+W+R
Sbjct: 188 ESINKLTLHGGAPVIKGEKWVATQWMR 214
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 115/209 (55%), Gaps = 8/209 (3%)
Query: 1 MQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 59
+QV+S PRA F S +C ++ A+ + S + + S RTS+G+F
Sbjct: 158 IQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGV-VDASNGGSSFSNIRTSTGSF 216
Query: 60 ISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 117
+ ++ IE +IA T +P HGE VLRY+IGQ+Y SH+D F G
Sbjct: 217 VPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYF--FHEGGM 274
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSL 175
+ R+A+ L+YLSDV++GGET+FP + + + C G+ V P++GD +LF+++
Sbjct: 275 KNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNM 334
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
G +D S H CPV+ GEKW ATKW+
Sbjct: 335 KVGGDLDGGSTHAGCPVVLGEKWTATKWL 363
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 25/198 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C+++I +K ++K S++ + + T RTSSG F+ SE
Sbjct: 42 PLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR-----KTNDIRTSSGAFLEESE--- 93
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
I IE +IA +P HGE +L+Y +GQ+Y +HYD F + R+++ ++
Sbjct: 94 -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+ VEEGGET FP + L V P++G + F + + +I++ +LH
Sbjct: 152 YLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQDESINKLTLH 196
Query: 188 GSCPVIKGEKWVATKWIR 205
G PVIKGEKWVAT+W+R
Sbjct: 197 GGAPVIKGEKWVATQWMR 214
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 112/206 (54%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP 154
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP G QR
Sbjct: 155 --LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQR 212
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 213 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRER 283
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 112/206 (54%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP 154
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP G QR
Sbjct: 155 --LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQR 212
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 213 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRER 283
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 112/206 (54%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP 154
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP G QR
Sbjct: 155 --LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQR 212
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 213 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRER 283
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 117/206 (56%), Gaps = 11/206 (5%)
Query: 9 RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 68
R +F + E+ I+ +++RL+ S + G + ES RTS G F+ ED
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQ--IRTSFGVFLERGEDP-- 56
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 128
+++ +E +I+ T++P +GE VLRY+ QKYD+H+D F + R A+ L+Y
Sbjct: 57 VVKGVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMY 116
Query: 129 LSDVEEGGETMFPFENGIFLDSGYD--YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRT 184
L D EEGGET+FP I G + + +C L KP++G +LF+S+ P G ++R
Sbjct: 117 LVDTEEGGETVFP---NIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQH 210
SLH +CPVIKG KW A KWI + Q+
Sbjct: 174 SLHTACPVIKGIKWSAAKWIHVKPQN 199
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 126/222 (56%), Gaps = 36/222 (16%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGT------- 52
++V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S T
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVF 139
Query: 53 ----------------------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 90
RTSSGTF++ D+ I+E IE++I+ T +P +GE
Sbjct: 140 TFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDE--IVEEIENRISDFTFIPPENGEG 197
Query: 91 FNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDS 150
VL YE+GQ+Y+ H+D F + QR+A+ L+YLSDV+EGGET+FP G D
Sbjct: 198 LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDV 257
Query: 151 GY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
+ + +C GL V P++ D LLF+S+ P+ ++D +SLHG
Sbjct: 258 PWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHG 299
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 113/207 (54%), Gaps = 23/207 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P + F NF + +C +++ ++ L PS++ Q E K +RTS GT + E
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFE-LKPSRTSGGTHFARGE-- 158
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
T ++ IE +IA +P+ HGE +L Y + +Y HYD F+P + G Q QR
Sbjct: 159 TPLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQR 218
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ + ++YLSDVE GG T+FP +GL+V+P++G L F + +G +
Sbjct: 219 VGTLIMYLSDVESGGATVFP---------------RVGLEVQPQKGAALFFSYVGEHGKL 263
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
D SLHG PV+ GEKW+ATKW+R E
Sbjct: 264 DLQSLHGGSPVLAGEKWIATKWLRAAE 290
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/213 (37%), Positives = 122/213 (57%), Gaps = 16/213 (7%)
Query: 1 MQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGT 58
MQ++S PRA + F + E+C +I +K R+ S+ + ET + K RTS+G+
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRM--SKSGVVDAETGGTAKSDIRTSTGS 234
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
F+ + +++ +E ++A +MLP H EA VLRYE+ Q+Y +HYD F G
Sbjct: 235 FVGIGAND--LMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYF--FHKGGMA 290
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDS-----GYDYKKC--IGLKVKPRRGDGLL 171
+ R+ + L+YL + E GGET+FP + L+ G ++ +C G R+GD L+
Sbjct: 291 NNRIVTILMYLHEPEFGGETVFP-NTEVPLERAEKGWGKNFSECGNRGRAAVVRKGDALI 349
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
F+S+ P G +D S H CPV++GEKW ATKWI
Sbjct: 350 FWSMKPGGELDPGSSHAGCPVVRGEKWTATKWI 382
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/154 (45%), Positives = 96/154 (62%), Gaps = 6/154 (3%)
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
F+ +DK I++ IE +IA T +P +GE VL Y +G+KY+ HYD F
Sbjct: 2 FLKRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNG 59
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYS 174
QR+A+ L+YLSDVEEGGET+FP F + D +C GL +KP+ GD LLF+S
Sbjct: 60 GQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWS 119
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ P+ T+D +SLHG CPVI G KW +TKW+ +E
Sbjct: 120 MRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 153
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 111/205 (54%), Gaps = 24/205 (11%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
VL P + F S ++C+ +I TA RLK S+L + RTS G F
Sbjct: 25 VLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNK------VVSDIRTSRGMFFE- 77
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
E+++ + IE +IA+ +P H E VL Y GQ+Y +H+D F P + + R+
Sbjct: 78 -EEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPGSPAAR-NNRI 135
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
++ ++YL+DVEEGGET+FP +G+ +KP+RG L F + N ++
Sbjct: 136 STLIVYLNDVEEGGETVFPL---------------LGIAMKPKRGAALYFEYFYRNQALN 180
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH S PV++GEKWVAT+W+R Q
Sbjct: 181 DLTLHSSVPVVRGEKWVATQWMRRQ 205
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 115/222 (51%), Gaps = 24/222 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +S PRA F F + +C +I A ++ S++ GE R+S G +
Sbjct: 68 IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEA--RPDDARSSIGGW 125
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+S +D+ ++ IE + + MLP GE VLRYE GQKYD+H D F+
Sbjct: 126 VSGDDDE--VIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVKNGG 183
Query: 120 QRLASFLLYLSDVEEGGETMFPF----------------ENGIFLDSGYDYKKCIGLKVK 163
QR+A+ L+YLSDVEEGGET+FP +N L S D + L VK
Sbjct: 184 QRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRV---LAVK 240
Query: 164 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
PRRGD LLF++ +G +D + H CPV +G KW T+W R
Sbjct: 241 PRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHR 282
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 96 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP 153
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP G QR
Sbjct: 154 --LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLDVGGQR 211
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 212 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 256
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 257 DDNTLHAGLPVERGEKWIATKWLRER 282
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP 154
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP G QR
Sbjct: 155 --LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQR 212
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 213 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRER 283
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 88 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP 145
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP G QR
Sbjct: 146 --LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQR 203
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 204 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 248
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 249 DDNTLHAGLPVERGEKWIATKWLRER 274
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP 154
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP G QR
Sbjct: 155 --LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQR 212
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 213 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRER 283
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 97 PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP 154
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP G QR
Sbjct: 155 --LVARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQR 212
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 213 VATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRER 283
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 87/207 (42%), Positives = 112/207 (54%), Gaps = 21/207 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 66
PRA + F + +C IA A +L S + GE V S RTS G F ED
Sbjct: 83 PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSA--IRTSDGMFFDRGEDD 140
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEYGPQMSQ 120
+++ +E +I+ T LP +GE VLRY GQKYD+H DAF + A G Q
Sbjct: 141 --VVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGG----Q 194
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 178
R+A+ L+YL+DV++GGET+FP Y C G+ VKPRRGD LLF+S+ +
Sbjct: 195 RVATVLMYLNDVDDGGETVFPETTAKPHVGDERYSACARRGVAVKPRRGDALLFWSM--D 252
Query: 179 GTIDRTSLHGSCPV-IKGEKWVATKWI 204
T R SLHG CPV G KW TKWI
Sbjct: 253 ETFTR-SLHGGCPVGAGGVKWSMTKWI 278
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 100 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP 157
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP G QR
Sbjct: 158 --LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQR 215
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 216 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 260
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 261 DDNTLHAGLPVERGEKWIATKWLRER 286
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 110/212 (51%), Gaps = 33/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTSSGTFIS 61
+PRA+ F N S ++C +IA +K +L LR G T T RTSSGTF
Sbjct: 99 KPRAILFGNVLSHDECDQLIALSKTKL------LRSGVVDHQTGNTKLHEHRTSSGTFFH 152
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGP 116
T + +I+ ++A +P++HGE +L Y++G +Y HYD F P A++
Sbjct: 153 --RGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLA 210
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ QR A+ ++YL+DV+ GGET+FP GL + P +G + F
Sbjct: 211 RGGQRTATLIIYLNDVDGGGETIFPRN---------------GLSIVPAKGSAIYFSYTN 255
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+D S HG PVI+GEKW+ATKW+R E
Sbjct: 256 AENQLDSLSFHGGSPVIEGEKWIATKWVRQNE 287
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 133 bits (334), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 66/140 (47%), Positives = 88/140 (62%), Gaps = 3/140 (2%)
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 128
++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+A+ L+Y
Sbjct: 6 VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 65
Query: 129 LSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 185
LS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+ T D S
Sbjct: 66 LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 125
Query: 186 LHGSCPVIKGEKWVATKWIR 205
LHGSCP I+G+KW ATKWI
Sbjct: 126 LHGSCPAIEGQKWSATKWIH 145
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 133 bits (334), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 66/140 (47%), Positives = 88/140 (62%), Gaps = 3/140 (2%)
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 128
++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+A+ L+Y
Sbjct: 12 VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 71
Query: 129 LSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 185
LS+VE+GGET+FP G L D + C G VKP +GD LLF+SL P+ T D S
Sbjct: 72 LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 131
Query: 186 LHGSCPVIKGEKWVATKWIR 205
LHGSCP I+G+KW ATKWI
Sbjct: 132 LHGSCPAIEGQKWSATKWIH 151
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 132 bits (333), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 95/146 (65%), Gaps = 5/146 (3%)
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
A++D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
+A+ L+YLS+VE+GGET+FP G L D+ + G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 132 bits (333), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 95/146 (65%), Gaps = 5/146 (3%)
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
A++D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
+A+ L+YLS+VE+GGET+FP G L D+ + G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWI 204
T D SLHGSCPVI+G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 132 bits (332), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 114/208 (54%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C+ +I +K ++K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECEELIELSKNKMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 132 bits (332), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 110/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +I + RLK S + + GE E+ RTS G E
Sbjct: 97 PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP 154
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP G QR
Sbjct: 155 --LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQR 212
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V GG T FP +GL+V P +G+ + F P+GT+
Sbjct: 213 VATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 258 DDKTLHAGLPVERGEKWIATKWLRER 283
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 109/205 (53%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F +F S ++C +IA + RLK S + + E+ RTS G E
Sbjct: 96 PRIVLFQHFLSDQECDELIAIGRNRLKRSPV-VNPDTGEENLISARTSQGGMFQVGEHP- 153
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
++ IE +IA+A +P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 154 -LIAKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRV 212
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+D
Sbjct: 213 ATMVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTLD 257
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH PV +GEKW+ATKW+R++
Sbjct: 258 EDTLHAGLPVERGEKWIATKWLRER 282
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 113/208 (54%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKNKMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 118/208 (56%), Gaps = 29/208 (13%)
Query: 1 MQVLSW--RPRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSG 57
+Q++S P + N S E+C+S+I +K +K S++ A R+ + + RTSSG
Sbjct: 34 IQIISRVEEPLIVVLENVLSDEECESLIELSKDSMKRSKIGASREVDNI------RTSSG 87
Query: 58 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 117
TF+ +E + +IE +++ +P HGE ++L+Y GQ+Y +HYD F +
Sbjct: 88 TFLEENE----TVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEHSRAAE 143
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L + P++G + F +
Sbjct: 144 -NNRISTLVMYLNDVEEGGETFFP---------------KLNLSIAPKKGSAVYFEYFYN 187
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PVIKGEKWVAT+W++
Sbjct: 188 DKSLNELTLHGGAPVIKGEKWVATQWMK 215
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 110/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PR + F +F S E+C +IA + RLK S + + GE E+ RTS G E
Sbjct: 91 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP 148
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP G QR
Sbjct: 149 --LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEARQLEVGGQR 206
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+G +
Sbjct: 207 VATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGML 251
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LH PV +GEKW+ATKW+R++
Sbjct: 252 DDNTLHAGLPVERGEKWIATKWLRER 277
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S + + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 113/208 (54%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 109/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ ++ R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 109/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ ++ R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 108/205 (52%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F +F S +C +IA + RLK S + + E+ RTS G E
Sbjct: 96 PRIVLFQHFLSDAECDELIAIGRNRLKRSPV-VNPDTGEENLISARTSQGGMFQVGEHP- 153
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
++ IE +IA+A +P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 154 -LIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRV 212
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ V+ GG T FP +GL+V P +G+ + F P+GT+D
Sbjct: 213 ATMVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFFVYKRPDGTLD 257
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH PV +GEKW+ATKW+R++
Sbjct: 258 EDTLHAGLPVERGEKWIATKWLRER 282
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDKLIELSKNNMKRSKVG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 116/223 (52%), Gaps = 30/223 (13%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PR + NF + +C+ I TA +K S + + G +V T RTS GTFI
Sbjct: 2 IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQNGSSV--TDNIRTSYGTFI 59
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFN------------VLRYEIGQKYDSHYDA 108
D ++E I ++A T P + E VLRY IGQKY +H D+
Sbjct: 60 RRRHDP--VIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDS 117
Query: 109 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY-----DYKKCIGLKV- 162
S R+A+ LLYL D EEGGET FP ++ +L + +C V
Sbjct: 118 LI------DDSPRMATVLLYLHDTEEGGETAFP-DSSSWLTPDLATRMGPFSECAQGHVA 170
Query: 163 -KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+P++GD L+F+S+ P+GT D S+H CPV+KG KW AT W+
Sbjct: 171 FRPKKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWV 213
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 110/208 (52%), Gaps = 25/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+C +IIA A+ R+ S L + E RTS+G F E T
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEINDDRTSNGMFFQRGE--T 163
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
GI+ +E +IAR P HGE VL Y G +Y H+D F P E G P + QR+
Sbjct: 164 GIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRV 223
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL++ E GG T+FP + L+V PRRG+ + F P+ +
Sbjct: 224 GTLVIYLNEPERGGATIFP---------------EVPLQVVPRRGNAVFFSYERPDPST- 267
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
+LHG PV+ GEKW+ATKW+R++E H
Sbjct: 268 -RTLHGGAPVLAGEKWIATKWLREREFH 294
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S + + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 110/208 (52%), Gaps = 25/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+C +IIA A+ R+ S L + E RTS+G F E T
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEINDDRTSNGMFFQRGE--T 163
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
GI+ +E +IAR P HGE VL Y G +Y H+D F P E G P + QR+
Sbjct: 164 GIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRV 223
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL++ E GG T+FP + L+V PRRG+ + F P+ +
Sbjct: 224 GTLVIYLNEPERGGATIFP---------------EVPLQVVPRRGNAVFFSYERPDPST- 267
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
+LHG PV+ GEKW+ATKW+R++E H
Sbjct: 268 -RTLHGGAPVLAGEKWIATKWLREREFH 294
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 114/208 (54%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +++ S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 74/194 (38%), Positives = 112/194 (57%), Gaps = 11/194 (5%)
Query: 19 AEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIA 78
AE+ I+ +++RL+ + + G+ T RTS G F+ ED+ I++ +E +IA
Sbjct: 7 AEEADHIVKVSERRLE--RSGVVGGDGGSETSNIRTSYGVFLDRGEDE--IVKRVEERIA 62
Query: 79 RATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGET 138
T++P +GE VLRY+ QKYD+H+D F + R A+ L+YL D EEGGET
Sbjct: 63 AWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGITNGGNRYATVLMYLVDTEEGGET 122
Query: 139 MFPFENGIFLDSGYD--YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 194
+FP + G + + +C L KP++G +LF+S+ P G ++R SLH +CPVI+
Sbjct: 123 VFP---NVAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSLHTACPVIR 179
Query: 195 GEKWVATKWIRDQE 208
G KW A KWI E
Sbjct: 180 GIKWSAAKWIHHAE 193
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+C +IIA A+ R++ S L + E+ RTS+G F E++
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGENE- 169
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
++ L+E +IAR P +GE VL Y G +Y HYD F P E G P + QR+
Sbjct: 170 -LISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 228
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL++ GG T FP +GL+V PRRG+ + F P+
Sbjct: 229 GTLVMYLNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNRPDPATK 273
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PV++GEKW+ATKW+R++E
Sbjct: 274 --TLHGGAPVLEGEKWIATKWLRERE 297
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 130 bits (326), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDGLIELSKNKIKRSKIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKWVAT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWVATQWVR 211
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 130 bits (326), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ SE
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 130 bits (326), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 113/208 (54%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 109/200 (54%), Gaps = 29/200 (14%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ SE
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 122
Query: 68 GILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASF 125
L L IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++
Sbjct: 123 --LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTL 178
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
++YL+DVEEGGET FP + L V PR+G + F + + +++ +
Sbjct: 179 VMYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELT 223
Query: 186 LHGSCPVIKGEKWVATKWIR 205
LHG PV KGEKW+AT+W+R
Sbjct: 224 LHGGAPVTKGEKWIATQWVR 243
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 114/209 (54%), Gaps = 31/209 (14%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S++ + RTSSG
Sbjct: 62 IQIISKFEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGA 116
Query: 59 FISASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-P 116
F+ SE L L IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 117 FLEDSE-----LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRS 169
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 170 AANNRISTLVMYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFY 214
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 215 QDQSLNELTLHGGAPVTKGEKWIATQWVR 243
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 112/223 (50%), Gaps = 25/223 (11%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA- 62
LS PR NF SAE+C+ II TA L PS + L+QG+ + + T +A
Sbjct: 23 LSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTV-LKQGDQSNGEEKVKDEVRTSETAW 81
Query: 63 -SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 119
+ K I+ I ++ +P ++ E VL+Y Q Y HYD F+P Y + S
Sbjct: 82 LMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPKMYPGRWSSG 141
Query: 120 -QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-----------IGLKVKPRRG 167
RL + YL+ VE+GGET+FPF N S ++ K +KVKP RG
Sbjct: 142 HNRLVTVFFYLTSVEKGGETIFPFGN----TSAEEHHKIQSWGPCENAVESSIKVKPVRG 197
Query: 168 DGLLFYSLFPNG----TIDRTSLHGSCPVIKGEKWVATKWIRD 206
++FY + P+G +D TSLHG C I GEKW A WIR+
Sbjct: 198 SAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWIRN 240
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 79/220 (35%), Positives = 109/220 (49%), Gaps = 24/220 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LSW PRA + NF S E+ + ++ + R+ S + Q V RTS GTFI
Sbjct: 68 IEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSDI---RTSFGTFI 124
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D+ +LE IE + A + +P H E +LRY GQKY H D G +
Sbjct: 125 PKKYDE--VLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISENGG----K 178
Query: 121 RLASFLLYLSDVEEGGETMFPFENGI-------------FLDSGYDYKKCIGLKVKPRRG 167
R+A+ L++L + EGGET F N + F D GY K G VKP+ G
Sbjct: 179 RIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDCGYRSGK--GFAVKPKVG 236
Query: 168 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LF+S G D S+H SCP + G KW AT WI ++
Sbjct: 237 DAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHER 276
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 129 bits (325), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 113/208 (54%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +++ S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 162
Score = 129 bits (325), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 58/87 (66%), Positives = 75/87 (86%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
QVLSWRPR LYFP FA+A+ C++II+ A+ +LKPS+LALR+GET++ST+ RTSSG FIS
Sbjct: 76 QVLSWRPRVLYFPKFATADHCETIISIARSQLKPSRLALRKGETLDSTREIRTSSGMFIS 135
Query: 62 ASEDKTGILELIEHKIARATMLPQTHG 88
A EDKTGIL+ I+ KIARATM+P+ +G
Sbjct: 136 ADEDKTGILDFIDEKIARATMIPRANG 162
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 129 bits (325), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S + + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECGELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 129 bits (325), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------QLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ ++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 129 bits (325), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ ++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 129 bits (324), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 117/211 (55%), Gaps = 15/211 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ ++W+PR + NF + + + +I A ++K S + G++VE RTS GTF+
Sbjct: 1 IEHVAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDN--YRTSYGTFL 58
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+D+ I+E IE+++A T +P H E +LRY +GQ+Y H D E G
Sbjct: 59 KRYQDE--IVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDEEAG----V 112
Query: 121 RLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCIGLKV--KPRRGDGLLFYS 174
R+A+ L+YL++ + GGET FP + G ++ C V P+RGD LLF+S
Sbjct: 113 RVATVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWS 172
Query: 175 LFPNG-TIDRTSLHGSCPVIKGEKWVATKWI 204
+ P+G T D + H CPV+ G KW ATKWI
Sbjct: 173 INPDGNTEDTHASHTGCPVLSGVKWTATKWI 203
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 129 bits (324), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 75/197 (38%), Positives = 116/197 (58%), Gaps = 31/197 (15%)
Query: 1 MQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 51
++V++ PRA + NF + E+C +I+ AK + S++ R T +
Sbjct: 88 LEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKV--RNALTGLGEESS 145
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 111
+RTSSGTFI + DK I++ IE +I+ T +PQ +GE V+ YE+GQK++ H+D F
Sbjct: 146 SRTSSGTFIRSGHDK--IVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201
Query: 112 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 171
QR+A+ L+YLSDV++GGET+FP GI K G+ V+P++GD LL
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFPEAKGI--------KSKKGVSVRPKKGDALL 245
Query: 172 FYSLFPNGTIDRTSLHG 188
F+S+ P+G+ D +S HG
Sbjct: 246 FWSMRPDGSRDPSSKHG 262
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 129 bits (324), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ ++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ ++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 110/203 (54%), Gaps = 21/203 (10%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 65
+P L+ F S E+C +I +++RLKPS + + GE E RTS G E+
Sbjct: 108 KPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGE--EKAATGRTSKGMSFYLQEN 165
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLAS 124
+ ++ +E +IA P +GE VL Y IG++Y SH+D F ++ P+ QR+ +
Sbjct: 166 E--FIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGT 223
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL+YL+DV GGET+FP G+ + P++G + F G +DR
Sbjct: 224 FLIYLNDVPAGGETVFP---------------KAGVSIVPKKGSAVYFQYGNSKGEVDRM 268
Query: 185 SLHGSCPVIKGEKWVATKWIRDQ 207
SLH S PV +GEKWVATKWIR +
Sbjct: 269 SLHSSIPVSEGEKWVATKWIRQE 291
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 110/206 (53%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+C +IIA A+ R++ S L + E+ RTS+G F E+
Sbjct: 119 PRVVVFGNLLSDEECDAIIAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND- 176
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
++ L+E +IAR P +GE VL Y G +Y HYD F P E G P + QR+
Sbjct: 177 -LISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 235
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL++ GG T FP +GL++ PRRG+ + F P+
Sbjct: 236 GTLVMYLNEPARGGATTFP---------------DVGLQIVPRRGNAVFFSYNRPDPATK 280
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PV++GEKW+ATKW+R++E
Sbjct: 281 --TLHGGAPVLEGEKWIATKWLRERE 304
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +K S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ ++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 80/209 (38%), Positives = 118/209 (56%), Gaps = 20/209 (9%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
+SW PRA NFAS E+ +I A+ +L+ S + +GE+V RTS G FI
Sbjct: 35 VSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSRGESV--VDNYRTSYGMFIRRH 92
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
D+ ++ +E ++A T TH E VLRY Q+Y +H+D+ + S R A
Sbjct: 93 HDE--VVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLD------DDSPRTA 144
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGY-----DYKKCI--GLKVKPRRGDGLLFYSLF 176
+ L+YLSDVE GGET FP N ++D + +C + +KP+RGD ++F+SL
Sbjct: 145 TVLIYLSDVESGGETTFP--NSEWIDPALPKALGPFSECAQGHVAMKPKRGDAIVFHSLN 202
Query: 177 PNG-TIDRTSLHGSCPVIKGEKWVATKWI 204
P+G + D+ +LH +CPVI G K+VA WI
Sbjct: 203 PDGRSHDQHALHTACPVIVGVKYVAIFWI 231
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +I+ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 129 bits (323), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 108/212 (50%), Gaps = 31/212 (14%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 62
LS +PR P F + E+C +I T+K +L+P E + G R+ G F+
Sbjct: 28 LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 119
E++ + + I +K+ + + E ++RY G++ +HYD FNP M
Sbjct: 79 GEEEHPVTKNIFNKMKNFVNISDS-CEVMQIIRYNPGEETSAHYDYFNPLTTNGSMKIGL 137
Query: 120 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
QR+ + L+YL DVEEGGET FP +G+KVKP RGD +LFY+ P
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFPE---------------VGIKVKPIRGDAVLFYNCKP 182
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG +D SLH PV KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVTKGTKWVAIKLINQKSK 214
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 129 bits (323), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTS G
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +++ S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ ++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQLLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S + + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/220 (35%), Positives = 117/220 (53%), Gaps = 31/220 (14%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PRA + F + +C +I A +L+ S + + ++ + + ++S +
Sbjct: 59 VETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDDIRTSFSASIMY- 117
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-S 119
+T I+ IE +IAR T VLRY GQKYD+H+D F+ E S
Sbjct: 118 ----GETSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGS 162
Query: 120 QRLASFLLYLSDVE--EGGETMFPFENGIFLD------SGYDYKKC---IGLKVKPRRGD 168
R+A+ L+YLSDV+ GGET P LD G Y +C +G+ ++PR+GD
Sbjct: 163 NRMATVLMYLSDVDPAAGGETALPLAEP--LDPHKQSVDGQGYSQCAARMGISIRPRKGD 220
Query: 169 GLLFYSLFPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQ 207
LLF+ + P G I DR +LH SCP G KW ATKWI ++
Sbjct: 221 VLLFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNK 260
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S + + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +I+ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/197 (37%), Positives = 117/197 (59%), Gaps = 31/197 (15%)
Query: 1 MQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 51
++V++ PRA + NF + E+C+ +I+ AK + S++ R T +
Sbjct: 55 LEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKV--RNAITGLGEESS 112
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 111
+RTSSGTF+ DK I++ IE +I+ T +P+ +GEA V+ YE+GQK++ H+D F
Sbjct: 113 SRTSSGTFLRKGHDK--IVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168
Query: 112 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 171
QR+A+ L+YLSDV++GGET+FP GI K G+ V+P++GD LL
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFPEAKGI--------KSKKGVSVRPKKGDALL 212
Query: 172 FYSLFPNGTIDRTSLHG 188
F+S+ P+G+ D +S HG
Sbjct: 213 FWSMRPDGSQDPSSKHG 229
>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
Length = 1036
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/121 (51%), Positives = 82/121 (67%), Gaps = 4/121 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q LSW PR Y PNFA+ +QC+++I AK +LKPS LALR+ E+
Sbjct: 798 QGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRK----ETKHFQMQYRSLHQH 853
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
ED++G+L IE KIA AT P+ + E+FN+LRY++GQKYDSHYDAF+ AEYGP +SQR
Sbjct: 854 TDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQR 913
Query: 122 L 122
+
Sbjct: 914 V 914
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S + + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG V KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGASVTKGEKWIATQWVR 211
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 118/214 (55%), Gaps = 15/214 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTF 59
++ LSW PRA N + ++I+A A+ R+ + + + G++V RTS TF
Sbjct: 9 VEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSV--VNPIRTSKQTF 66
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM 118
+S ++ ++ + +++ T LP H E VL Y G+KYD+H D + G Q+
Sbjct: 67 LSRNDP---VVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGEEGTKSGDQL 123
Query: 119 SQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYD--YKKCIGLKV--KPRRGDGL 170
S+ R+A+ LLYL + EEGGET FP I + + KC +V KP RGDGL
Sbjct: 124 SKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVAMKPTRGDGL 183
Query: 171 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+F+S+ P+GTID +LH CP +G KW AT W+
Sbjct: 184 MFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWV 217
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/137 (48%), Positives = 87/137 (63%), Gaps = 4/137 (2%)
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 128
I+ IE +IA T +P +GE VL Y +GQK++ H+D + R A+FL+Y
Sbjct: 10 IVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGPRKATFLMY 69
Query: 129 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
LSDVEEGGET+FP N S K G+ VKP+ GD LLF+S+ P+G++D SLHG
Sbjct: 70 LSDVEEGGETVFP--NATAKGSAPSAKS--GISVKPKMGDALLFWSMKPDGSLDPKSLHG 125
Query: 189 SCPVIKGEKWVATKWIR 205
+ PVIKG+KW ATKWI
Sbjct: 126 ASPVIKGDKWSATKWIH 142
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTS G
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+ T+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWITTQWVR 227
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/214 (36%), Positives = 112/214 (52%), Gaps = 17/214 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+ V+SW+PRA NF + +C I A+ ++ S + G +V RTS GTFI
Sbjct: 1 VSVISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADNGSSV--LDDYRTSYGTFI 58
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ + T ++ +E ++A T P + E VLRY +GQ Y H D+ S
Sbjct: 59 NRYQ--TPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLE------NDSP 110
Query: 121 RLASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYS 174
R+A+ LLYLS+ E GGET FP + + + C+ + KPRRGD LLF+S
Sbjct: 111 RMATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWS 170
Query: 175 LFPNG-TIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+ P+G T D S H CPVI+G KW AT W+ Q
Sbjct: 171 VKPDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQ 204
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTS G
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +++ S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKWVAT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWVATQWVR 211
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVIYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVISDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/207 (39%), Positives = 112/207 (54%), Gaps = 27/207 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 66
PR + F N SAE+C+ +IA A+ RL S + R G V + RTS G F E++
Sbjct: 93 PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVD--RTSDGMFFERGENE 150
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 121
I+ +E +IA P GE +LRY G +Y HYD F+P+E G P + QR
Sbjct: 151 --IVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDYFDPSEPGTPTILKRGGQR 208
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL + E GG T FP +GL+V P RG G+ F P+ +
Sbjct: 209 VATLVMYLQEPEGGGATTFP---------------DVGLEVAPARGCGVFFSYDRPD-PV 252
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
RT LHG PV+ GEKWVATKW+R++E
Sbjct: 253 TRT-LHGGAPVLAGEKWVATKWLRERE 278
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQGQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K ++K S + + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG V KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGASVTKGEKWIATQWVR 211
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 109/212 (51%), Gaps = 31/212 (14%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 62
LS PR P F + E+C+ +I T+K +L+P E + G R+ G F+
Sbjct: 28 LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 119
E+ I + I +K+ + ++ E V+RY G++ SH+D FNP M
Sbjct: 79 GEEDHQITKNIFNKMKSFVNISES-CEVMQVIRYNQGEETSSHFDYFNPLTTNGSMKIGL 137
Query: 120 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
QR+ + L+YL DVEEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFPE---------------VGIKVKPIKGDAVLFYNCKP 182
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG +D SLH PV+KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVLKGNKWVAIKLINQKSK 214
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +++ S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKWVAT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWVATQWMR 211
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/201 (35%), Positives = 108/201 (53%), Gaps = 19/201 (9%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P L+ S+E+C +I+ ++ RL+PS L + +G E RTS E++
Sbjct: 95 KPFVLHLDQVLSSEECDELISLSRSRLQPS-LVVDRGSGEERAGSGRTSKSMAFRLKENE 153
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQMSQRLASF 125
++E IE +IA T P +GE +L Y +G++Y H+D F P + QR+ +F
Sbjct: 154 --LVERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKGGQRVGTF 211
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
L+YL+DVE+GGET+F GL P++G + F+ G +DR S
Sbjct: 212 LIYLNDVEDGGETVF---------------SKAGLSFVPKKGAAIYFHYGNAQGQLDRLS 256
Query: 186 LHGSCPVIKGEKWVATKWIRD 206
+H S PV KGEKW ATKWIR+
Sbjct: 257 VHSSVPVRKGEKWAATKWIRE 277
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + ++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEISKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/216 (36%), Positives = 119/216 (55%), Gaps = 25/216 (11%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 62
+SWRPRA + F S +C +I+ AK+ K + + GE+ ES T RTSSG F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103
Query: 63 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 105
+D+ ++ IE +IA TMLP +GE+ +LRY G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161
Query: 106 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVK 163
+D + + + R+A+ L+YLS+V+ G +++ P + + + C G VK
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMG-DSLLP-QARLSQPKDETWSDCAEQGFAVK 219
Query: 164 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 199
P +G +LF+SL PN T+D SLHGSCPVI+GEK V
Sbjct: 220 PAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKVV 255
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 115/210 (54%), Gaps = 24/210 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VL P + F N S E+CQ++I A RL+ S+LA ++ ++ RTSSG F
Sbjct: 24 EVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEISSI------RTSSGMFFE 77
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
E++ ++ IE +I+ LP H E VL YE GQ++ +H+D F P + + R
Sbjct: 78 --ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGP-NHPSSSNNR 134
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+++ ++YL+DVEEGG T FP +G+ P++G + F + + +
Sbjct: 135 ISTLVVYLNDVEEGGVTTFP---------------NLGIVNVPKKGTAVYFEYFYNDQKL 179
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
+ +LH PVI+GEKWVAT+W+R ++ E
Sbjct: 180 NELTLHSGEPVIQGEKWVATQWMRKKQIRE 209
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 112/207 (54%), Gaps = 27/207 (13%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 65
+PR + F N S E+C +II A+ R+ S +A R G E RTS+G F E+
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 178
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 120
++ +E +IAR P +GE VL Y G +Y HYD F+PAE G P + Q
Sbjct: 179 P--VVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQ 236
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ ++YL+D E+GG T FP + L+V PRRG+ + F P+ +
Sbjct: 237 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 281
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+LHG PV+ G+KW+ATKW+R++
Sbjct: 282 T--RTLHGGAPVVAGDKWIATKWLRER 306
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVIYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWVR 227
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DRSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ + YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVXYLNDVEEGGETFFP---------------KLNLSVHPRKGXAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDDE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWMR 211
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 112/207 (54%), Gaps = 27/207 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 66
PR + F N S E+C+ +IA A+ RL S + R G V + RTS G F E++
Sbjct: 93 PRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLNVD--RTSDGMFFERGENE 150
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 121
I+ +E ++A P +GE +LRY G +Y HYD F+P E G P + QR
Sbjct: 151 --IVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPNEPGTPTILKRGGQR 208
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL + E+GG T FP +GL+V P RG G+ F P+ +
Sbjct: 209 VATLVMYLQEPEQGGATTFP---------------DVGLEVAPVRGTGVFFSYDRPD-PV 252
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
RT LHG PV+ GEKWVATKW+R++E
Sbjct: 253 TRT-LHGGAPVLAGEKWVATKWLRERE 278
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/207 (38%), Positives = 112/207 (54%), Gaps = 27/207 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 66
PR + F N SAE+C+ +IA A+ RL S + R G V + RTS G F E++
Sbjct: 93 PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVD--RTSDGMFFERGENE 150
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 121
I+ +E ++A P +GE +LRY G +Y HYD F+P E G P + QR
Sbjct: 151 --IVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQR 208
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL + E GG T FP +GL+V P RG G+ F P+ +
Sbjct: 209 VATLVMYLQEPEGGGATTFP---------------DVGLEVAPVRGCGVFFSYDRPD-PV 252
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
RT LHG PV+ GEKWVATKW+R++E
Sbjct: 253 TRT-LHGGAPVLAGEKWVATKWLRERE 278
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWMR 227
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 126 bits (317), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + RTSSG F+ +
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFL----EDN 105
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 106 KLTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWVR 227
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 126 bits (317), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 109/206 (52%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+CQ++I A+ R+ S L ++ E RTS G F E++
Sbjct: 97 PRVVLFGNLLSPEECQAVIEAARTRMARS-LTVQAASGGEEVNKDRTSDGMFFQRGENEA 155
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
+ +E +IAR P +GE VL Y G +Y HYD F+PAE G P++ QR+
Sbjct: 156 --VARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPRLLRRGGQRV 213
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+D GG T FP + L++ PR+G+ + F +
Sbjct: 214 ATLVIYLNDPVRGGGTTFP---------------DVPLEIGPRQGNAVFFS--YGRAHPS 256
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PVI+GEKW+ATKW+R++E
Sbjct: 257 SRTLHGGAPVIEGEKWIATKWLRERE 282
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 126 bits (317), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 52 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 103
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 104 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 160
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 161 MYLNDVEEGGETFFP---------------KLNLSVNPRKGMAVYFEYFYQDQSLNELTL 205
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 206 HGGAPVTKGEKWIATQWVR 224
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 126 bits (317), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +++ S++ + + RTSSG F+ +E
Sbjct: 77 PLIVVLANVLSDEECDELIELSKNKMERSKIG-----SSRNVNDIRTSSGAFLEENE--- 128
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
IE +I+ T +P HGE ++L Y + Q+Y +HYD F AE+ + R+++ +
Sbjct: 129 -FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYF--AEHSRSAANNRISTLV 185
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 186 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 230
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 231 HGGAPVTKGEKWIATQWMR 249
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 126 bits (316), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFHQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWVR 211
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 126 bits (316), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+C +IIA A+ R++ S L + E+ RTS+G F E+
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAARPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND- 169
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
++ +E +IAR P +GE VL Y G +Y HYD F P E G P + QR+
Sbjct: 170 -LISRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 228
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL++ GG T FP +GL+V PRRG+ + F P
Sbjct: 229 GTLVMYLNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNRPEPATK 273
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PV++GEKW+ATKW+R++E
Sbjct: 274 --TLHGGAPVLEGEKWIATKWLRERE 297
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECGELIELSKNKLARSKVG-----SSRDVNDIRTSSGA 84
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 85 FLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 138
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 139 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 184 DQSLNELTLHGGAPVTKGEKWIATQWMR 211
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 66
P F N SA +C+++IA A+ RL S + +R G E RTS G F + E++
Sbjct: 90 PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGG--EELNHDRTSHGMFYTRGENE 147
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +IAR P +GE VLRY G +Y HYD F+P E G QR
Sbjct: 148 --VIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAAILRRGGQR 205
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF-YSLFPNGT 180
+AS ++YL + EGG T+FP IGLKV+P++G + F Y+L +
Sbjct: 206 VASLIMYLREPGEGGATVFP---------------DIGLKVRPQQGSAVFFSYALAHPAS 250
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ +LHG PV GEKW+ATKW+R++E
Sbjct: 251 L---TLHGGEPVKSGEKWIATKWLRERE 275
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 113/210 (53%), Gaps = 24/210 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VL P + F N S E+CQ++I A RL+ S+LA ++ ++ RTSSG F
Sbjct: 24 EVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEISSI------RTSSGMFFE 77
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
E++ ++ IE +I+ LP H E VL YE GQ++ H+D F P + + R
Sbjct: 78 --ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGP-NHPSSSNNR 134
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ + ++YL+DVEEGG T FP +G+ P++G + F + + +
Sbjct: 135 ICTLVVYLNDVEEGGVTTFP---------------NLGIVNVPKKGTAVYFEYFYNDQKL 179
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
+ +LH PVI+GEKWVAT+W+R ++ E
Sbjct: 180 NELTLHSGEPVIQGEKWVATQWMRKKQIRE 209
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 125 bits (315), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 111/208 (53%), Gaps = 29/208 (13%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+Q++S P + N S E+C +I +K +L S++ + RTS G
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGA 100
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQ 117
F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+
Sbjct: 101 FLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSA 154
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+++ ++YL+DVEEGGET FP + L V PR+G + F +
Sbjct: 155 ANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQ 199
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+ +++ +LHG PV KGEKW+AT+W+R
Sbjct: 200 DQSLNELTLHGGAPVTKGEKWIATQWVR 227
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 125 bits (315), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 125 bits (315), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +L S++ + RTS G F+ +E T
Sbjct: 39 PLIVVLGNVLSDEECGELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNELTT 93
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
I E +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 94 KI----EKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 125 bits (315), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 108/210 (51%), Gaps = 31/210 (14%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 62
+S PR P+F S +C+ +I +K +L+P E + G R+ G F+
Sbjct: 29 MSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCN---------EISSGVHRSGWGLFMKE 79
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 119
E+ +++ I ++ L + + E V+RY G++ +HYD FNP M
Sbjct: 80 GEEDHDVVKKIFQRMKMLVNLTE-NCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKIGL 138
Query: 120 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
QR+ + L+YLS+VEEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 139 YGQRVCTILMYLSEVEEGGETSFPE---------------VGVKVKPVKGDAVLFYNCKP 183
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
NG +D SLH PVIKG KWVA K I +
Sbjct: 184 NGEVDPLSLHQGDPVIKGTKWVAIKLINQK 213
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 125 bits (315), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 126
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP + L V PR+G + F + + +++ +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 109/206 (52%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + N SAE+C +II +AK +L S L ++ E RTSSG F + +T
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARS-LTVQTATGGEELNADRTSSGMFFT--RGQT 165
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
+ +E +IAR P +GE VL Y G +Y HYD F+P E G P + QR+
Sbjct: 166 PEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRV 225
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL++ GG T FP +GL+V P +G + F P+ T
Sbjct: 226 ATLVMYLNEPARGGGTTFP---------------DVGLEVAPVKGSAVFFSYDRPHPTTR 270
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
SLHG PV++GEKWVATKW+R++E
Sbjct: 271 --SLHGGAPVLEGEKWVATKWLRERE 294
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 109/210 (51%), Gaps = 31/210 (14%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 62
+S +PR P F + E+C+ +I T+K +LKP E + G R+ G F+
Sbjct: 60 VSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCN---------EISSGVHRSGWGLFMKE 110
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 119
E+ + + I +++ L ++ E V+RY G++ +H+D FNP M
Sbjct: 111 GEEDHPVTQNIFNRMKTFVNLTES-SEVMQVIRYNPGEETSAHFDYFNPLTTNGAMKIGL 169
Query: 120 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
QR+ + L+YL+DVEEGGET FP N +KVKP +GD +LFY+ P
Sbjct: 170 YGQRICTILMYLADVEEGGETSFPEVN---------------VKVKPIKGDAVLFYNCKP 214
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
NG +D SLH PVIKG KW+A K + +
Sbjct: 215 NGEVDPLSLHQGDPVIKGTKWIAIKLVNQK 244
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 106/210 (50%), Gaps = 26/210 (12%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
VL P + F + ++C+ +I A RL+ S+L + RTS G F
Sbjct: 25 VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNK------VVSEIRTSRGMFFE- 77
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ-R 121
E++ + IE +I+ +P H E VL Y GQ+Y +HYD F P P S R
Sbjct: 78 -EEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN--SPSASNNR 134
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+++ ++YL+DVE GGET+FP + L+VKP RG L F + +
Sbjct: 135 ISTLIIYLNDVEAGGETVFPL---------------LDLEVKPERGSALYFEYFYRQQEL 179
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
+ +LH S PV++GEKWVAT+W+R Q E
Sbjct: 180 NNLTLHSSVPVVRGEKWVATQWMRRQRVRE 209
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 111/207 (53%), Gaps = 27/207 (13%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 65
+PR + F N S E+C +II A+ R+ S +A R G E RTS+G F E+
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 167
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 120
++ +E +IAR P +GE VL Y G +Y HYD F+P E G P + Q
Sbjct: 168 P--MVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQ 225
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ ++YL+D E+GG T FP + L+V PRRG+ + F P+ +
Sbjct: 226 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 270
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+LHG PV+ G+KW+ATKW+R++
Sbjct: 271 T--RTLHGGAPVVAGDKWIATKWLRER 295
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 102/205 (49%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR L N +C +++A A+ RL+ S + + E+ RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+L+ IE +IA T P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ GG T FP IGL+V P +G+ +LF P+G +D
Sbjct: 218 ATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGALD 262
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH PV GEKW+ATKW+R+
Sbjct: 263 ERTLHAGLPVEAGEKWIATKWLREH 287
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 102/205 (49%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR L N +C +++A A+ RL+ S + + E+ RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+L+ IE +IA T P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ GG T FP IGL+V P +G+ +LF P+G +D
Sbjct: 218 ATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGALD 262
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH PV GEKW+ATKW+R+
Sbjct: 263 ERTLHAGLPVEAGEKWIATKWLREH 287
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/217 (35%), Positives = 114/217 (52%), Gaps = 27/217 (12%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
LSW PRA + NF + ++C+ +IA +K+L+ S + +G+ + RTS GTFI+
Sbjct: 93 TLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGD-VHSARTSFGTFIT- 150
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
T L +E ++A + +P H E +LRYE GQ+Y + +R+
Sbjct: 151 -RRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGNG-------------EKRI 196
Query: 123 ASFLLYLSDVEEGGETMFPFENGI------FLDSGYDYKKC-----IGLKVKPRRGDGLL 171
A+ L++L + E GGET FP + FL S C G V PR+GD +L
Sbjct: 197 ATVLMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEGRGFSVIPRKGDAIL 256
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
F+S NGT D + H SCP ++G K+ ATKWI ++E
Sbjct: 257 FFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKE 293
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 86/148 (58%), Gaps = 10/148 (6%)
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 125
K I+ IE KIA T LP+ +GE VLRYE G+KYD H+D F + R+A+
Sbjct: 7 KDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGHRVATV 66
Query: 126 LLYLSDVEEGGETMFP---------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
L+YL+DV +GGET+FP + I D+ D K G VKP+RGD LLF+SL
Sbjct: 67 LMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAK-RGTAVKPKRGDALLFFSLT 125
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
D SLH CPVI+GEKW TKWI
Sbjct: 126 TQAKPDTRSLHAGCPVIEGEKWSVTKWI 153
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+C +IIA A R++ S L + E+ RTS+G F E+
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND- 169
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
++ +E +IAR P +GE VL Y G +Y HYD F P E G P + QR+
Sbjct: 170 -LICRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 228
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL++ GG T FP +GL+V PRRG+ + F P+
Sbjct: 229 GTLVMYLNEPARGGATTFPD---------------VGLQVVPRRGNAVFFSYNRPDPATK 273
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PV++GEKW+ATKW+R++E
Sbjct: 274 --TLHGGAPVLEGEKWIATKWLRERE 297
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 108/205 (52%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+ F S ++C +++A ++ RL S + + E+ RTS G +E
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPV-VNPDTGDENLIDARTSMGAMFQVAEH-- 160
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP--QMS---QRL 122
++ IE +IA T +P HGE +L Y+ G +Y H+D FNP G Q+S QR+
Sbjct: 161 ALIARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRI 220
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ E GG T FP +GL+V P +G+ + F L P+GT+D
Sbjct: 221 ATLVIYLNTPEAGGATAFP---------------RVGLEVAPVKGNAVYFSYLLPDGTLD 265
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH PV GEKW+ATKW+R++
Sbjct: 266 DRTLHAGLPVAAGEKWIATKWLRER 290
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 79/209 (37%), Positives = 110/209 (52%), Gaps = 15/209 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+ V+SW PRA NF + ++ I A+ ++ S + G +V RTS GTFI
Sbjct: 1 VSVISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSV--LDDYRTSYGTFI 58
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ T ++ +E ++A T +P + E VLRY GQ Y H D+ S
Sbjct: 59 NRY--ATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLE------NDSP 110
Query: 121 RLASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKCIGLKV--KPRRGDGLLFYSLF 176
RLA+ LLYLSD E GGET FP + + + +C+ V KPR+GD LLF+S+
Sbjct: 111 RLATVLLYLSDPELGGETAFPLAWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFWSVK 170
Query: 177 PNG-TIDRTSLHGSCPVIKGEKWVATKWI 204
P+G T D S H CPVI+G KW AT W+
Sbjct: 171 PDGKTEDPLSEHEGCPVIRGVKWTATVWV 199
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/207 (37%), Positives = 107/207 (51%), Gaps = 27/207 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 66
PR + F N S E+C+ +IA A+ RL S + R G V + RTS G F E+
Sbjct: 92 PRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVD--RTSEGMFFERGEND 149
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 121
I+ +E +IA P GE +LRY G +Y HYD F+P E G P + QR
Sbjct: 150 --IVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQR 207
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL + +GG T FP +GL+V P RG G+ F P+
Sbjct: 208 VATLVMYLQEPGQGGATTFP---------------DVGLEVAPVRGTGVFFSYEEPDPAT 252
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PV+ GEKWVATKW+R++E
Sbjct: 253 --RTLHGGAPVLAGEKWVATKWLRERE 277
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 108/205 (52%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+ F + ++C +++A ++ RL S + + E+ RTS G +E
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDTGDENLIDARTSMGAMFQVAEH-- 161
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP--QMS---QRL 122
++ IE +IA T +P HGE +L Y+ G +Y H+D FNP G Q+S QR+
Sbjct: 162 ALIARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRI 221
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ E GG T FP +GL+V P +G+ + F L P+GT+D
Sbjct: 222 ATLVIYLNTPEAGGATAFP---------------RVGLEVAPVKGNAVYFSYLLPDGTLD 266
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH PV GEKW+ATKW+R++
Sbjct: 267 ERTLHAGLPVASGEKWIATKWLRER 291
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 123 bits (308), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/216 (35%), Positives = 108/216 (50%), Gaps = 17/216 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS P F ++ I+A + + LKPS + L G + RTS+ F+
Sbjct: 108 LETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTTYFL 167
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY---GPQ 117
S+S K L+ I+ ++A T +P H E VLRYE QKYD H D F P E+ P
Sbjct: 168 SSS--KHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYF-PVEHHKNSPH 224
Query: 118 M--------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-IGLKVKPRRGD 168
+ R+ + Y+SDV +GG T+FP G K C GLKV P++
Sbjct: 225 VLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGA--PRPQSMKDCSTGLKVSPKKRK 282
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
++FYS+ PNG D SLHG CPV G K+ KW+
Sbjct: 283 VIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 110/207 (53%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F N S E+C ++IA A+ RL S L + E RTS G F +
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARS-LTVATKTGGEEINDDRTSDGMFFQ--RGQ 170
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 121
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 171 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQR 230
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ + ++YL+ E+GG T FP + L+V P+RG+ + F P+ +
Sbjct: 231 VGTLVMYLNTPEKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 275
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PVI GEKW+ATKW+R++E
Sbjct: 276 --RTLHGGAPVIAGEKWIATKWLRERE 300
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 122 bits (307), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR F ++C+++IA ++ RL S + + E+ RTS G E
Sbjct: 127 PRIALFQRLLMPDECEALIALSRGRLARSPV-VNPDTGDENLIDARTSMGAMFQVGEHP- 184
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
++E +E +IA T +P HGE +L Y+ G +Y HYD FNP G QR+
Sbjct: 185 -LIERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFNPQRPGEARQLRVGGQRM 243
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+DV GG T FP +GL+V P +G+ + F L +G++D
Sbjct: 244 ATLVIYLNDVPAGGATAFP---------------KLGLRVNPVQGNAVFFAYLGEDGSLD 288
Query: 183 RTSLHGSCPVIKGEKWVATKWIRD 206
+LH PV +GEKW+ATKW+R+
Sbjct: 289 ERTLHAGLPVEQGEKWIATKWLRE 312
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 113/218 (51%), Gaps = 19/218 (8%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LSW PR NF S E+C+ +I +K+L+ S + ST RTS GTF+
Sbjct: 36 VETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNSDESGAVST--ARTSFGTFV 93
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ +T L+ +E ++A+ + +P H E +LRY GQ+Y +H+D G +
Sbjct: 94 TRRLTET--LQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISENGG----K 147
Query: 121 RLASFLLYLSDVEEGGETMFP------FENGIFLDSGYDYKKC-----IGLKVKPRRGDG 169
R+A+ L++L + GGET FP FL + +C G V P++G+
Sbjct: 148 RIATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDKLSECGWNDGNGFSVIPKKGEA 207
Query: 170 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+LF+S NGT D + H SCP + G K+ ATKWI +
Sbjct: 208 VLFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHEN 245
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 105/204 (51%), Gaps = 23/204 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP + F S E+C+ +I ++++L PS + Q + R+S GT+ E
Sbjct: 95 RPDIVVVDEFMSGEECEQLIEQSRRKLTPSAIVDPQTGKFQ-VIADRSSEGTYFQRGE-- 151
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 121
+ ++ ++ +I+ P+ HGE +L Y +G +Y H+D F E G Q QR
Sbjct: 152 SPLISRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQR 211
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL++V EGGET+FP +G+ + P+RG F G +
Sbjct: 212 VATLVMYLNEVTEGGETVFPD---------------VGISITPKRGSAAYFAYCNSLGQV 256
Query: 182 DRTSLHGSCPVIKGEKWVATKWIR 205
D +LHG PV+ GEKW+ATKW+R
Sbjct: 257 DPATLHGGAPVLTGEKWIATKWMR 280
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 122 bits (306), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 108/198 (54%), Gaps = 24/198 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K R+ S++A + RTSS TFI E++
Sbjct: 39 PLIVVLGNVLSDEECDELIRLSKDRINRSKIA------NANVDNMRTSSSTFIE--ENEN 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
I+ IE +I++ +P +GE +L Y++GQ+Y SH+D F+ + + + R+++ ++
Sbjct: 91 IIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFS-SPHNAINNPRISTLVM 149
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YLSDVE+GGET FP + V P++G + F + + T++ +LH
Sbjct: 150 YLSDVEQGGETYFP---------------KLHFSVSPQKGMAVYFEYFYNDQTLNELTLH 194
Query: 188 GSCPVIKGEKWVATKWIR 205
G PVI G+KW AT+W+R
Sbjct: 195 GGAPVIVGDKWAATQWMR 212
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 122 bits (306), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + F N S +C++++ A+ RL S L + E RTS G F + E+
Sbjct: 90 PDLVVFGNLLSDSECEALMEVAQPRLARS-LTVNIKTGGEERNRDRTSQGMFFARGENP- 147
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
+++ +E +IAR P GE VLRY G +Y HYD F+PAE G P + QR+
Sbjct: 148 -LVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRV 206
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL++ E+GG T+FP IGL+V PRRG + F +P
Sbjct: 207 ATLIMYLNEPEQGGATVFP---------------DIGLQVTPRRGTAVFFS--YPAANPA 249
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+ HG PV GEKW+ATKW+R++E
Sbjct: 250 SLTRHGGEPVKAGEKWIATKWLRERE 275
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 109/207 (52%), Gaps = 25/207 (12%)
Query: 6 WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 65
+ PR + F + S ++C+ +I AK RL S L + E RTSSG F E+
Sbjct: 97 YNPRVVVFGSLLSDQECEQLIGLAKPRLARS-LTVATKTGGEEVNEDRTSSGMFFQRGEN 155
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 120
+ ++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + Q
Sbjct: 156 E--LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQ 213
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + ++YL + E+GG T FP + L+V P+RG G+ F P+ +
Sbjct: 214 RVGTLVMYLGEPEKGGGTTFP---------------DVHLEVAPKRGHGVFFSYERPHPS 258
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+LHG PV+ GEKW+ATKW+R++
Sbjct: 259 T--RTLHGGAPVLAGEKWIATKWLRER 283
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 108/200 (54%), Gaps = 24/200 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I A ++K S++ G T E + RTSS FI +D+
Sbjct: 33 PLIVVLGNVLSDEECDELIQLAGDKVKRSKI----GTTREENE-LRTSSSMFIE--DDEN 85
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
I+ ++ +I+ +P HGE +LRY GQ+Y +H+D F+ + R+++ ++
Sbjct: 86 LIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSD--SKITNNRISTLVM 143
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVE+GGET FP + V PR+G + F + + T++ +LH
Sbjct: 144 YLNDVEQGGETFFPH---------------LKFSVSPRKGMAVYFEYFYSDQTLNDFTLH 188
Query: 188 GSCPVIKGEKWVATKWIRDQ 207
G PV++GEKWVAT+W+R Q
Sbjct: 189 GGAPVVEGEKWVATQWMRKQ 208
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+CQ+II A+ R+ S L ++ E RTS G F E T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+++ +E +IAR P +GE VL Y G +Y HYD F+P + G QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL++ +GG T FP + L+V PR+G+ + F P+ +
Sbjct: 219 ATLVIYLNNPRKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG VI+GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 27/198 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C ++I +K +LK S++ + E RTSS TF+ E +
Sbjct: 37 PLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNEN-----DMRTSSSTFMEEGESE- 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
++ +E +I++ +P +GE +L Y+IGQ+Y +H+D F A + R+++ ++
Sbjct: 91 -VVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNAS-----NPRISTLVM 144
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVEEGGET FP + V P++G + F + N ++ +LH
Sbjct: 145 YLNDVEEGGETYFP---------------KLNFSVSPQKGMAVYFEYFYDNQELNDLTLH 189
Query: 188 GSCPVIKGEKWVATKWIR 205
G PVI G+KW AT+W+R
Sbjct: 190 GGAPVIIGDKWAATQWMR 207
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/208 (35%), Positives = 109/208 (52%), Gaps = 27/208 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 65
+PR + F N SAE+C ++IA A R+ S +A + G E RTS G F E+
Sbjct: 111 KPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGG--EEVNDDRTSDGMFFQRGEN 168
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 120
+++ IE +IAR P +GE VL Y G +Y HYD F+P E G P + Q
Sbjct: 169 P--VVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDPGEPGTPTILKRGGQ 226
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + ++YL+ E+GG T FP + ++V P+RG+ + F +
Sbjct: 227 RVGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQRGNAVFFS--YERAH 269
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PVI GEKW+ATKW+R++E
Sbjct: 270 PATRTLHGGAPVIAGEKWIATKWLRERE 297
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/211 (34%), Positives = 117/211 (55%), Gaps = 15/211 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ ++W+PR + NF + + + +I A ++K S + G++VE + T ++G +
Sbjct: 1 IEHVAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDSYRTLYTAG--V 58
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+D ++E IE+++A T + H E +LRY IGQ+Y H D E G
Sbjct: 59 RRYQDD--VVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDDEAG----V 112
Query: 121 RLASFLLYLSDVEEGGETMFP---FENGIFLDS-GYDYKKCIGLKV--KPRRGDGLLFYS 174
R+A+ L+YL++ E GGET FP + N ++ G ++ C V P+RGD LLF+S
Sbjct: 113 RVATVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWS 172
Query: 175 LFPNGTI-DRTSLHGSCPVIKGEKWVATKWI 204
+ P+GT D + H CPV+ G KW ATKWI
Sbjct: 173 IGPDGTTEDYHASHTGCPVLSGVKWTATKWI 203
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/192 (40%), Positives = 106/192 (55%), Gaps = 15/192 (7%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PRA F NF + + ++ A +LK S + GE V RTS G FI D
Sbjct: 61 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGNDGEGV--VDEIRTSYGMFIRRLADP- 117
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQRLASFL 126
++ IE +I+ T LP H E VLRY GQ Y +HYD+ + + E GP+ RLA+FL
Sbjct: 118 -VITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW--RLATFL 174
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCIG--LKVKPRRGDGLLFYSLFPNG 179
+YLSDVEEGGET FP +N ++ D + +C + KP+ GD +LFYS +PN
Sbjct: 175 MYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFYPNL 233
Query: 180 TIDRTSLHGSCP 191
T+D ++H CP
Sbjct: 234 TMDPAAMHTGCP 245
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 108/210 (51%), Gaps = 23/210 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+ F + ++C +++A ++ RL S + + E+ RTS G +E
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDTGDENLIDARTSMGAMFQVAEHP- 162
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP--QMS---QRL 122
++ IE +IA T +P HGE +L Y+ G +Y H+D FNP G Q+S QR+
Sbjct: 163 -LITRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRI 221
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ E GG T FP +GL+V P +G+ + F L P+G +D
Sbjct: 222 ATLVIYLNTPEAGGATAFP---------------RVGLEVAPVKGNAVYFSYLLPDGALD 266
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQHED 212
+LH PV GEKW+ATKW+R++ D
Sbjct: 267 ERTLHAGLPVAFGEKWIATKWLRERPYRSD 296
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F N S E+C ++IA A+ R+ S L + E RTS G F +
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARS-LTVATKTGGEEINADRTSDGMFFQ--RGQ 171
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 121
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQR 231
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ + ++YL+ ++GG T FP + L+V P+RG+ + F P+ +
Sbjct: 232 VGTLVMYLNTPDKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 276
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PVI G+KW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVIAGDKWIATKWLRERE 301
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/198 (36%), Positives = 101/198 (51%), Gaps = 22/198 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR +FP F S E+C + TA+ L+PS L RTS G I + +
Sbjct: 140 PRVEHFPGFLSREECAHVATTAQDLLEPS-FVLDPNSGRPIPHPIRTSDGGAIGPTNENL 198
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
++ I +IA AT GE+ VLRY GQ+Y H D AE +QR+A+F++
Sbjct: 199 -VVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAE-----NQRIATFIV 252
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+D EGGET FP N ++V+PR GD + F ++ P+GT D +H
Sbjct: 253 YLNDGFEGGETHFPLLN---------------IQVRPRIGDAIRFDTIRPDGTPDPRLVH 297
Query: 188 GSCPVIKGEKWVATKWIR 205
PV G KW+AT+WIR
Sbjct: 298 AGQPVRNGVKWIATRWIR 315
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 92/156 (58%), Gaps = 4/156 (2%)
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 111
RTSSG F+S+ E K+ + IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 11 VRTSSGMFLSSEERKSPMA--IEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68
Query: 112 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 171
+ QR+A+ L+YLSD EGGET FP G K GL VKP +GD +L
Sbjct: 69 TFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGECSCGGKIVK--GLSVKPIKGDAVL 126
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
F+S+ +G D S+HG C V+ GEKW ATKW+R +
Sbjct: 127 FWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQR 162
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+CQ+II A+ R+ S L ++ E RTS G F E T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+++ +E +IAR P +GE VL Y G +Y HYD F+P + G QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL++ +GG T FP + L+V PR+G+ + F P+ +
Sbjct: 219 ATLVIYLNNPLKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG VI+GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 91/155 (58%), Gaps = 2/155 (1%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 112
RTSSG F+S E +++ IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 7 RTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDT 66
Query: 113 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
+ QR+A+ L+YLS+ EGGET FP G K GL VKP +GD +LF
Sbjct: 67 FNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCG--GKTVPGLSVKPAKGDAVLF 124
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+S+ +G D S+HG C V+ GEKW ATKW+R +
Sbjct: 125 WSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQK 159
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P N S E+C +I+ +K R+ S++A Q + RTS+ F+ ED +
Sbjct: 33 PFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQENDI------RTSTSVFLP--EDAS 84
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
+++ +E +I++ +P HGE +L Y+IGQ+Y +H+D F+P + + R+++ +L
Sbjct: 85 EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLIE--NPRISTLVL 142
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVEEGG+T FP + L V P +G + F + + ++ +LH
Sbjct: 143 YLNDVEEGGDTYFP---------------NLKLSVSPHKGMAVYFEYFYDDPMLNELTLH 187
Query: 188 GSCPVIKGEKWVATKWIR 205
G PV G+KW AT W+R
Sbjct: 188 GGAPVTIGDKWAATMWMR 205
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 81/207 (39%), Positives = 106/207 (51%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 65
RP A+ +F SA +C+ +I+ A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 120
T ++ +E +IA T LP +GE +L YE+G + H D A NPA E + Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQ 216
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+DVE GGETMFP G V PRRG L F G
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFEYGNRFGL 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +SLH S P+ GEKWVATKWIR +
Sbjct: 262 ADPSSLHTSTPLRVGEKWVATKWIRTR 288
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 105/207 (50%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 65
RP A+ +F SA +C+ +IA A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 120
T ++ +E +IA T LP +GE +L YE G + H D A NPA E + Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQ 216
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+DVE GGETMFP G V PRRG L F G
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFEYGNRFGL 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +SLH S P+ GEKWVATKWIR +
Sbjct: 262 ADPSSLHTSTPLRAGEKWVATKWIRTR 288
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 109/207 (52%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F N S E+C ++IA A R+ S L + E RTS G F +
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARS-LTVATKTGGEEINDDRTSDGMFFQRGQ-- 171
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 121
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 172 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQR 231
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ + ++YL+ E+GG T FP + ++V P+RG+ + F P+ +
Sbjct: 232 VGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQRGNAVFFSYERPHPST 276
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PV+ GEKW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVLAGEKWIATKWLRERE 301
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 61/122 (50%), Positives = 79/122 (64%), Gaps = 3/122 (2%)
Query: 87 HGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENG 145
+GE+ +L YE G+KY+ HYD F+ R+A+ L+YLSDV +GGET+FP E+
Sbjct: 8 NGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFPNAESK 67
Query: 146 IFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 203
+ + +C G VKPR+GD LLF+SL N T D SLHGSCPVI+GEKW ATKW
Sbjct: 68 LSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKWSATKW 127
Query: 204 IR 205
I
Sbjct: 128 IH 129
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 110/202 (54%), Gaps = 25/202 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 66
P L N S E+C +I +K +++ S++ A R+ ++ RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAAREVNSI------RTSSGMFFEESENE 92
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 126
++ IE ++++ + E VL+Y Q+Y +H+D F A + + R+++ +
Sbjct: 93 --LVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP +GL V P +G + F + + ++ +L
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 187 HGSCPVIKGEKWVATKWIRDQE 208
HG PVIKGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 105/206 (50%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F + S E+C +I A+ RLK S + + E RTS G + ED
Sbjct: 92 RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-EDVIQLRTSEGFWFQRCED- 149
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
+E ++H+I+ P HGE +L Y G +Y H+D F P + G + QR
Sbjct: 150 -AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQR 208
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YLSDVE GGET+FP D+G L V R+G + F + +
Sbjct: 209 VATLIVYLSDVEGGGETVFP-------DAG--------LAVMARQGGAIYFRYMNGRRQL 253
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV G+KW+ TKW+R++
Sbjct: 254 DPLTLHGGAPVTSGDKWIMTKWMRER 279
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 105/206 (50%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F + S E+C +I A+ RLK S + + E RTS G + ED
Sbjct: 95 RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-EDVIQLRTSEGFWFQRCED- 152
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
+E ++H+I+ P HGE +L Y G +Y H+D F P + G + QR
Sbjct: 153 -AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQR 211
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YLSDVE GGET+FP D+G L V R+G + F + +
Sbjct: 212 VATLIVYLSDVEGGGETVFP-------DAG--------LAVMARQGGAIYFRYMNGRRQL 256
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV G+KW+ TKW+R++
Sbjct: 257 DPLTLHGGAPVTSGDKWIMTKWMRER 282
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 119 bits (297), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 108/198 (54%), Gaps = 23/198 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K R++ S++A ++E + RTSS TF E++
Sbjct: 38 PLIVILGNVLSDEECDQLIQQSKDRMQRSKVA----NSLEVDE-LRTSSSTFFHEGENE- 91
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
I+ IE +I++ +P HGE +L Y+IGQ+Y +H+D F+ + R+++ ++
Sbjct: 92 -IVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAAS-NPRISTLVM 149
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVE+GGET FP + V P++G + F + + ++ +LH
Sbjct: 150 YLNDVEQGGETYFP---------------KLNFSVSPQKGMAVYFEYFYNDQNLNDLTLH 194
Query: 188 GSCPVIKGEKWVATKWIR 205
G PV+ G+KW AT+W+R
Sbjct: 195 GGAPVVMGDKWAATQWMR 212
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 110/202 (54%), Gaps = 25/202 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 66
P L N S E+C +I +K +++ S++ A R+ ++ RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIRLSKDKMQRSKIGAAREVNSI------RTSSGMFFDESENE 92
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 126
++ IE ++++ + E +L+Y Q+Y +H+D F A + + R+++ +
Sbjct: 93 --LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DVEEGGET FP +GL V P +G + F + + ++ +L
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 187 HGSCPVIKGEKWVATKWIRDQE 208
HG PVIKGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 105/198 (53%), Gaps = 24/198 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I+ +K R++ S+++ + S RTSS F +E+
Sbjct: 44 PLIVLLGNVLSEEECDQLISLSKDRIERSKISNK------SVHDLRTSSSMFFDDAEND- 96
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
++ +E ++++ +P HGE +L Y IGQ+Y +HYD F+ + R+++ ++
Sbjct: 97 -VVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKVN-NPRISTLVM 154
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVE GGET FP + V P++G + F + + T++ +LH
Sbjct: 155 YLNDVEAGGETYFP---------------KLNFYVAPKKGMAVYFEYFYNDTTLNELTLH 199
Query: 188 GSCPVIKGEKWVATKWIR 205
G PV+ G+KW AT+W+R
Sbjct: 200 GGAPVVIGDKWAATQWMR 217
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 106/201 (52%), Gaps = 23/201 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P L N S E+C +I +K +++ S++ + RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAER-----EVNSIRTSSGMFFEESENE- 92
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
++ IE ++++ + E +L+Y Q+Y +H+D F A + + R+++ ++
Sbjct: 93 -LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLVM 150
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVEEGGET FP +GL + P +G + F + + ++ +LH
Sbjct: 151 YLNDVEEGGETYFP---------------KLGLSISPTKGMAVYFEYFYSDAELNDRTLH 195
Query: 188 GSCPVIKGEKWVATKWIRDQE 208
G PVIKGEKWVAT+W+R Q+
Sbjct: 196 GGAPVIKGEKWVATQWMRKQK 216
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 103/212 (48%), Gaps = 35/212 (16%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT------RTSSGTFI 60
RP+ + F N ++C +I + +L+ Q TV + GT RTS GT+
Sbjct: 95 RPQIVVFGNVLDQDECDEMIQRSMHKLE-------QSTTVNAETGTQEVIRHRTSHGTWF 147
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-- 118
ED ++ IE ++A P +GE VLRY G +Y SHYD F P G
Sbjct: 148 QNGED--ALIRRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSHYDYFQPTAAGSLTHV 205
Query: 119 ---SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
QR+A+ ++YL+DV GGET+FP G+ V PRRGD + F +
Sbjct: 206 RTGGQRVATLIVYLNDVPSGGETVFPEA---------------GISVVPRRGDAVYFRYM 250
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+D +LH PV GEKW+ TKW+R++
Sbjct: 251 NRLRQLDPATLHAGAPVRDGEKWIMTKWVRER 282
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 123 ASFLLYLSDVEEGGETMFP 141
A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 104/198 (52%), Gaps = 23/198 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C+ +I ++ +LK S++ + RTSS F E++
Sbjct: 39 PLIVILGNVLSDEECEGLIRMSEDKLKRSKIG-----NTRTVDDIRTSSSMFFEEGENE- 92
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
++ IE ++++ +P HGE +L Y IGQ+Y +H+D F+ + + R+++ ++
Sbjct: 93 -LVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSR-AASNPRISTLVM 150
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVEEGGET FP + V P++G + F + N ++ +LH
Sbjct: 151 YLNDVEEGGETYFP---------------KLNFSVNPQKGSAVYFEYFYDNQDLNDLTLH 195
Query: 188 GSCPVIKGEKWVATKWIR 205
G PVIKG KW AT+W+R
Sbjct: 196 GGAPVIKGSKWAATQWMR 213
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 111/213 (52%), Gaps = 25/213 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
+ + ++P +F S E+C ++I+ A ++LK S++ G VE + T TS+G
Sbjct: 72 LSFVCYKPFVTVINDFLSPEECDALISDADQKLKASRVVDPEDGSFVEHSARTSTSTGY- 130
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM- 118
+ I++ IE +IA P HGE VLRYE G +Y H+D F+PA+ ++
Sbjct: 131 ---HRGEIDIIKTIEARIADLINWPVDHGEGLQVLRYEDGGEYRPHFDFFDPAKKSSRLV 187
Query: 119 ----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
QR+ +FL+YLS+V+ GG T FP + +++P +G L F +
Sbjct: 188 TKQGGQRVGTFLMYLSEVDSGGSTRFP---------------NLNFEIRPNKGSALYFAN 232
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
I+ +LH PV +G K++ATKW+R++
Sbjct: 233 TNLKAEIEPLTLHAGMPVTEGVKYLATKWLREK 265
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S E+C +I +K+RL+ S++ GE S RTSSG F +E
Sbjct: 36 PLIVILGNVLSNEECDELIEHSKERLQRSKI----GEE-RSVNQIRTSSGVFCEENE--- 87
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
+ IE +I++ +P HG+ VL Y GQ+Y H+D F + R+++ ++
Sbjct: 88 -TVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRA-SANNRISTLVM 145
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVEEGGET FP N L V P +G + F + N ++ +LH
Sbjct: 146 YLNDVEEGGETTFPMLN---------------LSVFPSKGMAVYFEYFYSNHELNERTLH 190
Query: 188 GSCPVIKGEKWVATKWIRDQ 207
PV KGEKWVAT W+R Q
Sbjct: 191 AGAPVRKGEKWVATMWMRRQ 210
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 104/206 (50%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F N S E+C+++IA A R+ S L + E RTS G F E
Sbjct: 131 PRVVVFGNLLSPEECEALIAAAAPRMARS-LTVATQTGGEEVNDDRTSHGMFFQRGESP- 188
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
+++ IE +IA P +GE VL Y G +Y HYD F+PAE G P + QR+
Sbjct: 189 -LVQRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTVIQRGGQRV 247
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL+ E+GG T FP ++V P+RG+ F P T
Sbjct: 248 GTLVMYLNTPEQGGGTTFPDAQ---------------IEVAPQRGNAAFFSYERP--TPS 290
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQE 208
+LHG PV+ G+KW+ATKW+R++E
Sbjct: 291 TRTLHGGAPVLAGDKWIATKWLRERE 316
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 91/148 (61%), Gaps = 7/148 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
++++SW PRA + NF E+C+ +I AK ++ S + + G++ +S RTSSGTF
Sbjct: 78 VEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTF 133
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F
Sbjct: 134 LARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 191
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIF 147
QR+A+ L+YLSDVEEGGET+FP G +
Sbjct: 192 QRIATVLMYLSDVEEGGETVFPAAKGNY 219
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 107/210 (50%), Gaps = 21/210 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++ P + F + S ++ + A+ LK + + + G+ V ++ RTS G ++
Sbjct: 310 MEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTVHV-NGKYV--SRRVRTSKGAWL 366
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMS 119
D + IE ++ T L EA+N++ Y +G Y +HYD FN + +
Sbjct: 367 E--RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSETG 424
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L YLSDVE+GG T+FP + L V P RG L +Y+L NG
Sbjct: 425 DRIATVLFYLSDVEQGGATVFP---------------NLKLAVSPERGMALFWYNLLDNG 469
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
T D +LHG CPV+ G KWV T WI ++ Q
Sbjct: 470 TGDTRTLHGGCPVLVGSKWVMTLWIHERAQ 499
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 115 bits (289), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S +C +I +++RL+ S++ GE S RTSSG F +E T
Sbjct: 36 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-RSVNSIRTSSGVFCEQTETIT 90
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
I E +I++ +P HG+ VLRY GQ+Y HYD F + R+++ ++
Sbjct: 91 RI----EKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 145
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVE+GGET+FP + L V P +G + F + N ++ +LH
Sbjct: 146 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYRNQEVNEFTLH 190
Query: 188 GSCPVIKGEKWVATKWIRDQ 207
VI GEKWVAT W+R Q
Sbjct: 191 AGAQVIHGEKWVATMWMRRQ 210
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 115 bits (289), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 74/218 (33%), Positives = 108/218 (49%), Gaps = 18/218 (8%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS P+A + F SAE+C +I LK S + + +T RTS GTF+
Sbjct: 1 VEKLSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDT-GVLDDVRTSFGTFL 59
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
D +L IE ++ + + + E +L+Y GQ+Y H D P +
Sbjct: 60 PKKYDD--VLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGLT----SPNGGR 113
Query: 121 RLASFLLYLSDVEEGGETMFPFENGI---------FLDSGYD--YKKCIGLKVKPRRGDG 169
R+A+ L++L + E+GGET FP + D D ++ GL VKPRRGD
Sbjct: 114 RIATVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDELSDCAWRDGRGLAVKPRRGDA 173
Query: 170 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+LF+S NG D S H SCP + G KW ATKWI ++
Sbjct: 174 VLFFSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEK 211
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 100/207 (48%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 65
RP A+ F SA +C+ +I A+ RL S + G + G R+S G F E
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNI--VAGHRSSDGMFFRLGE- 157
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQ 120
T ++ IE +IA T P +GE +L YE G + H D P AE + Q
Sbjct: 158 -TPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQ 216
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+DVE GGET+FP +G V PRRG F +G
Sbjct: 217 RVGTLLMYLNDVESGGETLFP---------------QVGCSVVPRRGQAFYFEYGNGSGR 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D SLH S P+ G+KWVATKWIR +
Sbjct: 262 SDPASLHASSPIGSGDKWVATKWIRTR 288
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 99/194 (51%), Gaps = 25/194 (12%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
N S E+C+ +I +K ++ S++ + RTSS TF+ + + IE
Sbjct: 41 NVVSEEECEELIFLSKNKMNRSKIGSQH-----EVSDIRTSSSTFLPEDD----LTNRIE 91
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 134
++A+ +P HGE ++L Y+ GQ+Y +HYD F + R+++ +LYL+DVEE
Sbjct: 92 KRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLNDVEE 150
Query: 135 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 194
GGET FP N L + P +G + F + + I+ +LHG PV
Sbjct: 151 GGETYFPHMN---------------LSISPHKGMAVYFEYFYSDPLINERTLHGGSPVTS 195
Query: 195 GEKWVATKWIRDQE 208
GEKW AT W+R ++
Sbjct: 196 GEKWAATMWVRRKQ 209
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 101/207 (48%), Gaps = 32/207 (15%)
Query: 9 RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT------RTSSGTFISA 62
R F NFASA++C + +K+L V T G R S+ ++
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKL---------SRAVAWTDGAFRPVEFRISTAAWLQP 355
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
D ++ + +IA AT L EA V Y IG Y++HYD E R+
Sbjct: 356 DHDD--VVTNLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRI 413
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+F++YL+ VE+GG T FP +G V+P GD + +Y+L P+G D
Sbjct: 414 ATFMIYLNQVEQGGYTAFPR---------------LGAAVEPGHGDAVFWYNLLPDGESD 458
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+LHG+CPV++G KWVA KWI +++
Sbjct: 459 NNTLHGACPVLQGSKWVANKWIHEKKN 485
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+ N S E+C+++I +K ++ S++ + RTSS F+ E
Sbjct: 34 PKIAILGNVVSEEECEALIRLSKDKVNRSKIG-----SDHDVSDIRTSSSAFLPDDE--- 85
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
+ IE ++A+ +P HGE ++L Y+ GQ+Y +H+D F + + R+++ +L
Sbjct: 86 -LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAK-NPRISTLVL 143
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVEEGGET FP N L V P +G + F + + I+ +LH
Sbjct: 144 YLNDVEEGGETYFPEMN---------------LTVSPHKGMAVYFEYFYNDPAINERTLH 188
Query: 188 GSCPVIKGEKWVATKWIRDQE 208
G PV GEKW AT W+R Q+
Sbjct: 189 GGSPVTAGEKWAATMWVRRQQ 209
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 105/176 (59%), Gaps = 7/176 (3%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD--VRTSSGMFV 118
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 119 NSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSDTFNLKRGGQ 178
Query: 121 RLASFLLYLSDVEEGGETMFPFE-NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
R+A+ L+YL+D EGGET FP +G + G + GL VKP +GD +LF+S+
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGDGECICGG---RLVRGLCVKPNKGDAVLFWSM 231
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + N S +C +I +++RL+ S++ GE S RTSSG F +E T
Sbjct: 41 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-GSVNSIRTSSGVFCEQTETIT 95
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
I E +I++ +P HG+ VLRY GQ+Y HYD F + R+++ ++
Sbjct: 96 RI----EKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 150
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVE+GGET+FP + L V P +G + F + N ++ +LH
Sbjct: 151 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYSNQELNDFTLH 195
Query: 188 GSCPVIKGEKWVATKWIRDQ 207
VI GEKWVAT W+R Q
Sbjct: 196 AGTQVIHGEKWVATMWMRRQ 215
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 102/206 (49%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F + S ++C +I A+ RLK S + E RTS G + ED
Sbjct: 115 RPQVIVFDDVLSRDECDELIERARHRLKRS-TTVNPESGREDVIQLRTSEGFWFQRCED- 172
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
+E ++ +I+ P HGE +L Y G +Y H+D F P++ G + QR
Sbjct: 173 -AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGGQR 231
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YLSDV GGET+FP GL V R+G + F L + +
Sbjct: 232 VATLIVYLSDVAGGGETVFP---------------NAGLAVMARQGGAIYFRYLNGHRQL 276
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV GEKW+ TKW+R++
Sbjct: 277 DPLTLHGGAPVTNGEKWIMTKWMRER 302
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 103/210 (49%), Gaps = 37/210 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S E+C ++A A+ RL S ETV+++ G RTS G F
Sbjct: 92 PRVVVFGGLLSDEECDELVALARPRLARS-------ETVDNSTGGSEVNAARTSDGMFFE 144
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----- 116
E ++E IE +IA P GE VLRY G +Y H+D F+PA G
Sbjct: 145 RGEKP--LIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILR 202
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ QR+ + ++YL+ GG T FP +GL+V+P +G+ + F
Sbjct: 203 RGGQRVGTVVMYLNTPAGGGATTFP---------------EVGLEVQPVKGNAVFFSYER 247
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
P + +LHG PV+ GEKWVATKW+R+
Sbjct: 248 PLAST--RTLHGGAPVLDGEKWVATKWMRE 275
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/222 (31%), Positives = 110/222 (49%), Gaps = 17/222 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS P F ++ I+ + + LKPS + L G + RTS+ F+
Sbjct: 201 LETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTTYFL 260
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY---GPQ 117
+ D ++ I+ +++ T +P H E VLRYE QKYD H D F P E+ P
Sbjct: 261 PS--DAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYF-PVEHHKNAPH 317
Query: 118 M--------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGD 168
+ R+ + Y+SDV +GG T+FP G + K C GL V P++
Sbjct: 318 ILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGAPRPTSM--KDCTTGLNVPPKKRK 375
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
++FYS+ PNG D SLHG CPV +G K+ KW+ ++ ++
Sbjct: 376 VIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWVWNKARY 417
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 112/214 (52%), Gaps = 27/214 (12%)
Query: 1 MQVLS--WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 58
+QVL+ PR + F N +AE+C ++IA A++++K S + + RTS G
Sbjct: 87 VQVLASLLHPRVIVFGNLLAAEECDALIALARRQIKRSPV-FDPDTGQDQQHQARTSEGM 145
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
F + + +E +IA P +GE VLRY G +Y+ HYD F+PA G ++
Sbjct: 146 FFGRGANP--LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYFDPARPGAEV 203
Query: 119 S-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 173
+ QR+AS ++YL+ +GG T FP + L+V P +G+ + F
Sbjct: 204 ALRRGGQRVASLVIYLNTPTQGGATTFPDAH---------------LEVAPIKGNAVYFS 248
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ +LHG PV++GEKWVATKW+R++
Sbjct: 249 YDRPHPMTG--TLHGGAPVVEGEKWVATKWLRER 280
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 113 bits (283), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 100/190 (52%), Gaps = 23/190 (12%)
Query: 21 QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 80
+C +I ++ ++ S + + E T R S G F++AS D ++E I+ +IA
Sbjct: 107 ECDRLIEIGREHVQRSSV-VDPDSGKEITIEERRSEGAFVNASTD--ALVETIDRRIAEL 163
Query: 81 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEG 135
P +GE ++LRY +G +Y HYD F + G + QR+A+ +LYL++VE+G
Sbjct: 164 FRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQG 223
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G+T FP IGL + PRRG L F + G D +LH PV KG
Sbjct: 224 GDTTFP---------------DIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKG 268
Query: 196 EKWVATKWIR 205
EKW+ATKWIR
Sbjct: 269 EKWIATKWIR 278
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 113 bits (283), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 100/200 (50%), Gaps = 25/200 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P L N S +C +I A R++ +++ + RTSS F SE++
Sbjct: 32 PLILILDNVLSWAECDLLIDLASARMQRAKIG-----SSHDVSEVRTSSSMFFEESENEC 86
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
+ +E ++A +P +H E VLRY+ G++Y H+D F G M+ R+++ ++
Sbjct: 87 --IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQ---GSSMNNRISTLVM 141
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVEEGGET FP + V P++G + F + + ++ +LH
Sbjct: 142 YLNDVEEGGETYFP---------------SLHFSVTPKKGSAVYFEYFYNDTRLNELTLH 186
Query: 188 GSCPVIKGEKWVATKWIRDQ 207
PV GEKWVAT+W+R Q
Sbjct: 187 AGHPVEAGEKWVATQWMRRQ 206
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 113 bits (283), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 101/205 (49%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + N S ++C +I A ++ R S + + +RTS I E T
Sbjct: 92 PRIVVLGNVLSDDECDAIAAMSRTRFARST-TIDNASGINRFDDSRTSESAHIQRGE--T 148
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQRL 122
++ I+ ++A + P HGE + +Y+ G +Y H+D F+PA G + QRL
Sbjct: 149 ELIARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRL 208
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ +LYL+DVEEGG T FP IGL V P++G L F + P G D
Sbjct: 209 ATIILYLTDVEEGGGTSFP---------------GIGLDVHPQKGGALFFRNTTPYGVPD 253
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
R + H PV KG K +A KW+R++
Sbjct: 254 RKTQHAGLPVEKGTKIIANKWLREK 278
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 100/205 (48%), Gaps = 25/205 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 65
RP A+ F + +C +IA A+ RL S + G V + G R+S GTF +E
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAA--GHRSSDGTFFRLAE- 157
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQ 120
T ++ +E +IA T L +GE +LRY+ G + H D E + Q
Sbjct: 158 -TPLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQ 216
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL+DVE GGET+FP +G V PRRG L F G
Sbjct: 217 RVGTLLMYLNDVEGGGETVFP---------------QVGCSVVPRRGQALYFEYCNRAGV 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIR 205
D SLH S P+ GEKWVATKWIR
Sbjct: 262 CDPASLHASTPLRSGEKWVATKWIR 286
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 77/207 (37%), Positives = 105/207 (50%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 65
RP A++ NF SA++C+ +IA A+ RL S + G V +T R+S G F E
Sbjct: 101 RPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATH--RSSHGMFFRLGE- 157
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 120
T ++ IE +IA T P +GE +L YE G + H D N A E + Q
Sbjct: 158 -TPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQ 216
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL DVE GGET+FP +G + P+RG L F G
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP---------------QVGWSIVPQRGHALYFEYGNRYGM 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +SLH S P+ G+KWVATKWIR +
Sbjct: 262 CDPSSLHASTPLRTGDKWVATKWIRTR 288
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 75/213 (35%), Positives = 109/213 (51%), Gaps = 24/213 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPS----QLALRQGETVESTKGTRTSSGTFISAS 63
PR Y NF SA++ ++A + + PS A QG + + TRTS F +
Sbjct: 2 PRVFYVHNFLSADEADELVAFS---MAPSTGGTHKAWNQGGS-NAKLTTRTSMNAFDITT 57
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQM 118
+ I + ++ R + + +LRYE+GQ Y +H+D F N + P
Sbjct: 58 KLSFRI-KRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPSK 116
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG-----LKVKPRRGDGLL 171
S R A+ LYLSDVE GG+T+ E +D+G K + L V PRRGD +L
Sbjct: 117 GGSNRFATIFLYLSDVEVGGQTL---EKDAGVDAGSWEDKLVDQCYSKLAVPPRRGDAIL 173
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
FYS +P+G +D SLHG+CP++KG KW A W+
Sbjct: 174 FYSQYPDGHLDPNSLHGACPILKGTKWGANLWV 206
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 104/212 (49%), Gaps = 35/212 (16%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFI 60
RP+ + F + SA +C +I ++ RLK S TV G RTS G +
Sbjct: 96 RPQLVVFADVLSAAECAELIERSRHRLKRST-------TVNPLTGREDVIRNRTSEGVWY 148
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP---- 116
ED+ ++ +E +IA T P +GE VL Y +Y H+D F P + G
Sbjct: 149 RRGEDQ--LIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHT 206
Query: 117 -QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
Q QR+A+ ++YL+DV +GGET+FP GL V + G + F +
Sbjct: 207 TQGGQRVATLIIYLNDVADGGETVFP---------------TAGLSVAAQAGGAVYFRYM 251
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+D ++LHG PV+ G+KW+ TKW+R++
Sbjct: 252 NAERQLDPSTLHGGAPVLAGDKWIMTKWMRER 283
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 88/153 (57%), Gaps = 3/153 (1%)
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
F++ E K ++ IE +I+ + +P +GE VLRYE Q Y H+D F +
Sbjct: 2 FLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRG 61
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
QR+A+ L+YLSD EGGET FP G K GL VKP +G+ +LF+S+ +
Sbjct: 62 GQRIATMLMYLSDNVEGGETYFPNIGSGQCSCG--GKTVEGLSVKPTKGNAVLFWSMGLD 119
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
G D S+HG C V+ GEKW ATKW+R Q+ H+
Sbjct: 120 GQSDPLSVHGGCEVLAGEKWSATKWMR-QKAHQ 151
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/146 (41%), Positives = 90/146 (61%), Gaps = 5/146 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LS RPRA + F S +C +++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL 148
A+ L+YL+DV++GGE +FP G L
Sbjct: 154 ATVLMYLTDVKKGGEAVFPDAEGSHL 179
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/207 (37%), Positives = 104/207 (50%), Gaps = 25/207 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 65
RP A++ +F SA++C+ +IA A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNV--VAGHRSSHGMFFRLGE- 157
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 120
T ++ IE +IA T P +GE +L YE G + H D N A E + Q
Sbjct: 158 -TPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRESIARSGQ 216
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+ + L+YL DVE GGET+FP IG V P+RG L F G
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP---------------QIGWSVAPQRGHALYFEYGNRFGL 261
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +SLH S P+ G+KWVATKWIR +
Sbjct: 262 CDPSSLHASTPLRVGDKWVATKWIRTR 288
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 98/205 (47%), Gaps = 25/205 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + +F S +C ++IA A+ RL S+ + + RTS + +D
Sbjct: 96 PRVVVLGDFLSDAECDALIALAQPRLARSR-TVDNDNGAQIVHAARTSDSMCLQLGQD-- 152
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+ + IE +IAR P HGE VLRY G +Y HYD F+P G + QRL
Sbjct: 153 ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQRL 212
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ E GG T FP + L V +G+ + F P+
Sbjct: 213 ASLVMYLNTPERGGATRFPD---------------VHLDVAAVKGNAVFFSYDRPHPMT- 256
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
SLH PV+ GEKWVATKW+R++
Sbjct: 257 -RSLHAGAPVLAGEKWVATKWLRER 280
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 102/198 (51%), Gaps = 28/198 (14%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
S +C +I A+ RL+ + G+ + RTS G F A E T ++ IE
Sbjct: 100 GLLSERECADLIELARPRLQRALTVDSDGK--QQIDQRRTSEGMFFRAGE--TPLVAAIE 155
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQRLASFLLYL 129
++A+ +P +HGE +L Y GQ+Y+ HYD F+PA G + QR+AS ++YL
Sbjct: 156 QRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPALPGYDKLTARAGQRIASVVMYL 215
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
+ E GG T FP IGL V RRG + F + G D++SLH
Sbjct: 216 NTPERGGGTAFP---------------EIGLTVTARRGAAVYFA--YEGG--DQSSLHAG 256
Query: 190 CPVIKGEKWVATKWIRDQ 207
PV++GEKW+AT W+R++
Sbjct: 257 LPVLQGEKWIATHWLRER 274
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 90/146 (61%), Gaps = 5/146 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSWRPRA F S +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFL 148
A+ L+YLS+VE+GGET+FP G L
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLL 180
>gi|90022913|ref|YP_528740.1| hypothetical protein Sde_3273 [Saccharophagus degradans 2-40]
gi|89952513|gb|ABD82528.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 478
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 108/219 (49%), Gaps = 41/219 (18%)
Query: 8 PRALYFPN----------------FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 51
PR ++ PN F + E+C+ IIA + +L+PS+L+ + ES K
Sbjct: 86 PRKIFIPNALKLNSDKLEMYALGEFLTTEECERIIANIRSKLRPSELS-----SQESDKT 140
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN- 110
RTS + +D + ++ +I + + ++ E YE+GQ++ +H D F
Sbjct: 141 YRTSRTCDLGTIDDP--FIHYVDSRICKLVGIDPSYSEVIQGQLYEVGQEFKAHTDYFEI 198
Query: 111 --PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 168
E+G M QR + ++YL+DVEEGGET FP +G +KPR G
Sbjct: 199 KEMPEHGAVMGQRTYTVMIYLNDVEEGGETDFPAADG---------------AIKPRAGL 243
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
L++ SL NG + S+H + PV+KG K V TKW R Q
Sbjct: 244 ALIWNSLQSNGAPNPHSMHQAYPVLKGHKAVITKWFRSQ 282
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 99/205 (48%), Gaps = 23/205 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + S +C +++ A+ RL S + + E+ RTS G E
Sbjct: 90 PSIRLYQHLLSDAECDALVELARGRLARSPV-INPDTGDENLIDARTSMGAMFQVGEHT- 147
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+++ IE +IA +P HGE +L Y+ G +Y H+D FNP G QR
Sbjct: 148 -LIQRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRT 206
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ ++YL+ + GG T FP IGL+V P +G+ + F L P+G +D
Sbjct: 207 ATLVIYLNTPQAGGATAFP---------------RIGLEVAPVKGNAVYFSYLQPDGKLD 251
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+LH PV GEKW+ATKW+R+
Sbjct: 252 ERTLHAGLPVQSGEKWIATKWLREH 276
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 99/203 (48%), Gaps = 23/203 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + S +C +I ++R++ S + + E R S G F++ S D
Sbjct: 91 PVVALLADVLSPRECDRLIEIGRERVRRSSV-VDPDSGGEVLIDARKSEGAFVNGSTDP- 148
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
++ I+ +IA P +GE ++LRY G +Y H+D F + G + QR+
Sbjct: 149 -LVATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRI 207
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ +LYL+ VEEGG+T FP IGL + PRRG L F + G D
Sbjct: 208 ATLILYLNQVEEGGDTTFPD---------------IGLTIHPRRGAALYFEYVNALGQTD 252
Query: 183 RTSLHGSCPVIKGEKWVATKWIR 205
+LH PV +GEKW+ATKW+R
Sbjct: 253 PRTLHAGMPVERGEKWIATKWMR 275
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 104/210 (49%), Gaps = 37/210 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S ++C ++A A+ RL LR ETV+++ G RTS G F
Sbjct: 37 PRVVVFGGLLSEQECDELVALAQPRL------LRS-ETVDNSTGGSEVNAARTSDGMFFE 89
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----- 116
E T ++E IE +IA P GE VL Y G +Y H+D F+PA G
Sbjct: 90 RGE--TPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPHHDFFDPAHPGTANILR 147
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ QR+ + ++YL+ GG T FP +GL+V+P +G+ + F
Sbjct: 148 RGGQRVGTVVIYLNTPAGGGATTFP---------------EVGLEVQPIKGNAVFFSYER 192
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
P + +LHG PV+ GEKWVATKW+R+
Sbjct: 193 PLAST--RTLHGGAPVLDGEKWVATKWLRE 220
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 101/211 (47%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S +C ++IA A+ RL S+ TV++ G RTS G +
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDGMCLR 148
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 206
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 73/209 (34%), Positives = 103/209 (49%), Gaps = 37/209 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F F S ++C +++A A+ RL S ETV++ G RTS G F
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARS-------ETVDNDTGGSEVNEARTSQGMFFM 152
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 118
E + ++ IE +IA P +GE VL Y G +Y HYD F+PA+ G P +
Sbjct: 153 RGEGE--LISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILK 210
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+ + ++YL+ E GG T FP N L+V P +G+ + F +
Sbjct: 211 RGGQRVGTLVMYLNTPERGGGTTFPDVN---------------LEVAPIKGNAVFFS--Y 253
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
SLHG PV+ GEKWVATKW+R
Sbjct: 254 ERAHPSTRSLHGGAPVLAGEKWVATKWLR 282
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 109 bits (273), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 107/212 (50%), Gaps = 26/212 (12%)
Query: 3 VLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+LS R PR + F S E+C +++ A+ RL S+ + G RTS G F
Sbjct: 102 LLSMRNPRVMVFGGLLSDEECDAMVDLARPRLARSE-TVHNGSGGSEVNAARTSDGMFFD 160
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 118
E + IE +IA P +GE VLRY G +Y +H+D F+PA+ G P +
Sbjct: 161 RGEFP--LCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILK 218
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+ + ++YL+ GG T FP +GL+V P +G+ +F+S
Sbjct: 219 RGGQRVGTVVMYLNHPIRGGGTAFP---------------DVGLEVAPFKGNA-VFFSYD 262
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+ RT LH PV++GEKWVATKW+R+ E
Sbjct: 263 RAHPMTRT-LHAGTPVLEGEKWVATKWVREGE 293
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGED- 167
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 168 -AFIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 227 VATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQRQL 271
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV G+KW+ TKW+R++
Sbjct: 272 DPLTLHGGAPVHAGDKWIMTKWMRER 297
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P+ + N S E+C +IIA R S + + +G RTS FI E +
Sbjct: 82 QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEG-RTSEMAFIQRGEAE 140
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 121
+ E IE ++A P E F + +Y+ Q+Y HYD +P G + QR
Sbjct: 141 --VAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQR 198
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
LA+F+LYLSDVE+GG T+FP +GL+V P++G L F + N
Sbjct: 199 LATFILYLSDVEQGGGTVFP---------------GLGLEVYPKKGSALWFLNTDINHQP 243
Query: 182 DRTSLHGSCPVIKGEKWVATKWIR 205
D+ +LHG PV++G K +A KW+R
Sbjct: 244 DKRTLHGGAPVVRGTKIIANKWLR 267
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGED- 167
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 168 -AFIERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 227 VATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQRQL 271
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV G+KW+ TKW+R++
Sbjct: 272 DPLTLHGGAPVRAGDKWIMTKWMRER 297
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 94/190 (49%), Gaps = 20/190 (10%)
Query: 36 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 95
S +A E RTSS ++S + D ++ I ++A LP E VL
Sbjct: 4 STIAEAGNEAKNGVGSARTSSTAWLSKTADP--LVAKIRTRVAELVKLPMELAEDMQVLH 61
Query: 96 YEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSG 151
Y Q Y +H+D F+P Y ++ R + YLSDVEEGGET+FPF NG
Sbjct: 62 YSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDRRV- 120
Query: 152 YDYKKCI-GLKVKPRRGDGLLFYSLFPN------------GTIDRTSLHGSCPVIKGEKW 198
D+ C GLKVKP+ G+ ++FYS+ +D SLHG C VIKG+KW
Sbjct: 121 TDFADCSRGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIKGDKW 180
Query: 199 VATKWIRDQE 208
A WI +++
Sbjct: 181 AANYWIANKK 190
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 109/213 (51%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++ +S P + + N S + +S+ A A K+L+P+ + + +G TR + F
Sbjct: 97 IEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIAKMAF 156
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
I ++++ + I ++ T L E V+ Y I +Y HYD F PA+ G +
Sbjct: 157 IL--DEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTF-PAKSGDRSH 213
Query: 120 ---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
RLA+ +LYLSDVE GG T+F I ++V PR+G+ +++Y+
Sbjct: 214 PSHDRLATAILYLSDVERGGATVF---------------TNINVRVLPRKGNVIIWYNYL 258
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P+G + +LH CPV+ G KW+A KWI+ + Q
Sbjct: 259 PDGNLHPGTLHAGCPVLVGSKWIANKWIQSKGQ 291
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 104/205 (50%), Gaps = 28/205 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P N +AE+C+ +IA A+ RLK + G + RTS G F + +E
Sbjct: 95 PALRVLENLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR--RTSEGMFFTLNE--L 150
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
++ IE ++A +P +HGE +L Y GQ+Y+ H+D F+P + G QR+
Sbjct: 151 PLVGRIEQRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRV 210
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ +GG T FP +GL V RRG + F + G D
Sbjct: 211 ASVVMYLNTPAQGGGTAFP---------------ELGLTVTARRGAAVYFA--YEGG--D 251
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
+ SLH PV +GEKW+ATKW+R++
Sbjct: 252 QQSLHAGLPVQRGEKWIATKWLRER 276
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 101/195 (51%), Gaps = 23/195 (11%)
Query: 18 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 77
S E+C +I A +L+ S + + T R+S GTF + D + ++ +I
Sbjct: 105 SHEECDELIRRAAAKLQRSTI-VDPTTGKHETIADRSSEGTFFEINADD--FIARLDRRI 161
Query: 78 ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP--QMS---QRLASFLLYLSDV 132
+ LP HGE +L Y G +Y H+D F P + G QM+ QR+++ ++YL++V
Sbjct: 162 SALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVSTLVMYLNEV 221
Query: 133 EEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPV 192
E+GG T+FP +GL V P++G + F G +D +LHG PV
Sbjct: 222 EDGGATIFP---------------ELGLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPV 266
Query: 193 IKGEKWVATKWIRDQ 207
++GEKW+ TKW+R +
Sbjct: 267 LRGEKWIVTKWMRQR 281
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 102/198 (51%), Gaps = 28/198 (14%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
N SA +C +IA A+ RL+ + +G + RTS G F + D+ ++ IE
Sbjct: 102 NILSARECDELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTL--DEVPLVGRIE 157
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 129
++A +P +HGE +L Y GQ Y+ H+D F+P + G + QR+AS ++YL
Sbjct: 158 RRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASVVMYL 217
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
+ GG T FP +GL V RRG + F + G D +SLH
Sbjct: 218 NTPARGGGTAFP---------------ALGLTVTARRGAAVYFA--YEGG--DCSSLHAG 258
Query: 190 CPVIKGEKWVATKWIRDQ 207
PV++GEKW+ATKW+R++
Sbjct: 259 LPVLEGEKWIATKWLRER 276
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 101/211 (47%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S E+C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 216
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ G+KWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 290
>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
Length = 212
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 97/200 (48%), Gaps = 23/200 (11%)
Query: 11 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 70
++F S E+C +IA KPS++ + T G R+ T S S DK I+
Sbjct: 15 VHFSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRS---TVASPSADKYPII 71
Query: 71 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFLL 127
+ + +I+ + + + E VL Y G +YD HYD+F E PQ+ R+ + LL
Sbjct: 72 KAVRRRISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSF--LEGSPQLENGGNRMLTVLL 129
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL+DVE+GG T FP I + P G G+LF + R SLH
Sbjct: 130 YLNDVEQGGWTQFPH---------------IMANIVPNVGTGILFRNTDAQNLQLRESLH 174
Query: 188 GSCPVIKGEKWVATKWIRDQ 207
PVI GEKW+A+ WIR++
Sbjct: 175 AGLPVIDGEKWIASIWIREK 194
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/212 (33%), Positives = 109/212 (51%), Gaps = 25/212 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 60
+VL+ P + + AS + +I AK R+ S+ +R GE RTS ++
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRI--SRATIRDDGEP--QVSNARTSQNAWL 358
Query: 61 SASEDKTGILELIEHKIARATM-LPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEY-GPQ 117
A +D+ ++ ++ ++ T L Q E V Y +G Y +H+D A Y G +
Sbjct: 359 DAGDDR--VVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLR 416
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+A+ + YLSDVE GG T+FP +GL V PR+G +L+Y+L+
Sbjct: 417 VGNRIATVMFYLSDVEIGGATVFP---------------QLGLAVFPRKGSAILWYNLYR 461
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG DR +LH +CPV+ G KWVA +WI + Q
Sbjct: 462 NGKGDRRTLHAACPVLSGSKWVANQWIHEYHQ 493
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 103/206 (50%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F N S ++C +I ++ RLK S + + E RTS G + ED
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTI-VDPATGREDVIRNRTSEGIWYQRGED- 180
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 181 -ALIERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQR 239
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 240 VATLVVYLNDVPDGGETIFPEA---------------GLSVAAQQGGAVYFRYMNGRRQL 284
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 285 DPLTLHGGAPVLSGDKWIMTKWVRER 310
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 102/201 (50%), Gaps = 35/201 (17%)
Query: 18 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILE 71
SAE+C+++IA A+ RL PS +V+ G R+S G F E+ +
Sbjct: 109 SAEECEALIALARPRLAPST-------SVDPLTGRNRLGAQRSSLGMFFRLREN--AFVA 159
Query: 72 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFL 126
++ +++ LP +GE VL Y G + H+D P+ Q S QR+++ +
Sbjct: 160 RLDERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLV 219
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
YL++VEEGGET+FP ++G+ V P+RG + F G +D SL
Sbjct: 220 AYLNEVEEGGETVFP-------ETGW--------SVSPQRGGAVYFEYCNSLGQVDHASL 264
Query: 187 HGSCPVIKGEKWVATKWIRDQ 207
H PV+ GEKWVATKW+R +
Sbjct: 265 HAGAPVLSGEKWVATKWMRQR 285
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 104/206 (50%), Gaps = 31/206 (15%)
Query: 14 PNFA------SAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 66
PN A S E+C +I ++ ++K SQ+ R+ G + ES+ R S G+ E++
Sbjct: 121 PNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESS--VRKSEGSHFERGENE 178
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
++ IE +++ LP GE +L Y G +Y +H D F P + G + QR
Sbjct: 179 --LVRRIEARLSALVDLPVNRGEPLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGGQR 236
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ + ++YL+DV EGGET FP IG KP +G + F +G +
Sbjct: 237 IGTVVMYLNDVPEGGETAFP---------------DIGFSAKPIKGSAVYFEYQNADGQL 281
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D LH PVI+G+KW+ TKW+R++
Sbjct: 282 DYRCLHAGMPVIRGDKWIMTKWLRER 307
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEDVIRNRTSEGIWYQRGEDA 175
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 121
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 176 --FIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQR 233
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+DV +GGET+FP GL V ++G + F + +
Sbjct: 234 VATLVIYLNDVPDGGETIFPEA---------------GLSVAAKQGGAVYFRYMNGQRQL 278
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV G+KW+ TKW+R++
Sbjct: 279 DPLTLHGGAPVRAGDKWIMTKWMRER 304
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 107/206 (51%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 66
P + NF +AE+C +IA A+ +++ + + GE V+ RTS + +E
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQD--RTSMNAAFARAEHP 148
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QR 121
++ +E +IA A P +GE VLRY G +Y +H+D F+ G + + QR
Sbjct: 149 --LIARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQR 206
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ +FL+YL DV+ GG T FP + +++P++G L F + PNG
Sbjct: 207 VGTFLVYLCDVDAGGATRFP---------------ALNFEIRPKKGMALFFANTLPNGEG 251
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
+ +LH PV+ G K++A+KW+R++
Sbjct: 252 NPLTLHAGVPVVSGVKYLASKWLREK 277
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 107/208 (51%), Gaps = 19/208 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+Q +SW+PRA+ + NF S ++ + II A +++K S + + E V RTS GTF+
Sbjct: 41 IQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGV--VDDIRTSYGTFL 98
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++D ++ IE ++A + +P +H E VLRY KY H D +
Sbjct: 99 RRAQDP--VIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHIDGL----------E 146
Query: 121 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCIG-LKVKPRRGDGLLFYSLFP 177
R+A+ L+YL E G + P ++ + G + KP+RGD L+F+ + P
Sbjct: 147 RVATVLMYLVG-ESPGPDLAPVSACECMYAEQSNPSACAKGHVAYKPKRGDALMFFDVKP 205
Query: 178 N-GTIDRTSLHGSCPVIKGEKWVATKWI 204
+ T D S+H CPV+ G KW A KWI
Sbjct: 206 DYTTTDGHSMHTGCPVVAGVKWNAVKWI 233
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 107/206 (51%), Gaps = 25/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASED 65
PR N + E+C+S+ + L + G E VES+ TRT++ ++ +
Sbjct: 88 PRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESS--TRTNTAAWLEYHQG 145
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM----SQR 121
++ +E+ +A+ T +GE +L Y+ Q++ H+D F+PA P+ R
Sbjct: 146 P--VVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFDPATDPPENFEPGGNR 203
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
LA+ ++YL + EEGGET D+ K I KVKP G +LFY L P+G++
Sbjct: 204 LATAIIYLQNAEEGGET--------------DFMK-IDTKVKPEAGSAVLFYDLKPDGSV 248
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D+ ++H P GEKWVATKWI ++
Sbjct: 249 DKLTIHSGNPPKGGEKWVATKWIHER 274
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 104/198 (52%), Gaps = 28/198 (14%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
N S ++C+ +IA A+ RL+ + +G + RTS G F + +E ++ IE
Sbjct: 102 NILSTQECEELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTLNE--VPLVGRIE 157
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS---QRLASFLLYL 129
++A +P +HGE +L Y GQ+Y+ H+D F+P + YG + QR+AS ++YL
Sbjct: 158 QRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMYL 217
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
+ GG T FP +GL V RRG + F + G D +SLH
Sbjct: 218 NTPARGGGTAFP---------------ELGLTVTARRGSAVYFA--YEGG--DPSSLHAG 258
Query: 190 CPVIKGEKWVATKWIRDQ 207
PV+ GEKW+ATKW+R++
Sbjct: 259 LPVLDGEKWIATKWLRER 276
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 26 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDSMCLR 78
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 79 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 136
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 137 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 181
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 182 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 210
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 99/199 (49%), Gaps = 25/199 (12%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKTGILELI 73
N A +C+++I AK RL PS L G V S K R S G F E+ ++ +
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDK--RASWGMFFRLCEND--LVARL 162
Query: 74 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 128
+ +++ LP +GE ++L Y G + H+D P + S QR+++ + Y
Sbjct: 163 DRRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTY 222
Query: 129 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
L+D EGG+T+FP +GL V P RG+ F NG +D SLH
Sbjct: 223 LNDAPEGGQTVFPQ---------------LGLAVSPIRGNACYFEYCDGNGRVDARSLHA 267
Query: 189 SCPVIKGEKWVATKWIRDQ 207
S PV +G+KWV TKW+R++
Sbjct: 268 SAPVTRGDKWVMTKWMRER 286
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 121 RLASFLLYLSDVEEGGETMFPFE 143
R+A+ L+YL+D GGET FP E
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQE 198
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 168
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 169 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 226
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 227 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 271
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 272 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 300
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 148
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 206
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|410637601|ref|ZP_11348175.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
gi|410142794|dbj|GAC15380.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
Length = 280
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 108/205 (52%), Gaps = 25/205 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
R + + NF +A++C++++A K +L+PS++ R+G+ KG RTSS + ++D
Sbjct: 84 RVQMIKIDNFLTAQECEALVALTKSKLRPSEIPEREGDQY---KGFRTSSTCDLPFTKDP 140
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQR 121
+ I+ KI A L E Y IGQ++ +H D F P Y QR
Sbjct: 141 --LAHEIDQKIVDALGLGVGEKEVIQAQHYAIGQEFKAHCDYFVPGSKDFKTYSKDGGQR 198
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+F++YL+++ EGGET F +G+K KP++G L++ +L +G+I
Sbjct: 199 TWTFMIYLNELCEGGETEFV---------------KLGIKFKPKQGTALVWNNLHEDGSI 243
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRD 206
+ +LH + P+ GEK V TKW R+
Sbjct: 244 NEDTLHHAHPIESGEKVVITKWFRE 268
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S E+C ++IA A+ L S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDEECDALIALARPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ G+KWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + F S E+C ++IA A+ L S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDEECDALIALAQPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ G+KWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +SW PR + F S +C ++ AKK+++ S +A + G++V+S RTSSG F
Sbjct: 45 VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMF 102
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T LPQ + E VLRYE GQKY+ H+D F+ +
Sbjct: 103 LDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGG 160
Query: 120 QRLASFLLYLSDVEEGGETMFPFENG 145
R A+ L+YLS V EGGET+FP G
Sbjct: 161 HRYATVLMYLSTVREGGETVFPNAKG 186
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 100/208 (48%), Gaps = 25/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + S ++C ++IA A+ +L S+ + + E RTS + +D
Sbjct: 96 PRVVVLGGLLSDDECDALIALARPQLARSR-TVDNRDGSEIVHAARTSHSMALQPGQD-- 152
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+ + IE +IAR P HGE VLRY G +Y HYD F P G + QR+
Sbjct: 153 ALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRV 212
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ E GG T FP + L V +G+ + F P+ +
Sbjct: 213 ASLVMYLNTPERGGATRFP---------------DVHLDVAAVKGNAVFFSYDRPH-PMT 256
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
RT LH PV+ GEKWVATKW+R++ H
Sbjct: 257 RT-LHAGAPVLAGEKWVATKWLRERPLH 283
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 105 bits (263), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 108/219 (49%), Gaps = 31/219 (14%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGT 58
++ +S PR N + E+C +++ A ++ + L G + VEST TRT+
Sbjct: 75 IETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVEST--TRTNKQA 132
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
++ +D +++ +E KIA+ T GE VL Y Q++ H+D F+PA P+
Sbjct: 133 WLDFQQDD--VVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPEN 190
Query: 119 ----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
RL + ++YL EEGGET F N LK+ +GD ++FY+
Sbjct: 191 YEKGGNRLITVIVYLQAAEEGGETHFGAAN---------------LKLTAAKGDAVMFYN 235
Query: 175 L------FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
L +D+ +LH P IKGEKWVATKWI ++
Sbjct: 236 LKHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHER 274
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 105 bits (263), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 95/193 (49%), Gaps = 23/193 (11%)
Query: 18 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 77
+ E CQ++IA + L+P+ + Q E G R S + D IL+ + I
Sbjct: 73 TPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDYPILQSLAEGI 129
Query: 78 ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYLSDVEE 134
A+ T +P E +L Y G +Y HYDAF A P + Q R A+ +LYL+ VEE
Sbjct: 130 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQATLILYLNAVEE 187
Query: 135 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 194
GGET FP +GL+V P G G+ F +L G SLH PV K
Sbjct: 188 GGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 232
Query: 195 GEKWVATKWIRDQ 207
GEKW+AT+WIR +
Sbjct: 233 GEKWIATQWIRQE 245
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 104/206 (50%), Gaps = 23/206 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+VL P + F + S+ + + A+ L+ S + ++ V+ R S+GT++
Sbjct: 312 MEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERS-MVVKYRANVQGKH--RISAGTWV 368
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ + IE +IA L E F V+ Y IG +Y +H+D F
Sbjct: 369 ERKYN--NLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVE---DN 423
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
RLA+ L Y++DVE+GG T+FP +G V+ +RG+ L +Y++ NGT
Sbjct: 424 RLATVLFYMNDVEQGGATVFP---------------RLGQTVRAKRGNALFWYNMQHNGT 468
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRD 206
+D +LHG CP++ G KW+ T+WI D
Sbjct: 469 VDDRTLHGGCPILVGSKWIFTQWISD 494
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 98/197 (49%), Gaps = 35/197 (17%)
Query: 20 EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 73
E+C +I + +L+ S TV+ G R+S GTF + D + +
Sbjct: 109 EECDELIRRSADKLQRST-------TVDPVNGGYEVIAARSSEGTFFPVNADD--FIARL 159
Query: 74 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLY 128
+ +IA P +GE VL Y G +Y H+D F+P + G + QR+++ L+Y
Sbjct: 160 DRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIY 219
Query: 129 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
L+DV +GG T+FP +GL+V PR+G + F +G +D +LHG
Sbjct: 220 LNDVAQGGATVFP---------------TLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHG 264
Query: 189 SCPVIKGEKWVATKWIR 205
PV KGEKW+ TKW+R
Sbjct: 265 GEPVEKGEKWIITKWMR 281
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 106/203 (52%), Gaps = 21/203 (10%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LSW PR + F S +C +I+ A + +Q V S G I
Sbjct: 62 LSWHPRVFLYEGFLSDMECDHLISMAHGK--------KQSSLVVGGSAGNNSQGASI--- 110
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
ED I+ IE +I+ + LP+ GE+ +L+YE+ + ++Y++ + + + RL
Sbjct: 111 EDT--IVSTIEDRISVWSFLPKDFGESMQILKYEVNKSDYNNYESQSSSGH-----DRLV 163
Query: 124 SFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+ L+YLSDV+ GGET FP G ++ +C G V+P RG+ +L ++L P+G I
Sbjct: 164 TVLMYLSDVKRGGETAFPRSELKGTKVELAAP-SECAGYAVQPVRGNAILLFNLKPDGVI 222
Query: 182 DRTSLHGSCPVIKGEKWVATKWI 204
D+ S + C V++GE+W+A K I
Sbjct: 223 DKDSQYEMCSVLEGEEWLAIKHI 245
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 100/211 (47%), Gaps = 20/211 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++ S +PR + + N + E+ ++ A+ RL+ S + E TK R + F+
Sbjct: 335 MELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTK-YRIAKAAFL 393
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
SE + + +I T L T E V Y IG Y+ HYD E
Sbjct: 394 QNSEHDHIVK--MTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQKDFGW 451
Query: 120 -QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+++ Y+SDVE GG T+FP I L + P++G +++L PN
Sbjct: 452 GNRIATWMFYMSDVEAGGATVFP---------------QINLALWPQKGSAAFWFNLHPN 496
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 497 GEGDDLTQHAACPVLTGSKWVSNKWIHERNQ 527
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 100/209 (47%), Gaps = 28/209 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P F S ++C ++I AK RL+ ++ G + RTS G F E
Sbjct: 95 PPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDG--AQQIDEHRTSDGMFFGLGEQP- 151
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQRL 122
++E IE +IA +P HGE VL Y GQ+Y+ H D F+P + G QR+
Sbjct: 152 -LIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQDWFDPTQPGYAAITATGGQRI 210
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ + GG T FP IGL V RG + F + +G D
Sbjct: 211 ASLVIYLNTPDAGGGTAFPE---------------IGLTVTALRGSAVCFT--YESG--D 251
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
SLH PV +GEKW+ATKW+R++ E
Sbjct: 252 VFSLHAGLPVTRGEKWIATKWLRERPYRE 280
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 101/212 (47%), Gaps = 20/212 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + ++ +S+ TA+ R+K S + G + RTS G
Sbjct: 319 LEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQGASF 378
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEYGPQ 117
+ S + +L+ H + + L + E V Y IG Y+ H+D+F + + G
Sbjct: 379 NYS--RNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQEGDL 436
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A+ + YLSDVE GG T FPF + L V P RG L +Y+L P
Sbjct: 437 HGNRIATAIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWYNLHP 481
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 482 SGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 98/199 (49%), Gaps = 35/199 (17%)
Query: 20 EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 73
++C+ +IA A+ RL PS TV+ G R+S G F E+ + +
Sbjct: 111 QECEELIALARPRLAPST-------TVDPLSGRDLVGEQRSSLGMFFRLREN--AFIARL 161
Query: 74 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 128
+ +++ LP +GE VL Y G + H+D P+ + S QR+++ + Y
Sbjct: 162 DQRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSY 221
Query: 129 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
L++VEEGGET+FP +C G V PRRG + F G +D SLH
Sbjct: 222 LNEVEEGGETIFP--------------EC-GWSVPPRRGSAVYFEYCNSLGQVDHASLHA 266
Query: 189 SCPVIKGEKWVATKWIRDQ 207
PV+ GEKWVATKW+R +
Sbjct: 267 GGPVLHGEKWVATKWMRQR 285
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 121
+E ++ +I+ P +GE +LRY +Y H+D F P + G Q QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQR 218
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+DV +GGET+FP G+ V +G + F + +
Sbjct: 219 VATLVIYLNDVPDGGETIFPEA---------------GMSVAASQGGAVYFRYMNGRRQL 263
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 264 DPLTLHGGAPVLSGDKWIMTKWMRER 289
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 85/146 (58%), Gaps = 5/146 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 59
++ +SW PR + F S +C ++ AKK+ + S +A + G++V+S RTSSG F
Sbjct: 45 VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSE--VRTSSGMF 102
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ +D ++ IE +IA T LPQ + E VLRYE GQKY+ H+D F+ +
Sbjct: 103 LDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGG 160
Query: 120 QRLASFLLYLSDVEEGGETMFPFENG 145
R A+ L+YLS V EGGET+FP G
Sbjct: 161 HRYATVLMYLSTVREGGETVFPNAKG 186
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 104/202 (51%), Gaps = 13/202 (6%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
L+W PR + F S +C ++ A+ ++ S L + T+ + + F +
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNI--TQNSTDARFKF-QLA 119
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
+ K ++ IE +I+ + +P+ HGE+ +L+Y Q D + D + G RL
Sbjct: 120 DSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 174
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYD---YKKCIGLKVKPRRGDGLLFYSLFPNGT 180
+ L+YLSDV++GGET+FP D+ +C G VKP +GD +L ++L P+G
Sbjct: 175 TILMYLSDVKQGGETVFPRSE--LKDTQAKEGALSECAGYAVKPVKGDAILLFNLRPDGV 232
Query: 181 IDRTSLHGSCPVIKGEKWVATK 202
D S + C V++GEKW+A K
Sbjct: 233 TDSDSHYEDCSVLEGEKWLAIK 254
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 101/206 (49%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRS-TTVNPETGKEDVIRNRTSEGIWYQRGED- 162
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 121
+E ++ +I+ P +GE +L Y +Y H+D F P + G Q QR
Sbjct: 163 -AFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 221
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+DV +GGET+FP G+ V R+G + F + +
Sbjct: 222 VATLVIYLNDVPDGGETIFPEA---------------GISVAARQGGAVYFRYMNGQRQL 266
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 267 DPLTLHGGAPVLGGDKWIMTKWMRER 292
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/193 (35%), Positives = 94/193 (48%), Gaps = 23/193 (11%)
Query: 18 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 77
+ E CQ++IA + L+P+ + Q E G R S + D IL+ + I
Sbjct: 21 TPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDHPILQSLAEGI 77
Query: 78 ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYLSDVEE 134
A+ T +P E +L Y G +Y HYDAF A P + Q R + +LYL+ VEE
Sbjct: 78 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQGTLILYLNAVEE 135
Query: 135 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 194
GGET FP +GL+V P G G+ F +L G SLH PV K
Sbjct: 136 GGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 180
Query: 195 GEKWVATKWIRDQ 207
GEKW+AT+WIR +
Sbjct: 181 GEKWIATQWIRQE 193
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 99/211 (46%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
P + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 5/136 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 123 ASFLLYLSDVEEGGET 138
A+ L+YLS+VE+GGET
Sbjct: 175 ATVLMYLSNVEKGGET 190
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 99/211 (46%), Gaps = 37/211 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
P + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+AS ++YL+ E GG T FP + L V +G+ + F
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+ SLH PV+ GEKWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 108/209 (51%), Gaps = 20/209 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ + +P+ + F + S + + + AK L+ + +A +Q E +K + S F
Sbjct: 322 LEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSSWF- 380
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ + I ++A T L E V+ Y +G +YD H+D F+ + +
Sbjct: 381 --PDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFHWGKL--KEVN 436
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L Y+SDV GG T+FP +G+ ++ R+G +Y+L +G
Sbjct: 437 RIATVLFYMSDVSIGGATVFP---------------KLGVTLEARKGTAAFWYNLHSSGE 481
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+D ++LHG+CPV+ GEKWVA KWIR++ Q
Sbjct: 482 LDYSTLHGACPVLIGEKWVANKWIRERGQ 510
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 110/213 (51%), Gaps = 23/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++ P + + + S + + AK +LK +++ T + +K TRT+ +
Sbjct: 281 MELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSK-TRTAKLAWF 339
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ ++ + E + +I T E V+ Y +G Y H+D FN + GP ++Q
Sbjct: 340 LDTFNQ--LTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTK-GPHITQ 396
Query: 121 ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ L YL+DVE+GG T+FP + KK V P+RG +++Y+L
Sbjct: 397 INGDRIATVLFYLNDVEQGGATVFP-----------EIKKA----VFPKRGSAIMWYNLK 441
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G +R +LH CPVI G KWV KWIR++EQ
Sbjct: 442 DDGEGNRDTLHAGCPVIVGSKWVCNKWIREREQ 474
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 112/241 (46%), Gaps = 44/241 (18%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTS 55
++ LS PR NF E+ SII A L +Q A R + TKG TRTS
Sbjct: 207 LETLSHSPRVFSLYNFMDMEEADSIIEDA---LGMTQEAYRLKRSSTGTKGKAISKTRTS 263
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTH---GEAFNVLRYEIGQKYDSHYDAFNPA 112
F++ T + ++ +I + + + H + VLRY Q Y +H+D A
Sbjct: 264 DNAFVT----HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLESA 319
Query: 113 EYGPQMSQ-----RLASFLLYLSDVEEGGETMFPFENGI-----------------FLD- 149
E S+ R A+ +LY +DV EGGET+F GI LD
Sbjct: 320 EGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLENLDL 379
Query: 150 --SGYDYKKCIGLK----VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 203
SG++ K + + V P+RG +LFY+ P+G D +S HG+CPVI G+KW A W
Sbjct: 380 PRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAANLW 439
Query: 204 I 204
+
Sbjct: 440 V 440
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 100/206 (48%), Gaps = 23/206 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 121
+E ++ +I+ P +GE +L Y +Y H+D F P + G Q QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 218
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ ++YL+DV +GGET+FP G+ V +G + F + +
Sbjct: 219 VATLVIYLNDVPDGGETIFPEA---------------GMSVAASQGGAVYFRYMNDRRQL 263
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG PV+ G+KW+ TKW+R++
Sbjct: 264 DPLTLHGGAPVLAGDKWIMTKWMRER 289
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 20/212 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + A+ +S+ TA+ R+K S + G + RTS G
Sbjct: 319 LEELHLDPLLVQLHQVIGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQGASF 378
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEYGPQ 117
+ S ++ +L+ H + + L + E V Y IG Y+ H+D+F + + G
Sbjct: 379 NYS--RSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQEGDL 436
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A+ + YLSDVE GG T FPF + L V P +G L +Y+L P
Sbjct: 437 HGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYNLHP 481
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 482 SGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/209 (32%), Positives = 101/209 (48%), Gaps = 21/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + F + S + SI AK L + + G E RT+ GT++
Sbjct: 276 MELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKDGSYEEDP--ARTTKGTWL 333
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ + +++ + T L + F VL Y IG Y +H+D E G S
Sbjct: 334 V---ENSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADTEMG-NFSN 389
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ + YLSDV +GG T+FP +GL V P++G LL+Y+L G
Sbjct: 390 RIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHKGD 434
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CP I G +WV TKWI ++EQ
Sbjct: 435 GDNRTAHSACPTIVGSRWVMTKWINEREQ 463
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 96/205 (46%), Gaps = 25/205 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F S +C ++IA A+ RL S+ + RTS + +D
Sbjct: 96 PRVMVLGGFLSDAECDAMIALAQPRLARSR-TVDNANGAHVVHAARTSDSMCLQLGQD-- 152
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+ + IE +IAR P +GE VLRY G +Y HYD F+P G + QR+
Sbjct: 153 ALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRV 212
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ + GG T FP + L + +G+ + F P+
Sbjct: 213 ASLVMYLNTPDRGGATRFPD---------------VHLDIAAIKGNAVFFSYDRPHPMT- 256
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQ 207
SLH PV+ GEKWVATKW+R++
Sbjct: 257 -RSLHAGAPVLAGEKWVATKWLRER 280
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 105/202 (51%), Gaps = 20/202 (9%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LSW PR + F S +C +++ + ++ S LA T G R SS I
Sbjct: 63 LSWHPRIFLYEGFLSDMECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI--- 110
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
ED ++ IE +I+ + LP+ +GE+ VL+Y + + + + RLA
Sbjct: 111 EDI--VVSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLA 163
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDY-KKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ L+YLSDV++GGET+FP + +C G V+P +G+ +L ++L P+G D
Sbjct: 164 TILMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETD 223
Query: 183 RTSLHGSCPVIKGEKWVATKWI 204
+ S + CPV++GEKW+A K I
Sbjct: 224 KDSQYEECPVLEGEKWLAIKHI 245
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 67/208 (32%), Positives = 100/208 (48%), Gaps = 25/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + ++C ++IA A+ +L S+ + + E RTS + +D
Sbjct: 98 PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIVHAARTSHSMALQPGQD-- 154
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+ + IE +IA+ P HGE VLRY G +Y HYD F P G + QR+
Sbjct: 155 ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRV 214
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ E GG T FP + L V +G+ + F P+ +
Sbjct: 215 ASLVMYLNTPERGGATRFP---------------DVHLDVAAVKGNAVFFSYDRPH-PMT 258
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
RT LH PV+ GEKWVATKW+R++ H
Sbjct: 259 RT-LHAGAPVLAGEKWVATKWLRERPLH 285
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 107/207 (51%), Gaps = 34/207 (16%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASE 64
R R F FAS E+C+ + K+RL+ + +A G + VE R S+ ++
Sbjct: 337 RQRLQVFRQFASPEECRHLQHAGKRRLERA-VAWTDGRFQPVE----FRISTAAWLQPDH 391
Query: 65 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD----AFNPAEYGPQMSQ 120
D I++ I +I AT + + EA + Y +G Y+ H+D NP +
Sbjct: 392 D--AIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDHSSRGTNPD------GE 443
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
RLA+F++YL+ V++GG T FP +G V+P GD + +Y+L P+G
Sbjct: 444 RLATFMIYLNPVKQGGFTAFPR---------------LGAAVQPGYGDAVFWYNLQPSGV 488
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D +LHG+CPV++G KWVA KWI ++
Sbjct: 489 GDPLTLHGACPVLRGSKWVANKWIHER 515
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 67/208 (32%), Positives = 100/208 (48%), Gaps = 25/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + ++C ++IA A+ +L S+ + + E RTS + +D
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIVHAARTSHSMALQPGQD-- 174
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+ + IE +IA+ P HGE VLRY G +Y HYD F P G + QR+
Sbjct: 175 ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRV 234
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ E GG T FP + L V +G+ + F P+ +
Sbjct: 235 ASLVMYLNTPERGGATRFPD---------------VHLDVAAVKGNAVFFSYDRPH-PMT 278
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
RT LH PV+ GEKWVATKW+R++ H
Sbjct: 279 RT-LHAGAPVLAGEKWVATKWLRERPLH 305
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 104/202 (51%), Gaps = 20/202 (9%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LSW PR + F S +C +++T + + S LA T G R SS I
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMD-SSLAF--------TDGDRNSSYNNI--- 356
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
ED ++ IE +I+ + LP+ +GE VL+Y + ++ + LA
Sbjct: 357 EDI--VVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLA 409
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDY-KKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ L+YLSDV++GGET+FP + +C G V+P +G+ LL ++L P+G ID
Sbjct: 410 TILIYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNALLLFNLRPDGEID 469
Query: 183 RTSLHGSCPVIKGEKWVATKWI 204
+ S + CPV++GEKW+A K I
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHI 491
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 93/203 (45%), Gaps = 25/203 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + F S +C I+A A RL S + RTS G F + E
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLARSH-TVDTATGASEVNAARTSDGMFFTRGEHP- 159
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 122
+ E +IA P +GE VL Y G +Y HYD F+P + G P + QR+
Sbjct: 160 -VCARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRV 218
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ + YL+ GG T FP IGL+V P +G + F P+ +
Sbjct: 219 ATLVTYLNTPTRGGGTTFP---------------DIGLEVTPLKGHAVFFSYDRPHPST- 262
Query: 183 RTSLHGSCPVIKGEKWVATKWIR 205
SLHG PV++G+KWVATKW+R
Sbjct: 263 -RSLHGGAPVLEGDKWVATKWLR 284
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 100/202 (49%), Gaps = 24/202 (11%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
L+W PR + F S +C ++ A+ ++ S L G R I+ +
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLV---------NAGARN-----ITQN 108
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
++ IE +I+ + +P+ HGE+ +L+Y Q D + D + G RL
Sbjct: 109 STDDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 163
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYD---YKKCIGLKVKPRRGDGLLFYSLFPNGT 180
+ L+YLSDV++GGET+FP D+ +C G VKP +GD +L ++L P+G
Sbjct: 164 TILMYLSDVKQGGETVFPRSE--LKDTQAKEGALSECAGYAVKPVKGDAILLFNLRPDGV 221
Query: 181 IDRTSLHGSCPVIKGEKWVATK 202
D S + C V++GEKW+A K
Sbjct: 222 TDSDSHYEDCSVLEGEKWLAIK 243
>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
Length = 239
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 105/208 (50%), Gaps = 23/208 (11%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LS P + +FA + C+ +I A+ L +++ R+G + R +S +++A
Sbjct: 31 LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEVQTRRGSAART--AIRRASSCWLAAR 88
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYE--IGQKYDSHYDAFNPAEYGPQMS-Q 120
D+ LE +E I P+ E F+V+RY G++Y +H DAF + Q
Sbjct: 89 GDEA--LEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQ 146
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
RL + LLYLSDV GG T+FP +GL V PRRG L+F ++ + T
Sbjct: 147 RLTTALLYLSDVGAGGATVFP---------------ALGLSVAPRRGRLLVFANVADDTT 191
Query: 181 IDRTSLHGSCPVI-KGEKWVATKWIRDQ 207
+D ++H P+ EKW+A KW+R++
Sbjct: 192 VDARTVHAGEPIAGDTEKWIANKWVRER 219
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 100/213 (46%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + + +S+ TA+ R+K S + G + RTS G
Sbjct: 319 LEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 378
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----GP 116
+ S + +L+ H + + L + E V Y IG Y+ H+D+F P + G
Sbjct: 379 NYS--RNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQEGD 435
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ + YLSDVE GG T FPF + L V P +G L +Y+L
Sbjct: 436 LHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYNLH 480
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 481 PSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 102 bits (255), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 67/205 (32%), Positives = 104/205 (50%), Gaps = 26/205 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR F + S +C ++IA ++ RL+ S++ +G E TRTS G + + E+
Sbjct: 126 PRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSG-EFVDDTRTSYGAYFNKGENS- 183
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMS--QRL 122
++ I+ +IA T P TH E +L Y +G +Y H+D F P + G P S QR+
Sbjct: 184 -LVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESGGQRI 242
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF-YSLFPNGTI 181
A+ ++YL+DVE GG T+FP N L+ +PR+G + F Y L +I
Sbjct: 243 ATVVMYLNDVEAGGGTIFPHLN---------------LETRPRKGGAIYFSYQLAVARSI 287
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRD 206
+ + I KW+AT+W RD
Sbjct: 288 RSRCM--AARRIARRKWIATQWFRD 310
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 100/213 (46%), Gaps = 23/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++ P + + + S + + AK LK + + T + K F+
Sbjct: 318 MELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVKTRTAKVAWFL 377
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
T E + +I T E V+ Y +G Y H+D FN P +SQ
Sbjct: 378 DTFNQLT---ERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTT-NPHISQ 433
Query: 121 ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ L YL+DVE+GG T+FP + KK V P+RG +++Y+L
Sbjct: 434 INGDRIATVLFYLNDVEQGGATVFP-----------EIKKA----VFPKRGSAIMWYNLK 478
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G +R +LH +CPVI G KWV KWIR++EQ
Sbjct: 479 DDGEGNRDTLHAACPVIVGSKWVCNKWIREREQ 511
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 103/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RPR + F S E+ +++ AK RL + + Q + +T R S ++S E+
Sbjct: 342 RPRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKL-TTAHYRVSKSAWLSGYENP 400
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
I+ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 401 --IVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 455
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V PR+G + +Y+LFP+
Sbjct: 456 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPS 499
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 530
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 91/200 (45%), Gaps = 20/200 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + NF S+E+C + A+ P+ + + V + S +A +
Sbjct: 95 PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQLPAAGSE-- 152
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
++ +E +I R T P E + RY GQ Y HYD F Q QRLA+ +L
Sbjct: 153 -LVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGGQRLATLIL 211
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL E GG T F +G+++ PR+G L F +P+ + +LH
Sbjct: 212 YLRAPEAGGATYF---------------ANLGMRIAPRKGSALFF--TYPDPGNNSGTLH 254
Query: 188 GSCPVIKGEKWVATKWIRDQ 207
G V+ GEKW+AT+W RD+
Sbjct: 255 GGEAVLAGEKWIATQWFRDR 274
>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 108/214 (50%), Gaps = 21/214 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSII-ATAKKRLKPSQLALRQGETVE---STKGTRTSS 56
++VLS PRA NF S + I+ T +L S A T + ST+ TRTS
Sbjct: 3 LKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRTSL 62
Query: 57 GTFISASEDKTGILELIEHKIARATMLPQTH-GEAFNVLRYEIGQKYDSHYDAFNP---A 112
T++ +K+ I++ I + A ++ + EA ++ Y++GQ+Y +H+D +P
Sbjct: 63 NTWVY--REKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDIDN 120
Query: 113 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
EY P R + LLYL++ EGG T FP + + GL V+P+ G +LF
Sbjct: 121 EYQPA---RYCTLLLYLNEGMEGGATQFP--------RWVNAETRNGLDVEPKIGKAVLF 169
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
YS P+G +D S H + PV GEKW+ W D
Sbjct: 170 YSQLPDGNMDDWSHHAAMPVRVGEKWLMNLWTWD 203
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 99/211 (46%), Gaps = 22/211 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++L P + + + S + + + A RLK +++ + Q RTS T++
Sbjct: 314 MELLQLDPYMVLYHDAISPREIEDLQFLAMPRLKRAKV-VDQVTHRNMMVKERTSKVTWL 372
Query: 61 SASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
D T + + +I + E V+ Y +G Y SHYD N
Sbjct: 373 G---DATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGGHYASHYDFLNATSKTRLNG 429
Query: 120 QRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ + YLSDVE+GG T+FP + +F P+RG +++Y+L N
Sbjct: 430 DRIATVMFYLSDVEQGGATVFPKIQKAVF----------------PQRGTAIIWYNLKEN 473
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++H +CPVI G KWV KWIR+ EQ
Sbjct: 474 GDFDTNTIHAACPVIVGSKWVCNKWIRENEQ 504
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 98/199 (49%), Gaps = 31/199 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+ + + + S + +++ A+ L SQ G V S RTS F+ ++
Sbjct: 308 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 357
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
G + I +IA T L E +V Y IG +Y H+D G ++++R A+FL+
Sbjct: 358 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 411
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
Y+SDVE GG T+F +G+ VKP +G + +Y+L NG +D + H
Sbjct: 412 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 456
Query: 188 GSCPVIKGEKWVATKWIRD 206
CPV+ G KWVA KWI +
Sbjct: 457 AGCPVLVGNKWVANKWIHE 475
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 98/199 (49%), Gaps = 31/199 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+ + + + S + +++ A+ L SQ G V S RTS F+ ++
Sbjct: 326 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 375
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
G + I +IA T L E +V Y IG +Y H+D G ++++R A+FL+
Sbjct: 376 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 429
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
Y+SDVE GG T+F +G+ VKP +G + +Y+L NG +D + H
Sbjct: 430 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 474
Query: 188 GSCPVIKGEKWVATKWIRD 206
CPV+ G KWVA KWI +
Sbjct: 475 AGCPVLVGNKWVANKWIHE 493
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 101/218 (46%), Gaps = 19/218 (8%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P F ++ I+ + L PS + L+ G RTS+ ++ +S
Sbjct: 3 PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHP- 61
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP------------AEYG 115
+++ I+ + A +P +H E+ VLRYE Q YD H D F+ EYG
Sbjct: 62 -VVQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYG 120
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDGLLFYS 174
R+ + Y+SDV +GG T F G+ S K C G+ V P++ ++FYS
Sbjct: 121 --YKNRMITVFWYMSDVAKGGHTNFARSGGLPRPSSN--KDCSQGISVAPKKRKVVVFYS 176
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 212
+ PNG D SLH CPV +G K KWI ++ + +D
Sbjct: 177 MLPNGEGDPMSLHAGCPVEEGIKLSGNKWIWNKPRSDD 214
>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 750
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 112/229 (48%), Gaps = 48/229 (20%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 72
F +F SA +C ++A A L+ S++ G+ E RTSS TF++ + + ++
Sbjct: 534 FDHFLSAVECDDLVAIAAPDLRRSRVT--DGKLSEG----RTSSSTFLTGCKQEEPLVRA 587
Query: 73 IEHKIARA----TMLP---------QTHG--------------------EAFNVLRYEIG 99
IE ++ RA T++ + HG E V+RY G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647
Query: 100 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 159
Q Y +HYD +R A+F++YL+DV GG T FP + + G G
Sbjct: 648 QMYTAHYDNKQGC------LRRTATFMMYLTDVHSGGATHFPRAVPVSMRDGC--GDAAG 699
Query: 160 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
+++ P+RG L+F+S+ G D SLH + PVI+GEKW+ATKW+R+ E
Sbjct: 700 IRIWPKRGRALVFWSV-SGGIEDVRSLHEAEPVIEGEKWIATKWLREDE 747
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 23/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++S P + + + S + + + A LK + + +Q K TRTS T++
Sbjct: 323 MELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVK-TRTSKVTWL 381
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYGPQ 117
+ ++ I + +I T E V+ Y +G YD HYD FN A+
Sbjct: 382 LDTLNQLTIR--LNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLTRL 439
Query: 118 MSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ L YL+DVE+GG T+FP E +F P+ G +++Y+L
Sbjct: 440 NGDRIATVLFYLTDVEQGGATVFPNIEKAVF----------------PKSGTAVVWYNLR 483
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH +CPVI G KWV KWIR+++Q
Sbjct: 484 HDGNGDPQTLHAACPVIVGSKWVCNKWIRERQQ 516
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 101/213 (47%), Gaps = 23/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTF 59
+Q +S PRA + S ++C ++I A+ RL S + G+ V + S +F
Sbjct: 125 VQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCSCASF 184
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEY 114
E+ + + + I + AR P H E RY G+++ H D F N +
Sbjct: 185 --PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLNNDKI 242
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
R+A+ LLYL++VE GG T FP G +V+P++G L F
Sbjct: 243 MGSSGHRIATVLLYLNEVEAGGATFFPNP---------------GFEVRPQKGGALYFAY 287
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+G++D TSLH C V +GEKW+AT W R++
Sbjct: 288 QQADGSMDPTSLHEGCAVTQGEKWIATLWFRER 320
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 78/133 (58%), Gaps = 5/133 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF
Sbjct: 79 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTF 136
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DK I+ IE +IA + +P HGE VL YE+GQKY+ HYD F
Sbjct: 137 LARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGG 194
Query: 120 QRLASFLLYLSDV 132
QR+A+ L+YL+DV
Sbjct: 195 QRIATVLMYLTDV 207
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 99/208 (47%), Gaps = 25/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + ++C ++IA A+ +L S+ + + E RTS + +D
Sbjct: 98 PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIVHAARTSHSMALQPGQD-- 154
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+ + IE +IA+ P HGE VLRY G +Y HYD F P G + QR+
Sbjct: 155 ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRV 214
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ E GG T P + L V +G+ + F P+ +
Sbjct: 215 ASLVMYLNTPERGGATRVPD---------------VHLDVAAVKGNAVFFSYDRPH-PMT 258
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
RT LH PV+ GEKWVATKW+R++ H
Sbjct: 259 RT-LHAGAPVLAGEKWVATKWLRERPLH 285
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 99/208 (47%), Gaps = 25/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + ++C ++IA A+ +L S+ + + E RTS + +D
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIVHAARTSHSMALQPGQD-- 174
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
+ + IE +IA+ P HGE VLRY G +Y HYD F P G + QR+
Sbjct: 175 ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRV 234
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
AS ++YL+ E GG T P + L V +G+ + F P+ +
Sbjct: 235 ASLVMYLNTPERGGATRVPD---------------VHLDVAAVKGNAVFFSYDRPH-PMT 278
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
RT LH PV+ GEKWVATKW+R++ H
Sbjct: 279 RT-LHAGAPVLAGEKWVATKWLRERPLH 305
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/209 (30%), Positives = 100/209 (47%), Gaps = 22/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++L P + + + S + I+ A++R+ + + T + TRT+ G ++
Sbjct: 312 MELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTAMGAWL 368
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S + + I ++ + L E V+ Y IG Y H D F ++ M
Sbjct: 369 KRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVMGN 424
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
RLA+ L YL+DVE+GG TMF KV PRRG L +Y+L +G
Sbjct: 425 RLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLHTDGE 469
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D ++ H +CP+I G KWV T+WIR++ Q
Sbjct: 470 GDWSTTHAACPIIVGSKWVLTQWIRERNQ 498
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 104/211 (49%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RPR + F + S E+ + + +K RL+ + ++ +E T R S ++S E+
Sbjct: 343 RPRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYENP 401
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 402 --VVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 456
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 457 -NRIATWLFYMSDVAAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 500
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 501 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 531
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 100/212 (47%), Gaps = 21/212 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + + SA + A+ L+ SQ+ R G + RTS GT
Sbjct: 324 LEELHLDPYIIQVHDVISARDTAELQHLARPELQRSQVYSRTGHE-HISANFRTSQGTTF 382
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM- 118
++ I++ + H +A + L E + Y IG Y+ H D+F + +Y M
Sbjct: 383 EYTDHP--IMQKMSHHVAEISGLDMRSAEPLQIANYGIGGHYEPHMDSFPDSYDYSLNMY 440
Query: 119 -SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ RLA+ + YLS+VE GG T FPF + L V P RG L +Y+L P
Sbjct: 441 KTNRLATGIYYLSNVEAGGGTAFPF---------------LPLLVTPERGSLLFWYNLHP 485
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV++G KW+A WIR Q
Sbjct: 486 SGDADYRTKHAACPVLQGSKWIANVWIRLSNQ 517
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V PR+G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFP 492
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 98/209 (46%), Gaps = 21/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + F + S + SI K +L + + G E RT+ GT++
Sbjct: 189 MELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGTWL 246
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ +++ + T + F VL Y IG Y H+D AE S
Sbjct: 247 V---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NFSD 302
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ + YLSDV +GG T+FP +GL V P++G LL+Y+L G
Sbjct: 303 RIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHKGD 347
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CP + G +WV TKWI ++EQ
Sbjct: 348 GDNRTAHSACPTVVGSRWVMTKWINEREQ 376
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 97/199 (48%), Gaps = 31/199 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P+ + + + S + +++ A+ L SQ G V S RTS F+ D+
Sbjct: 53 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVISE--IRTSQSVFL----DEV 102
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
G + I +IA T L E +V Y IG +Y H+DA G +++R A+FL+
Sbjct: 103 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDA------GGDVNERTATFLI 156
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
Y+SDVE GG T+F +G+ VKP +G + + +L NG +D + H
Sbjct: 157 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWNNLHKNGELDLKTKH 201
Query: 188 GSCPVIKGEKWVATKWIRD 206
CPV+ G KWVA KWI +
Sbjct: 202 AGCPVLVGNKWVANKWIHE 220
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 96/209 (45%), Gaps = 26/209 (12%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LS P + F + + +++ AK ++ + + G RTS TF+ +
Sbjct: 330 LSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVT---GHNSSVVSNARTSQFTFLPKT 386
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQ 117
K +L I+ ++A T L + E + Y IG Y H D F P + P+
Sbjct: 387 RHK--VLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETKQVSNPE 444
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
M R+ + L YLSDVE+GG T FP + ++P++ +Y+L
Sbjct: 445 MGNRIGTVLFYLSDVEQGGATAFP---------------ALKQLLRPKKHAAAFWYNLHA 489
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
+G D ++HG+CP+I G KWV +WIR+
Sbjct: 490 SGVGDARTMHGACPIIVGSKWVLNRWIRE 518
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 98/209 (46%), Gaps = 21/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + F + S + SI AK L + + G E RT+ GT++
Sbjct: 312 MELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQDGNDKEDP--ARTTKGTWL 369
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ + +++ + T + F VL Y IG Y +H+D E G S
Sbjct: 370 V---ENSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFDFLEDTEMG-HFSD 425
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ + YLSDV +GG T FP +GL V P +G LL+Y+L G
Sbjct: 426 RIATAVFYLSDVPQGGATTFP---------------DLGLSVFPEKGAALLWYNLDHKGV 470
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CP I G +WV TKWI ++EQ
Sbjct: 471 GDNRTAHSACPTIVGSRWVMTKWINEREQ 499
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 99/213 (46%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 319 LEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 378
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----GP 116
+ S + +L+ + + L + E V Y IG Y+ H+D+F P + G
Sbjct: 379 NYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQEGD 435
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ + YLSDVE GG T FPF + L V P RG L +Y+L
Sbjct: 436 LHGNRMATGIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWYNLH 480
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 481 PSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 105/213 (49%), Gaps = 25/213 (11%)
Query: 1 MQVLSWRP-RALYFPNFASAE--QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 57
M+++ P +Y +SAE + + + + KR + +L + E V+ TRTS
Sbjct: 320 MEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVK----TRTSKV 375
Query: 58 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 117
+ S + + + +I T + E ++ Y +G YD HYD FN E
Sbjct: 376 AWFPDSYNSLTLR--LNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSS 433
Query: 118 MS-QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
++ R+A+ L Y+SDVE+GG T+FP I V P+RG +++Y+L
Sbjct: 434 LTGDRIATVLFYMSDVEQGGATVFP---------------NIYKTVYPQRGTAVMWYNLK 478
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH +CPV+ G KWV KWIR++ Q
Sbjct: 479 DDGQPDEQTLHAACPVLVGSKWVCNKWIRERAQ 511
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 104/215 (48%), Gaps = 27/215 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+Q ++ P + + + S ++ +II+ +K + S + + V T RTSS ++
Sbjct: 309 LQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDDHEKAVSKT---RTSSNAWL 365
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-- 118
+ ++ + + T L T E V Y IG Y HYD + AE G ++
Sbjct: 366 D--DVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYD-YAVAEEGKEVYP 422
Query: 119 ----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
R+A+ + YLSDV GG T+FP +GL V P++G + +Y+
Sbjct: 423 SIGKGNRIATVMYYLSDVAIGGATVFP---------------QLGLGVFPQKGSAIFWYN 467
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
L NGT+D +LHG+CPV G KWV KWI ++ Q
Sbjct: 468 LHANGTVDHRTLHGACPVFVGSKWVGNKWIHERGQ 502
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 100 bits (248), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 102/211 (48%), Gaps = 20/211 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++ P + + + SA++ + + A L + + E K TRTS +
Sbjct: 322 MELVGLDPYMVLYHDVLSAKEIKELQGMATPGLTRATVFQASSGRNEVVK-TRTSKVAWF 380
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGPQM 118
S + + + +IA T E ++ Y +G YD HYD FN +
Sbjct: 381 PDSYNPLTVR--LNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNTINSNLTAMS 438
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L YL+DVE+GG T+FP + +K V P+RG +++Y+L N
Sbjct: 439 GDRIATVLFYLTDVEQGGATVFP-----------NIRKA----VFPQRGSVIMWYNLQDN 483
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D +LH +CPVI G KWV KWIR++EQ
Sbjct: 484 GQTDNKTLHAACPVIVGSKWVCNKWIREREQ 514
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 100 bits (248), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/209 (30%), Positives = 100/209 (47%), Gaps = 22/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++L P + + + S + I+ A++R+ + + T + TRT+ G ++
Sbjct: 337 MELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTALGAWL 393
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S + + I ++ + L E V+ Y IG Y H D F ++ M
Sbjct: 394 KRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVMGN 449
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
RLA+ L YL+DVE+GG TMF KV PRRG L +Y+L +G
Sbjct: 450 RLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLHTDGE 494
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D ++ H +CP+I G KWV T+WIR++ Q
Sbjct: 495 GDWSTTHAACPIIVGSKWVLTQWIRERNQ 523
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 100 bits (248), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 98/209 (46%), Gaps = 21/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + F + S + SI K +L + + G E RT+ GT++
Sbjct: 312 MELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGTWL 369
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ +++ + T + F VL Y IG Y H+D AE S
Sbjct: 370 V---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NFSD 425
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ + YLSDV +GG T+FP +GL V P++G LL+Y+L G
Sbjct: 426 RIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHKGD 470
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CP + G +WV TKWI ++EQ
Sbjct: 471 GDNRTAHSACPTVVGSRWVMTKWINEREQ 499
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 100 bits (248), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 103/213 (48%), Gaps = 26/213 (12%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + + + + ++ +K +K + + + RTS+ +++
Sbjct: 379 ELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLT 438
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+ E+ ++E +E ++ T + E + ++ Y IG Y H D F PQ+ R
Sbjct: 439 SHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TPQLEHR 492
Query: 122 -----LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+A+ L YLSDV +GG T+FP N + V+PR+GD LL+Y+L
Sbjct: 493 GGGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 537
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G + ++H SCP+IKG KW KWI + Q
Sbjct: 538 DRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 570
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 99.8 bits (247), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 99/203 (48%), Gaps = 21/203 (10%)
Query: 4 LSWRPRALYFPNFASAEQCQSI--IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+S P + + + + Q ++I I+ +K P+ L G E+T+ + T++
Sbjct: 295 ISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQVSIFCCSTWLE 354
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+ D ++E I T L + E V Y IG Y HYD+ A P QR
Sbjct: 355 DAYDP--VVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPEDPL--QR 410
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
LA+ + YLS+VE GG T+FP +G+ V+P++G L + +L NG
Sbjct: 411 LATMMFYLSNVEIGGATIFPR---------------LGVAVRPQKGSALFWINLKRNGLT 455
Query: 182 DRTSLHGSCPVIKGEKWVATKWI 204
+R +LH +CPV+ G KW+A KWI
Sbjct: 456 NRQTLHAACPVVIGSKWIANKWI 478
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 109/245 (44%), Gaps = 44/245 (17%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ---GETVESTKGTRTSSG 57
++ LS PRA F F + E+C +I + LK S + GE RTS+G
Sbjct: 83 VEKLSDSPRAYLFREFLTKEECAHLIEISTPHLKRSTVVGDDALLGEADGRRSDYRTSTG 142
Query: 58 TFISASEDKTGILELIEHKIARATMLP---QTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 114
F+ D ++ +E ++ + LP Q +A ++LRYE+GQ+Y H D F
Sbjct: 143 AFLPKLYDD--VVTRVERRVEAFSRLPFENQEQLQARSLLRYELGQEYRDHVDGFATENG 200
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFP---------------FENGIFLDSGYDYKKCI- 158
G +R+A+ L++L++ EEGGET FP G D +
Sbjct: 201 G----KRVATVLMFLAEPEEGGETAFPNGEPSEAVAARVAAQRARGELSDCAWRGGGGGT 256
Query: 159 ---------GLKVKPRRGDGLLFYSLFPNGT-------IDRTSLHGSCPVIKGEKWVATK 202
G VKPR GD +LF+S + + S H SCP +G KW ATK
Sbjct: 257 AGGGRGNLRGFAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTHASCPTTRGVKWTATK 316
Query: 203 WIRDQ 207
WI ++
Sbjct: 317 WIHER 321
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 99/213 (46%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + + +S+ +A+ +K S + G + RTS G
Sbjct: 319 LEELHLDPLVVQLHQVIGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQGASF 378
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----GP 116
+ S K +L+ H + + L + E V Y IG Y+ H+D+F P + G
Sbjct: 379 NYS--KNAATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQEGD 435
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ + YLSDVE GG T FPF + L V P +G L +Y+L
Sbjct: 436 LHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYNLH 480
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 481 PSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 93/202 (46%), Gaps = 23/202 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR F + +C +++A A+ RL S + + E+ RTS G E
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPV-INPDTGDENLIEARTSLGAMFQVGEHP- 189
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 122
++E IE IA T + GE +L Y+ G +Y HYD FNP G QR+
Sbjct: 190 -LIERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRV 248
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ ++YL+ GG T FP +GL+V P +G+ + F +G +D
Sbjct: 249 GTLVIYLNSPLAGGATAFPK---------------LGLEVAPVKGNAVYFSYRKSDGALD 293
Query: 183 RTSLHGSCPVIKGEKWVATKWI 204
+LH PV GEKW+ATKW+
Sbjct: 294 ERTLHAGLPVEAGEKWIATKWL 315
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 449
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 80/158 (50%), Gaps = 18/158 (11%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 111
RTS GT+I D + + IE +I L + E F V+ Y +G Y +H D +
Sbjct: 345 RTSKGTWIE--RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDT 402
Query: 112 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 171
+ R+A+ L YL+DVE+GG T+F N V P+RG L
Sbjct: 403 WADKKEEDDRIATVLFYLTDVEQGGATVFTILNQ---------------AVSPKRGTALF 447
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+Y+L NGT D +LHG CPV+ G KW+ T WIR++ Q
Sbjct: 448 WYNLHRNGTGDTRTLHGGCPVLVGSKWIMTLWIRERMQ 485
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 101/210 (48%), Gaps = 37/210 (17%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 61
PR + S E+C +++ ++ RL R+ TV++ G RTS GTF
Sbjct: 102 PRVVVLGGLLSDEECDALVELSRPRL-------RRSTTVDAQTGGSQVHADRTSRGTFFE 154
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 118
+ IE +IAR P +GE VL Y G ++ HYD F+P E G ++
Sbjct: 155 RGAHP--VCATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLLR 212
Query: 119 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
QR+A+ ++YL+ GG T FP + L+V +G+ + F
Sbjct: 213 QGGQRVATVVMYLNTPARGGATTFPDAH---------------LEVAAVKGNAVFFSYDR 257
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
P+ + RT LHG PV +GEKW+ATKW+R+
Sbjct: 258 PH-PMTRT-LHGGAPVTEGEKWIATKWLRE 285
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 99/212 (46%), Gaps = 20/212 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 193 LEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 252
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEYGPQ 117
+ S + +L+ + + L + E V Y IG Y+ H+D+F + + G
Sbjct: 253 NYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDL 310
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A+ + YL+DVE GG T FPF + L V P RG L +Y+L P
Sbjct: 311 HGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWYNLHP 355
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 356 SGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 387
>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
Length = 311
Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 108/207 (52%), Gaps = 19/207 (9%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLALRQGETVESTKGTRTSSGTFISA 62
+SWRPR + F S E+C +I+ A PS+ + G TV + SSG ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D I+ IE+++A T+LP+ H F +++Y G++ Y N + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK-----VKPRRGDGLLFYSLFP 177
A+ +LYLSD GGE +FP +S K G + ++P +G+ +LF+S+
Sbjct: 173 ATVVLYLSDSASGGEILFP-------ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHL 225
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWI 204
N + D++S H P+ GE WVATK++
Sbjct: 226 NASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 102/209 (48%), Gaps = 20/209 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + + + + ++ +K +K + + + RTS+ +++
Sbjct: 316 EILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLT 375
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQ 120
+ E+ ++E +E ++ T + E + ++ Y IG Y H D F P G
Sbjct: 376 SHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQHRGG--GD 431
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L YLSDV +GG T+FP N + V+PR+GD LL+Y+L G
Sbjct: 432 RIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLNDRGQ 476
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H SCP+I+G KW KWI + Q
Sbjct: 477 GEIGTVHTSCPIIQGSKWALVKWIDELSQ 505
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 372
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 373 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 429
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 430 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 472
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 473 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 504
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 99/213 (46%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 319 LEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 378
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----GP 116
+ S + +L+ + + L + E V Y IG Y+ H+D+F P + G
Sbjct: 379 NYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQEGD 435
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ + YL+DVE GG T FPF + L V P RG L +Y+L
Sbjct: 436 LHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWYNLH 480
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 481 PSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 289 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 345
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 346 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 402
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP
Sbjct: 403 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 445
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 446 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 477
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 99.4 bits (246), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)
Query: 89 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 146
E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461
Query: 147 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
I V P+RG +++Y+L NG ID +LH +CPVI G KWV KWIR+
Sbjct: 462 ----------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511
Query: 207 QEQ 209
+EQ
Sbjct: 512 REQ 514
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 105/212 (49%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V PR+G + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFP 492
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVFNKWLHERGQ 524
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 51/126 (40%), Positives = 70/126 (55%), Gaps = 18/126 (14%)
Query: 87 HG-EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 143
HG E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 HGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRIATVLFYLTDVEQGGATVFP-- 464
Query: 144 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 203
I V P+RG +++Y+L +G ID +LH +CPVI G KWV KW
Sbjct: 465 -------------NIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKWVCNKW 511
Query: 204 IRDQEQ 209
IR++EQ
Sbjct: 512 IREREQ 517
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 70/123 (56%), Gaps = 17/123 (13%)
Query: 89 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 146
E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461
Query: 147 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
+ +K V P+RG +++Y+L NG ID +LH +CPVI G KWV KWIR+
Sbjct: 462 ------NIRKA----VFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511
Query: 207 QEQ 209
+EQ
Sbjct: 512 REQ 514
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 50/124 (40%), Positives = 68/124 (54%), Gaps = 17/124 (13%)
Query: 88 GEAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENG 145
E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 406 SEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP---- 461
Query: 146 IFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
I V P+RG +++Y+L NG ID +LH +CPVI G KWV KWIR
Sbjct: 462 -----------NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIR 510
Query: 206 DQEQ 209
++EQ
Sbjct: 511 EREQ 514
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 103/213 (48%), Gaps = 23/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++ P + + + SA + + A LK + + G E K TRTS +
Sbjct: 323 MELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVK-TRTSKVAWF 381
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ ++ + E + +IA T E + Y +G YD HYD FN A ++Q
Sbjct: 382 PDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFN-ASTATNLTQ 438
Query: 121 ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ L YL+DVE+GG T+FP I V P+RG +++Y+L
Sbjct: 439 MNGDRIATVLFYLTDVEQGGATVFP---------------NIRKAVFPQRGSAIIWYNLK 483
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G + +LH +CPV+ G KWV KWIR++ Q
Sbjct: 484 DDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQ 516
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 449
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 86/176 (48%), Gaps = 24/176 (13%)
Query: 41 RQGETVESTKGT---RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 97
R G + ST RTS FI+A+ K +L I+ ++A T L + E + Y
Sbjct: 362 RAGVVINSTSTVSKKRTSQHIFIAATRHK--VLRTIDQRVADMTNLNMQYAEDHQLADYG 419
Query: 98 IGQKYDSHYDAFNPAEYG----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD 153
IG Y H+D F ++ +M R+A+ L YLSDV +GG T FP +
Sbjct: 420 IGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPILKQL------- 472
Query: 154 YKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+KP++ +Y+L +G D +LHG CP+I G KWV +WIR+ +Q
Sbjct: 473 --------LKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWIREYDQ 520
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 100/211 (47%), Gaps = 20/211 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++ P + + + S ++ + + A LK + + E K TRTS +
Sbjct: 113 MELVGLDPYMVLYHDVLSPKEIKELQGMATPSLKRATVYQASSGRNEVVK-TRTSKVAWF 171
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGPQM 118
+ + + +I+ T E ++ Y +G YD HYD FN +
Sbjct: 172 PDGYNPLTVR--LNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMS 229
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L YL+DVE+GG T+FP I V P+RG +++Y+L N
Sbjct: 230 GDRIATVLFYLTDVEQGGATVFP---------------NIRKAVFPQRGSVVMWYNLKDN 274
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G ID +LH +CPVI G KWV KWIR++EQ
Sbjct: 275 GQIDTQTLHAACPVIVGSKWVCNKWIREREQ 305
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 102/209 (48%), Gaps = 26/209 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF- 59
+ +LS P + F NF S E+ +I+ AK + S + + RTSS +
Sbjct: 62 ITMLSEDPPVIQFNNFISQERIDAILHFAKPKFARSTSGIER-----EVSNYRTSSTAWM 116
Query: 60 ---ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 116
+ ++ L+ +E +IAR LP + E F VL+Y+ Q Y H D P
Sbjct: 117 LPDVLGNDPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDYIEEQRQQP 176
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+F LYL+DVEEGG T FP + L V+P +G+ +L+YS +
Sbjct: 177 -CGIRVATFFLYLNDVEEGGGTRFP---------------NLNLTVQPAKGNAVLWYSAY 220
Query: 177 PNGT-IDRTSLHGSCPVIKGEKWVATKWI 204
PN T +D + H + PV KG K+ A KWI
Sbjct: 221 PNTTRMDSRTDHEAMPVAKGMKYGANKWI 249
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 103/213 (48%), Gaps = 23/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++ P + + + SA + + A LK + + G E K TRTS +
Sbjct: 288 MELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVK-TRTSKVAWF 346
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
+ ++ + E + +IA T E + Y +G YD HYD FN A ++Q
Sbjct: 347 PDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFN-ASTAANLTQ 403
Query: 121 ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ L YL+DVE+GG T+FP I V P+RG +++Y+L
Sbjct: 404 MNGDRIATVLFYLTDVEQGGATVFP---------------NIRKAVFPQRGSAIIWYNLK 448
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G + +LH +CPV+ G KWV KWIR++ Q
Sbjct: 449 DDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQ 481
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 100/211 (47%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P + + + S ++ +++ AK R K + +R +T E S + SE+
Sbjct: 335 KPMIVVYHDVMSDDEIETVKKMAKPRFKRA--TIRNSKTGELEPANYRISKSAWLKSEEH 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
IL+ + ++ T L + E V+ Y IG Y+ H+D AF +G
Sbjct: 393 DHILK-VTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTEAFKELGWG--- 448
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDVE GG T+FP G V PR+G +Y+L+PN
Sbjct: 449 -NRIATWLFYMSDVEAGGATVFP---------------PTGAAVWPRKGSAAFWYNLYPN 492
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G + + H +CPV+ G KWV+ +WI + Q
Sbjct: 493 GKGNELTRHAACPVLSGSKWVSNRWIHEHRQ 523
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 104/211 (49%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RPR + + + S + + + AK RL+ + ++ +E T R S +++A ED
Sbjct: 345 RPRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLE-TAHYRISKSAWLTAYEDP 403
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
+++ I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 404 --VVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 458
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L+Y+SDV GG T+F +G V P++G + +Y+LFP+
Sbjct: 459 -NRIATWLIYMSDVPSGGATVF---------------TDVGAAVWPKKGSAVFWYNLFPS 502
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 503 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 533
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 100/198 (50%), Gaps = 29/198 (14%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
NF + E+C ++I +AK + P+ + G +RTSS ++ A ED L +
Sbjct: 8 NFLTEEECDALIDSAKDHMTPAPVV---GPGNGEVSVSRTSSTCYL-ARED----LPSVC 59
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 129
K+ T P H E V RY G+ Y HYDAF+ + + QR+A+ L+YL
Sbjct: 60 TKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATVLVYL 119
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
+DVE GGET F +G+++KPR+G+ L+F+ +G +D+ LH +
Sbjct: 120 NDVERGGETSF---------------SKLGVRIKPRKGNALIFFPATLDGVLDQNYLHAA 164
Query: 190 CPVIKGEKWVATKWIRDQ 207
P + KWV+ WIR +
Sbjct: 165 EPAVD-PKWVSQIWIRQR 181
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 75/130 (57%), Gaps = 5/130 (3%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF
Sbjct: 78 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTF 135
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
++ DK I+ IE KIA T +P HGE VL YE+GQKY+ HYD F
Sbjct: 136 LARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGG 193
Query: 120 QRLASFLLYL 129
QR+A+ L+YL
Sbjct: 194 QRIATVLMYL 203
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 83/139 (59%), Gaps = 5/139 (3%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 62
+S +PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169
Query: 123 ASFLLYLSDVEEGGETMFP 141
A+ LLYL+DV EGGET+FP
Sbjct: 170 ATVLLYLTDVPEGGETVFP 188
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 107/251 (42%), Gaps = 55/251 (21%)
Query: 5 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKG-------TRTSS 56
S PR Y NF SA + + + P ++A G T ++ +G TRTS
Sbjct: 202 SSEPRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSE 261
Query: 57 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------- 109
F ++ + + ++ R + + +LRY++GQ Y +H+D F
Sbjct: 262 NAFDITTKQSFDVKKRA-FRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKD 320
Query: 110 ---NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP------------------------- 141
+P G S R A+ LYLSDV GG+T+FP
Sbjct: 321 FNWDPLSGG---SNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESPSASE 377
Query: 142 ----FENGIFLDSGYD---YKKCI-GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVI 193
N ++ ++ KC V PRRGD +LFYS P+G +D SLHG+CP++
Sbjct: 378 LKEFVSNAGLMEGSWEDNLIHKCYEKFAVPPRRGDAILFYSQRPDGLLDTNSLHGACPIL 437
Query: 194 KGEKWVATKWI 204
G KW A W+
Sbjct: 438 NGTKWGANLWV 448
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 83/146 (56%), Gaps = 3/146 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 121 RLASFLLYLSDVEEGGETMFPFENGI 146
R+A+ L+YLSD E GET FP +
Sbjct: 200 RIATMLMYLSDNIERGETYFPLAGSV 225
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 103/202 (50%), Gaps = 29/202 (14%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTS-SGTFISASEDKTGIL 70
F F + ++C +I+ + L+P+ + R G + RTS G F A ED ++
Sbjct: 126 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--VRTSDGGIFGPAREDL--VI 181
Query: 71 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 129
+ I +IA A+ + GE +LRY +GQ+Y H+D P + +QR + L+YL
Sbjct: 182 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 235
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
++ GGET+FP +GL VK R+GD LLF + G ++H
Sbjct: 236 NEGYAGGETIFPR---------------LGLSVKGRKGDALLFRNTDAQGQAAEAAVHLG 280
Query: 190 CPVIKGEKWVATKWIRDQEQHE 211
PV+ G+KW+ T+WIR ++H+
Sbjct: 281 APVMAGQKWLCTRWIR-HDRHD 301
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 105/208 (50%), Gaps = 27/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSSGTFISASEDK 66
P + + N S ++ + II +K LK S + GE+ + RTS +++ + +
Sbjct: 13 PLIVIYHNAISDKEIEQIIQVSKPMLKRSMV----GESFSKEVSNERTSQNAWLADYDFE 68
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYGPQ---MSQR 121
+++++ + T L + E+ V Y IG Y H+D N E + + R
Sbjct: 69 --LVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPYKDMGLGNR 126
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ + YLSDVE+GG T+FP IG+ V P++G + +Y+L P+GT
Sbjct: 127 IATLMYYLSDVEQGGATVFP---------------QIGVGVFPKKGSAIFWYNLLPDGTG 171
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D +LHG+CPV+ G KWVA KWI Q
Sbjct: 172 DERTLHGACPVLLGSKWVANKWIHQYHQ 199
>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
Length = 190
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 97/211 (45%), Gaps = 31/211 (14%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED-- 65
PR S +C II K ++ S + G+ T TRTS ++ S
Sbjct: 1 PRVFLVREMLSEFECDHIIELGTKVVRKSMV----GQGGGFTSKTRTSENGWLRRSASPI 56
Query: 66 ------KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
+ G + I+H + R+ + E V+RY+ Q+Y H+D + PQ
Sbjct: 57 LENIYKRFGDVLGIDHDLLRSG----KNAEELQVVRYDRSQEYAPHHDFGDDGT--PQ-- 108
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
QR + LLY+ EEGG T FP N +G++V P RGD +LFYS+ P+G
Sbjct: 109 QRFLTLLLYIQLPEEGGATSFPKAN-----------DGMGVQVVPARGDAVLFYSMLPDG 157
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
D +LH PV KG+KWV W+ D +H
Sbjct: 158 NADDLALHAGMPVRKGQKWVCNLWVWDPHRH 188
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 97/209 (46%), Gaps = 32/209 (15%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+VL +P + F + S + + A LK + + S KGTRTS G ++
Sbjct: 297 MEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGIWL 356
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S S + + + I +I+ T + V+ Y + Y H D FN AE
Sbjct: 357 SRSHN--NLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYALHTDYFNTAE------- 407
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
LSDVE+GG+T+FP F KP RG LL+Y+L NGT
Sbjct: 408 --------LSDVEQGGDTVFPRIEQAF---------------KPERGKALLWYNLHRNGT 444
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D+ + HG+CPV+ G KW+ T+WI ++ Q
Sbjct: 445 GDKRTEHGACPVLVGSKWIMTQWINERPQ 473
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 15/134 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 116 PQMSQRLASFLLYL 129
QR+A+ L+YL
Sbjct: 208 KNGGQRIATLLMYL 221
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RPR + + S + +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 RPRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLE-TAPYRISKSAWLSGYEHS 394
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
T +E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 395 T--IERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+F +G V P++G + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVF---------------TDVGAAVWPKKGTAVFWYNLFPS 493
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 524
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 104/206 (50%), Gaps = 24/206 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + N S ++ + I AK RL ++ +R +T V +T R S ++ +D
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRL--ARATVRDPKTGVLTTAPYRVSKSAWLEGEDDP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
+++ + +I T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 393 --VIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRPFDSNLKVDGNRLA 449
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FL Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G D
Sbjct: 450 TFLNYMSDVEAGGATVFP-----------DF----GASIWPRKGTAVFWYNLFRSGEGDY 494
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 RTRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 103/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP + + + S ++ +++ AK RL+ + ++ +E T R S +++ E
Sbjct: 348 RPYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLE-TASYRISKSAWLTGYEHP 406
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++E+I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 407 --VIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 461
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF N
Sbjct: 462 -NRIATWLFYMSDVAAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFAN 505
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 506 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 536
>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
Length = 235
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 97/204 (47%), Gaps = 26/204 (12%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 72
+ + +C +I A+ L+PS++ G + + T RTS I T +
Sbjct: 52 YEKVVTQTECHQLIDLARHGLQPSKVI---GNSEQKTSAVRTSDT--IGFQHHLTELTLQ 106
Query: 73 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQRLASFLL 127
I +IA LP + E + RY++G K+++H+D FNP+ Y + QR+ + LL
Sbjct: 107 ICKRIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMYLSENGQRIITALL 166
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT-SL 186
YL++V GGET FP N ++V P G L+F + N S+
Sbjct: 167 YLNNVSAGGETSFPLLN---------------IQVAPSEGTLLVFENCKKNSNERHALSI 211
Query: 187 HGSCPVIKGEKWVATKWIRDQEQH 210
H C V +GEKW+AT W ++ Q+
Sbjct: 212 HEGCAVHEGEKWIATLWFHEKSQY 235
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 103/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RPR + + + ++ + I +K RL+ + ++ +E T R S +++A E
Sbjct: 341 RPRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLAAYEHP 399
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
+++ I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 400 --VVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 454
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G VKP +G + +Y+LFP+
Sbjct: 455 -NRIATWLFYMSDVAAGGATVFP---------------EVGAAVKPLKGTAVFWYNLFPS 498
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 499 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 529
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/212 (28%), Positives = 101/212 (47%), Gaps = 22/212 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ +P+ + F + + + + A A RL+ + + +E + R S ++S
Sbjct: 327 EVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAE-YRISKSAWLS 385
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 119
ED ++ + H+I + T L E V Y +G Y+ H+D E S
Sbjct: 386 --EDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEINAFKSLN 443
Query: 120 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A+FL Y+SDVE GG T+FP +G ++ P +G +Y+L
Sbjct: 444 TGNRIATFLFYMSDVEAGGATVFP---------------QVGARLIPEKGSAAFWYNLLK 488
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 489 NGEGDYSTRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 104/206 (50%), Gaps = 25/206 (12%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LSWRPR + F + E+C +I+ A + + KG + + +++S
Sbjct: 61 LSWRPRVFLYKGFLTDEECDRLISLA-----------HGAKEISKGKGDGSRNNIQLASS 109
Query: 64 EDKTGI----LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 119
E ++ I L IE +I+ T +P+ + + V+ Y I + + H+D F+ +S
Sbjct: 110 ESRSHIYDDLLARIEERISAWTFIPKENSKPLQVMHYGIEEARE-HFDYFDNKTLISNVS 168
Query: 120 QRLASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+A+ +LYLS+V GGE +FP ++ ++ D D ++P +G+ +L ++
Sbjct: 169 L-MATLVLYLSNVTRGGEILFPKSELKDKVWSDCTKDSSI-----LRPVKGNAVLIFNAH 222
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATK 202
N + D S HG CPV++GE W ATK
Sbjct: 223 LNASADSRSTHGRCPVLEGEMWCATK 248
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 80/170 (47%), Gaps = 23/170 (13%)
Query: 43 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 102
G RTS TFI + K +L I+ ++A T L E + Y IG Y
Sbjct: 362 GNNASVVSNARTSQFTFIPKTRHK--VLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHY 419
Query: 103 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 156
H D F+P + +M R+A+ L YL+DVE+GG T FP +
Sbjct: 420 AQHMDWFSPNAFETKQVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQL---------- 469
Query: 157 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
+KP++ +Y+L +G D ++HG+CP+I G KWV +WIR+
Sbjct: 470 -----LKPKKYAAAFWYNLHASGAGDVRTMHGACPIIVGSKWVLNRWIRE 514
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 97.1 bits (240), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 103/209 (49%), Gaps = 26/209 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + + + S E+ + AK RL+ + ++ +E+ + R + ++S ED
Sbjct: 326 KPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQ-YRITKSAWLSGYEDP 384
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ------MSQ 120
++ + +I T L + E V Y IG +Y+ H+D +Y P
Sbjct: 385 --VVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLR--KYEPDAFKKLGTGN 440
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A++L Y+SDVE GG T+FP +G V P++G + +Y+L +G
Sbjct: 441 RVATWLFYMSDVEAGGATVFPE---------------VGAAVYPKKGTAVFWYNLLESGE 485
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 486 GDYSTRHAACPVLVGNKWVSNKWIHERGQ 514
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++V++ P + + S + +I A+ +K S + + E + R S +
Sbjct: 317 LEVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAVGDTRSEQISKI---RISQNAWF 373
Query: 61 SASEDKTGILELIEHKIAR--ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ- 117
D I+E + + AR A L + E V Y +G Y HYD A P
Sbjct: 374 ENEHDP--IVETLNQR-ARDMAGGLNEPSYELLQVNNYGLGGFYSIHYDWSTSANPFPNK 430
Query: 118 -MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
M R+A+ + YLSDV+EGG T+FP N L V+PR+G + +Y+L
Sbjct: 431 GMGNRIATLMFYLSDVQEGGSTVFPRLN---------------LAVRPRKGTAIFWYNLH 475
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG ++ +LH +CPV+ G KWVA KWI ++ Q
Sbjct: 476 RNGKGNKKTLHAACPVLIGSKWVANKWIHERHQ 508
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 101/202 (50%), Gaps = 20/202 (9%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P L F N S + +++ A+ RL + + +E R S ++ E +
Sbjct: 327 PDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKVAWLEDQEHQH 385
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
L ++ ++A T L + E F V+ Y IG Y+ H+D + + P + R+ + L
Sbjct: 386 --LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PAIGSRIETVLF 441
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YLSDVE+GG T+FP I + V P++G +++++L P+G D+ + H
Sbjct: 442 YLSDVEQGGATVFP---------------EIQVSVWPQKGSAVVWFNLHPSGDGDQRTKH 486
Query: 188 GSCPVIKGEKWVATKWIRDQEQ 209
CPV+ G KW+ATKWI ++ Q
Sbjct: 487 AGCPVLIGSKWIATKWIHERGQ 508
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 102/216 (47%), Gaps = 30/216 (13%)
Query: 5 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 64
S P + + + Q + + + +L S +G+ V S RTS ++ E
Sbjct: 346 SLDPMIVVLHDLITERQTEILRQLGEPKLATSLHRGGEGKFVRSM--IRTSKNAWLQEHE 403
Query: 65 DKTGILELIEHKIARATML---PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ---- 117
+ + L I H++ AT L P+T E F + Y IG Y +H D + P+
Sbjct: 404 NAS--LPAIRHRMELATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHPDVRPEDQDP 461
Query: 118 ----MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 173
+ R+A+ ++YLSDVE GG T+FP G+ PR+G ++
Sbjct: 462 WNLYVGDRIATLMVYLSDVEAGGATVFPRA---------------GVTCWPRKGSAAFWW 506
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+L+ +G D T+ HG+CPV+ G KWV+ KWIR +Q
Sbjct: 507 NLYKSGEPDLTTRHGACPVLHGSKWVSNKWIRQYDQ 542
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 102/210 (48%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V++ P + + AS + +I + ++ S + + V + RTS ++++
Sbjct: 314 EVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMVGDAAKKEVSKS---RTSQNSWLT 370
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 119
D + L A L +T E+ V Y IG Y HYD P+++
Sbjct: 371 -DYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGIGGHYLPHYDWSREENPYPELNTG 429
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ + YLSDVEEGG T+FP +G+ V P++G + +Y+L +G
Sbjct: 430 NRIATLMFYLSDVEEGGATVFPH---------------LGVGVFPKKGTAIFWYNLRASG 474
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D +LHG+CPV+ G KWVA KWI ++ Q
Sbjct: 475 KGDEKTLHGACPVLIGSKWVANKWIHERHQ 504
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 101/202 (50%), Gaps = 20/202 (9%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P L F N S + +++ A+ RL + + +E R S ++ E +
Sbjct: 321 PDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKVAWLEDQEHQH 379
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 127
L ++ ++A T L + E F V+ Y IG Y+ H+D + + P + R+ + L
Sbjct: 380 --LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PAIGSRIETVLF 435
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YLSDVE+GG T+FP I + V P++G +++++L P+G D+ + H
Sbjct: 436 YLSDVEQGGATVFPE---------------IQVSVWPQKGSAVVWFNLHPSGDGDQRTKH 480
Query: 188 GSCPVIKGEKWVATKWIRDQEQ 209
CPV+ G KW+ATKWI ++ Q
Sbjct: 481 AGCPVLIGSKWIATKWIHERGQ 502
>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
Length = 248
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 98/224 (43%), Gaps = 34/224 (15%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +S PR +F S + I+ + + + + TRTS T+I
Sbjct: 37 LRTVSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTRTSMNTWI 96
Query: 61 SASEDKTGILELIEHKIARATML---------PQTH---------GEAFNVLRYEIGQKY 102
E T I++ I ++A + P H E ++ Y+ G++Y
Sbjct: 97 YREE--TAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVHYDPGEEY 154
Query: 103 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 162
+H+D P R + LLYL+DVEEGGET FP + GL V
Sbjct: 155 TAHHDFGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP--------------RWGGLDV 200
Query: 163 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
KP +G +LFY L +G D S H + PVIKGEKW++ WI D
Sbjct: 201 KPVKGKAVLFYMLTADGNSDDLSQHAALPVIKGEKWMSNLWIWD 244
>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
[Cucumis sativus]
Length = 311
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 107/207 (51%), Gaps = 19/207 (9%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLALRQGETVESTKGTRTSSGTFISA 62
+SWRPR + F S E+C +I+ A PS+ + G TV + SSG ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++D I+ IE+++A T+LP+ H F +++Y G++ Y N + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK-----VKPRRGDGLLFYSLFP 177
A+ +LYLSD GGE +FP +S K G + ++P +G+ +L +S+
Sbjct: 173 ATVVLYLSDSASGGEILFP-------ESKVKSKFWSGRRKKNNFLRPVKGNAILXFSVHL 225
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWI 204
N + D++S H P+ GE WVATK++
Sbjct: 226 NASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 74/134 (55%), Gaps = 15/134 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 55
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 56 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
SG F+ DK ++ +IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 116 PQMSQRLASFLLYL 129
QR+A+ L+YL
Sbjct: 209 KNGGQRMATLLMYL 222
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 97/213 (45%), Gaps = 20/213 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS P S++ + I A+ ++K S + G RTS G
Sbjct: 319 LEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQGASF 378
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEYGPQ 117
+ S + +++ + + L E V Y IG Y+ H+D+F + + G
Sbjct: 379 NYS--RNAATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSFPENHIYDEGDD 436
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A+ + YLSDVE GG T FPF + L V P +G L +Y+L
Sbjct: 437 RGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYNLHE 481
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
+G D + H +CPV++G KW+A WIR++ QH
Sbjct: 482 SGDQDYRTKHAACPVLQGSKWIANVWIRERNQH 514
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 103/212 (48%), Gaps = 22/212 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFI 60
++LS P + + + + + ++ +K +K + + V RTS+ ++
Sbjct: 316 EILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSNSVWL 375
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-- 118
++ E+ ++E +E ++ T + E + ++ Y IG Y H D F + P+
Sbjct: 376 ASHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQ-APEHRG 432
Query: 119 -SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A+ L YLSDV +GG T+FP N + V+PR+GD LL+Y+L
Sbjct: 433 GGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLND 477
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G + ++H SCP+I+G KW KWI + Q
Sbjct: 478 RGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 509
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP + + + S + I AK RL+ + ++ +E T R S +++A ED
Sbjct: 350 RPYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLE-TAPYRISKSAWLTAYEDP 408
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 409 --VVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 463
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 464 -NRIATWLFYMSDVSAGGATVFP---------------DVGASVGPQKGTAVFWYNLFAS 507
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 508 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 538
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 99/204 (48%), Gaps = 21/204 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 66
P + + + SA++ + + A R++ S + L G+ +S R S +++
Sbjct: 331 PYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKS--AFRVSKNAWLAYESHP 388
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASF 125
T +E + + AT L T+ E V Y +G Y+ H+D F +P Y + R+A+
Sbjct: 389 T--MEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATA 446
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
+ YLSDVE+GG T FPF + VKP+ G+ L +Y+L + +D +
Sbjct: 447 IFYLSDVEQGGATAFPF---------------LDFAVKPQLGNVLFWYNLHRSLDMDYRT 491
Query: 186 LHGSCPVIKGEKWVATKWIRDQEQ 209
H CPV+KG KW+ WI D Q
Sbjct: 492 KHAGCPVLKGSKWIGNVWIHDMTQ 515
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 103/209 (49%), Gaps = 28/209 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK RL ++ +R +T V +T R S ++ ED
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 394
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 122
+++ + +I T L E V Y +G +Y+ H+D E P +RL
Sbjct: 395 --VIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 450
Query: 123 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
A+FL Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G
Sbjct: 451 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTSVFWYNLFRSGE 495
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 496 GDYRTRHAACPVLVGSKWVSNKWIHERGQ 524
>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
Length = 226
Score = 95.9 bits (237), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 98/202 (48%), Gaps = 30/202 (14%)
Query: 11 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK-TGI 69
Y P+F A C ++A + S + ES + RTS S D+ +
Sbjct: 43 FYHPDFLDAATCDRLVALIDANRRRSTVLAE-----ESVQDFRTSD----SCDMDRWSPD 93
Query: 70 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLAS 124
+ + IA + HGE RY +GQ + +H+D FN A+ Y P+M QR +
Sbjct: 94 VRPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWT 153
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
++YL+DVEEGG T FP IG++V P++G L + ++ P+G +
Sbjct: 154 AMIYLNDVEEGGATWFP---------------TIGIRVAPKKGLLLTWNNMKPDGDRNTA 198
Query: 185 SLHGSCPVIKGEKWVATKWIRD 206
+LH PV++G K++ TKW R+
Sbjct: 199 TLHEGMPVVQGTKYIVTKWFRE 220
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 95.9 bits (237), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 103/202 (50%), Gaps = 29/202 (14%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTS-SGTFISASEDKTGIL 70
F F + ++C +I+ + L+P+ + R G + RTS G F A ED ++
Sbjct: 143 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--IRTSDGGIFGPAREDL--VI 198
Query: 71 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 129
+ I +IA A+ + GE +LRY +GQ+Y H+D P + +QR + L+YL
Sbjct: 199 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 252
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
++ GGET+FP +GL VK R+G+ LLF + G ++H
Sbjct: 253 NEGYAGGETIFPR---------------LGLSVKGRKGNALLFRNTDAQGQAAEAAVHLG 297
Query: 190 CPVIKGEKWVATKWIRDQEQHE 211
PV+ G+KW+ T+WIR ++H+
Sbjct: 298 APVMAGQKWLCTRWIR-HDRHD 318
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 95/195 (48%), Gaps = 23/195 (11%)
Query: 16 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 75
F+SAE C + + RL+PS + L RTS G +S E+ ++ ++
Sbjct: 138 FSSAE-CAYLQQMSAPRLRPSTI-LDPQTGARRPDPVRTSVGAALSPVEEDL-VVGMLNR 194
Query: 76 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 135
+IA AT + GE ++LRY Q+Y H+DA E +QR + ++YL+ EG
Sbjct: 195 RIAAATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE-----NQRSHTLIVYLTADYEG 249
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
GET FP +G +++ R+GD LLF +L +G D H P G
Sbjct: 250 GETAFPE---------------LGFRLRGRQGDALLFANLREDGRPDLRMRHAGLPATSG 294
Query: 196 EKWVATKWIRDQEQH 210
KW+AT+WIR + H
Sbjct: 295 AKWIATRWIRTRPYH 309
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 103/209 (49%), Gaps = 28/209 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + +F S E+ + I AK +L ++ +R ++ V +T R S ++ ED
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKL--ARATVRDPKSGVLTTASYRVSKSAWLEGEEDP 396
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 122
I+ + +I T L E V Y +G +Y+ H+D E P +RL
Sbjct: 397 --IIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 452
Query: 123 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
A+FL Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G
Sbjct: 453 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFKSGE 497
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 498 GDYRTRHAACPVLVGNKWVSNKWIHERGQ 526
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 105/217 (48%), Gaps = 26/217 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+ LS P +Y F +C+++I A+ R+K + ++L V ++G RT S ++
Sbjct: 50 METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGSSGV--SQG-RTGSNCWL 106
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-----PAEYG 115
E+ + I ++A+ P + E V+ Y Q+Y HYDA++
Sbjct: 107 RYQEEP--LARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCT 164
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
Q QR+ + LLYL++VEEGG T FP G++V PR+G +F ++
Sbjct: 165 RQGGQRMVTALLYLNEVEEGGATAFP---------------NAGVEVAPRKGRIAIFNNV 209
Query: 176 FPN-GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
+ G SLHG PV GEKW A+ W R + HE
Sbjct: 210 GADPGRPHPRSLHGGMPVKSGEKWAASIWFRARPAHE 246
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 114/209 (54%), Gaps = 23/209 (11%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET--VESTKGTRTSSGTFIS 61
LSW+PRA + F S E+C +I+ A K +LA G++ V + ++S G
Sbjct: 60 LSWQPRAFLYRGFLSDEECDHLISLALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYI 117
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQM 118
E + IE +I+ T LP+ + E V++Y+ QKY+ ++ + +++G +
Sbjct: 118 DDE----VAARIEKRISAWTFLPKENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL 172
Query: 119 SQRLASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
+A+ LL+LS+V GGE FP ++GI D + GL+ P +G+ +LF+++
Sbjct: 173 ---MATVLLHLSNVTRGGELFFPESESKSGILSDCT---ESSSGLR--PVKGNAILFFNV 224
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
PN + D++S + CPV++GE W ATK+
Sbjct: 225 HPNASPDKSSSYARCPVLEGEMWCATKFF 253
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + SA + +I A + +K +++ QG V RT+ G + ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 118
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDL 441
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFADVGYS--------VYPQAGTAIFWYNLDTN 486
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 103/210 (49%), Gaps = 28/210 (13%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----- 120
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTG 447
Query: 121 -RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +G
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 492
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 185 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 242
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 243 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 298
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 299 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 341
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 342 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 373
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 98/211 (46%), Gaps = 29/211 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + SA + +I+ A + +K +++ ET T RT+ G ++ ++
Sbjct: 327 PYVVLYHEVLSAREISMLISKAAQNMKNTRV---HRETKPKTNRGRTAKGHWLKKESNE- 382
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQ---- 120
+ I +I T E F V+ Y IG Y H D F+ A GP+ Q
Sbjct: 383 -LTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVL 441
Query: 121 --RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L YLSDVE+GG T+F +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLSDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDTD 486
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H SCPVI G KWV T+WIR+ Q
Sbjct: 487 GNGDPLTRHASCPVIVGSKWVMTEWIRESRQ 517
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 96/209 (45%), Gaps = 26/209 (12%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LS P + + + + +I +LK + + E+V S RTS TF+ +
Sbjct: 298 LSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATIT-STNESVVS--NVRTSQFTFLPVT 354
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQ 117
EDK +L I+ ++A T + E Y IG Y H D F + P+
Sbjct: 355 EDK--VLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGLVSSPE 412
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
M R+A+ L YLSDV +GG T FP + + +KP++ +Y+L
Sbjct: 413 MGNRIATVLFYLSDVTQGGGTAFPH---------------LRVLLKPKKYAAAFWYNLHA 457
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
+G D + HG+CP+I G KWV +WIR+
Sbjct: 458 SGVGDPRTQHGACPIISGSKWVQNRWIRE 486
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 383
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 384 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 439
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 440 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 482
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 483 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 514
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + SA + +I A + +K +++ QG V RT+ G + ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKATQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 118
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDL 441
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFADVGYS--------VYPQAGTAIFWYNLDTN 486
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTKHAACPVIVGSKWVMTEWIREKRQ 517
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 85/163 (52%), Gaps = 24/163 (14%)
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 111
TRTS T+++ S + + + +I+ T E V+ Y +G YD H+D FN
Sbjct: 372 TRTSKVTWLTDSLNPLTVR--LNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFN- 428
Query: 112 AEYGPQMSQ----RLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRR 166
A +++ R+A+ L YL+DVE+GG T+FP + IF P++
Sbjct: 429 ATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFPNIKQAIF----------------PKK 472
Query: 167 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G +++Y+L N D +LH +CPVI G KWV KWIR+ +Q
Sbjct: 473 GTAVMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWIREHQQ 515
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 394
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 395 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 450
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 451 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 493
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 508
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 104/217 (47%), Gaps = 26/217 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQGETVESTKGTRTSSGTF 59
M VLS PR +F S + + ++ A KR LK S + + TRTS+ +
Sbjct: 281 MTVLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTSTNDW 340
Query: 60 ISASED---------KTGILELIEH--KIARATMLPQ---TH---GEAFNVLRYEIGQKY 102
I +D +L++ E + R + +P+ +H E ++ Y++GQ+Y
Sbjct: 341 IPRHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVGQQY 400
Query: 103 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 162
H+D P Q S R A+ L YL+D +GGET FP + G LKV
Sbjct: 401 TPHHDFTMPGLVNMQPS-RFATLLFYLNDDMDGGETAFPRWLHADEEGG-------SLKV 452
Query: 163 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 199
KP +G +LFY+L P+G D S H + PV +GEKW+
Sbjct: 453 KPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKWL 489
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 254 KPRIIRFHDIISDAENEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 311
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 312 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 367
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 368 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 410
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 411 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 442
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 102/209 (48%), Gaps = 26/209 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVH-YRISKSAWLSGYEDP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 120
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 448
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +G
Sbjct: 449 RIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGE 493
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + + S + +++ AK RL S+ + ET + +T R S ++S ED
Sbjct: 293 KPRIVRYHEIISDAEIETVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 350
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 351 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 406
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 407 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 449
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 450 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 481
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 20/206 (9%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P+ F + + + + + A +L + + GE + +T R S ++S S+D
Sbjct: 325 KPKIYIFYDIVTDREIERLKELANPKLNRATVHGENGELLHAT--YRISKSGWLSGSDDP 382
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQRLA 123
G ++ I+ +I T L + E V+ Y IG +Y+ HYD E R++
Sbjct: 383 LGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDTFTSLGSGNRIS 442
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L+Y+SDVE+GG T+FP +G ++ P + +++L +G D
Sbjct: 443 TLLIYMSDVEKGGATVFP---------------GVGARLVPIKRAAAYWWNLKRSGDGDY 487
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
++ H CPV+ G KWV KWI ++ Q
Sbjct: 488 STRHAGCPVLVGSKWVCNKWIHERGQ 513
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 105/213 (49%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VLS RP + + +F S + + I A+ L+ S +A G+ ++T R S ++
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVA--TGDK-QATAEYRISKSAWLK 392
Query: 62 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
S T + ++ KI+ T L HGE V+ Y IG Y+ H+D A +P+ +
Sbjct: 393 GSAHST--VSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 450
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + + +++L
Sbjct: 451 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 495
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 496 RNGEGDADTLHAGCPVLIGDKWVANKWIHEYGQ 528
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F S + + + AK RL+ + ++ +E T R S ++S ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYEDP 395
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 396 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 450
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 451 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 494
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 102/211 (48%), Gaps = 32/211 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + N S + + I AK RL ++ +R +T V +T R S ++ ED
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 396
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++E + +I T L E + Y +G +Y+ H+D AF G
Sbjct: 397 --VIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTG--- 451
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +
Sbjct: 452 -NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIYPKKGTAVFWYNLFRS 495
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 496 GEGDYRTRHAACPVLVGCKWVSNKWIHERGQ 526
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 102/209 (48%), Gaps = 26/209 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E T R S ++S ED
Sbjct: 307 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLE-TVHYRISKSAWLSGYEDP 365
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 120
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 366 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 421
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +G
Sbjct: 422 RIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGE 466
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 467 GDYSTRHAACPVLVGNKWVSNKWLHERGQ 495
>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 213
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 93/201 (46%), Gaps = 24/201 (11%)
Query: 11 LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 69
++F S ++C +IA KPS + + T G R S T ++ S D I
Sbjct: 15 VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 71
Query: 70 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 126
+ I +I + + Q + E +L Y G KYD HYDAF ++ PQ+ RL + L
Sbjct: 72 ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 129
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
LYL+DVE GG T FP I + P G G+LF + R SL
Sbjct: 130 LYLNDVEYGGWTQFPH---------------IMANIVPNAGSGILFRNTDAQNRQLRESL 174
Query: 187 HGSCPVIKGEKWVATKWIRDQ 207
H PV GEKW+A+ WIR+
Sbjct: 175 HAGLPVTHGEKWIASIWIREN 195
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 104/203 (51%), Gaps = 13/203 (6%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
+SW+PR + F + E+C +I+ A+ + S+ +E + SS + ++
Sbjct: 60 TVSWQPRVFVYKGFLTDEECDHLISLAQGTKETSEGKDDDSGRIERNR-LFASSTSLLNM 118
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 122
++ IL IE +++ T+LP+ + + V+ Y I + +++D F +
Sbjct: 119 DDN---ILSRIEERVSAWTLLPKENSKPLQVMHYGI-EDAKNYFDYFGNKSAIISSEPLM 174
Query: 123 ASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
A+ + YLS+V +GGE FP +N I+ D I ++P +G+ +LF+++ PN
Sbjct: 175 ATLVFYLSNVTQGGEIFFPKSEVKNKIWSDCTK-----ISDSLRPIKGNAILFFTVHPNT 229
Query: 180 TIDRTSLHGSCPVIKGEKWVATK 202
+ D S H CPV++GE W ATK
Sbjct: 230 SPDMGSSHSRCPVLEGEMWYATK 252
>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 215
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 93/201 (46%), Gaps = 24/201 (11%)
Query: 11 LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 69
++F S ++C +IA KPS + + T G R S T ++ S D I
Sbjct: 17 VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 73
Query: 70 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 126
+ I +I + + Q + E +L Y G KYD HYDAF ++ PQ+ RL + L
Sbjct: 74 ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 131
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
LYL+DVE GG T FP I + P G G+LF + R SL
Sbjct: 132 LYLNDVEYGGWTQFPH---------------IMANIVPNAGSGILFRNTDAQNRQLRESL 176
Query: 187 HGSCPVIKGEKWVATKWIRDQ 207
H PV GEKW+A+ WIR+
Sbjct: 177 HAGLPVTHGEKWIASIWIREN 197
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 100/210 (47%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + S ++ I +++K + PS+ + V S RTS ++
Sbjct: 325 EILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFPSETVNAANDFVVSK--FRTSKSVWLD 382
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYGPQMS 119
++ + + ++A AT L H E F V+ Y IG ++SH+D + +
Sbjct: 383 RDANEATVK--LTQRLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVGGFI 440
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L YL+DV +GG T FP N + V PR G L +Y+L G
Sbjct: 441 DRIATTLFYLNDVPQGGATHFPGLN---------------ITVFPRLGAALFWYNLDTQG 485
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H CPVI G KWV +KWI D+ Q
Sbjct: 486 MLQVRTMHTGCPVIVGSKWVVSKWIDDKGQ 515
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 91/197 (46%), Gaps = 34/197 (17%)
Query: 11 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 70
+ + NF S +C+ II A ++K S + + V RTS GTF+ D ++
Sbjct: 1 MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGV--VDDIRTSYGTFLRRVPDP--VI 56
Query: 71 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 130
IEH++A + LP +H E VLRY KY H D +R+A+ L+YL
Sbjct: 57 AAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHIDGL----------ERVATVLIYLG 106
Query: 131 DVEEGGETMFPFENGIFLDSGYDYKKCIGLKV--KPRRGDGLLFYSLFPN-GTIDRTSLH 187
E + +C +V KP+RGD L+F+ P+ D S+H
Sbjct: 107 QAERA-----------------NLSQCARGRVAYKPKRGDALMFFDTMPDYKQTDVHSMH 149
Query: 188 GSCPVIKGEKWVATKWI 204
CPV++G KW A KW+
Sbjct: 150 TGCPVVEGVKWNAVKWL 166
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 109/220 (49%), Gaps = 32/220 (14%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGT 58
++ ++ PR N + ++C+ ++ A ++ + + G + VEST TRT+ G
Sbjct: 57 LEQINASPRVFRIRNLLTKQECEHLMLLAFRKGLSKTMIMPYGTHKLVEST--TRTNDGA 114
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG-QKYDSHYDAFNPAEYGP- 116
++ +D ++ +E + + T GE VL Y G Q + HYD F+PA P
Sbjct: 115 WLDFLQDD--VVRRLEETLGKLTKTTPQQGENLQVLHYSNGAQFFQEHYDYFDPARDPPE 172
Query: 117 ---QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 173
Q R + ++YL EGGET FP +GLK+ + GD L+FY
Sbjct: 173 SFEQGGNRYITVIVYLEAALEGGETHFP---------------ELGLKLTAQPGDALMFY 217
Query: 174 SL--FPNGT----IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+L +GT +++ ++H + P ++GEKWVA KWI ++
Sbjct: 218 NLKEHCSGTDPDCVEKKTIHAALPPVRGEKWVAVKWIHEK 257
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 302 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 359
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 360 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 415
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 416 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 458
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 459 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 490
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 345
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 346 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 401
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 402 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 444
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 445 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 476
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
Length = 496
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 100/209 (47%), Gaps = 33/209 (15%)
Query: 1 MQVLSWRPR-ALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 59
M++LS P ALY ++AEQ ++ L SQL ++G + + TF
Sbjct: 302 MELLSRDPLVALYHEVVSAAEQRHLML------LSESQLQRQRGHQYDKIR-------TF 348
Query: 60 ISAS--EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 117
SAS + T +E + ++ T L E +L Y IG +Y H D P +
Sbjct: 349 ASASVAANATPTVEQLHRRLEDITGLDLAESEPLRILNYGIGGQYYIHVDCEQPQTHVEP 408
Query: 118 MSQ--RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
+ RLA+ LLYLSDV GG T FP +GL ++P RG L++++
Sbjct: 409 YPKEYRLATVLLYLSDVRLGGFTSFP---------------ALGLGIRPNRGSALVWHNA 453
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
G D +LH +CPV+ G +WVA+KWI
Sbjct: 454 NNAGNCDYRALHAACPVLLGTRWVASKWI 482
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 114/214 (53%), Gaps = 28/214 (13%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET--VESTKGTRTSSGTFIS 61
LSW+PRA + F S E+C +I+ A K +LA G++ V + ++S G
Sbjct: 60 LSWQPRAFLYRGFLSDEECDHLISLALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYI 117
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQM 118
E + IE +I+ T LP+ + E V++Y+ QKY+ ++ + +++G +
Sbjct: 118 DDE----VAARIEKRISAWTFLPKENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL 172
Query: 119 SQRLASFLLYLSDVEEGGETMFP--------FENGIFLDSGYDYKKCIGLKVKPRRGDGL 170
+A+ LL+LS+V GGE FP ++GI D + GL+ P +G+ +
Sbjct: 173 ---MATVLLHLSNVTRGGELFFPESELKNSQSKSGILSDCT---ESSSGLR--PVKGNAI 224
Query: 171 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
LF+++ PN + D++S + CPV++GE W ATK+
Sbjct: 225 LFFNVHPNASPDKSSSYARCPVLEGEMWCATKFF 258
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 44 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 101
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 102 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 157
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 158 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 200
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 201 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 232
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 80/173 (46%), Gaps = 23/173 (13%)
Query: 43 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 102
G RTS TFI AS K +L I+ ++A T L + E Y IG Y
Sbjct: 362 GHNESLVSNVRTSQFTFIPASAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHY 419
Query: 103 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 156
H D F + P+M R+A+ L YLSDV +GG T FP +
Sbjct: 420 GQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTL---------- 469
Query: 157 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+KP++ +++L +G D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 470 -----LKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIREFDQ 517
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 78/161 (48%), Gaps = 21/161 (13%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 112
R + F+ SE ++ + ++ T L E V Y IG Y H+D
Sbjct: 373 RIAKAAFLKDSEH--NLIVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDYARQG 430
Query: 113 E-YGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 168
E +GP+ R+A++L Y+SDVE GG T+FP +G + P++G
Sbjct: 431 EIHGPRDLDWGNRIATWLFYMSDVEAGGATVFP---------------AVGAALWPQKGS 475
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+Y+L PNG D +LH CPV+ G KWV+ KWI ++ Q
Sbjct: 476 AAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIHERSQ 516
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 159
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 160 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 215
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 216 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 258
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 259 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 290
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 179 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 236
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 237 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 292
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 293 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 335
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 336 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 367
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP + + + S + +++ AK RL+ + + Q + +T R S ++ A E
Sbjct: 331 RPHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQL-TTAPYRVSKSAWLGAFEHP 389
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
+++ I +I T L + E V Y +G +Y+ HYD AF G
Sbjct: 390 --VVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTG--- 444
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++LLY+S+V+ GG T+F IG V P++G + +Y+L P+
Sbjct: 445 -NRIATWLLYMSEVQAGGATVF---------------TDIGASVSPKKGSAVFWYNLHPS 488
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 489 GDGDYRTRHAACPVLLGNKWVSNKWIHERGQ 519
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 24/209 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR +++ N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 208 PRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 267 --VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 369
Query: 184 TSLHGSCPVIKGEKWVATKWI--RDQEQH 210
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQEFH 398
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 103/210 (49%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + S ++ I +++K ++ PS+ + E K + S F S
Sbjct: 324 EILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVWFDS 382
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS 119
+ + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 383 DANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVNGYI 439
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L G
Sbjct: 440 DRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLHTEG 484
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H CPVI G KWV +KWI D+ Q
Sbjct: 485 MLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 79/144 (54%), Gaps = 18/144 (12%)
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP---QMSQRLASF 125
+++ + +I+ T L E + Y +G +Y+ H+D +++G ++ R+A+F
Sbjct: 50 VIKRVCQRISDVTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEVGNRIATF 109
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
L Y+S+VE+GG T+F G+ V+P +G + +Y+L P+G D +
Sbjct: 110 LTYMSNVEQGGSTVFLHP---------------GIAVRPIKGSAVFWYNLLPSGAGDERT 154
Query: 186 LHGSCPVIKGEKWVATKWIRDQEQ 209
H +CPV+ G KWV+ KWI +++Q
Sbjct: 155 RHAACPVLTGVKWVSNKWIHERDQ 178
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 103/210 (49%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + S ++ I +++K ++ PS+ + E K + S F S
Sbjct: 324 EILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVWFDS 382
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS 119
+ + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 383 DANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVNGYI 439
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L G
Sbjct: 440 DRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLHTEG 484
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H CPVI G KWV +KWI D+ Q
Sbjct: 485 MLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
Length = 195
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 98/218 (44%), Gaps = 39/218 (17%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
VL P + + +CQ +I +KK ++P+Q GE R S T++
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 114
G++ + IA A LP H E RYE+G K+D+H D + N E
Sbjct: 59 YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
G QRL + +LYL+ V GGET FP + L V P G L+F +
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGKLLVFEN 158
Query: 175 LFPNGTID--RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
GT + SLH C V +GEKW+AT W R++ Q+
Sbjct: 159 C-KRGTNEPHPLSLHEGCAVHEGEKWIATLWFREKPQY 195
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 103/210 (49%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + S ++ I +++K ++ PS+ + E K + S F S
Sbjct: 324 EILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVWFDS 382
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS 119
+ + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 383 DANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVNGYI 439
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L G
Sbjct: 440 DRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLHTEG 484
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H CPVI G KWV +KWI D+ Q
Sbjct: 485 MLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
Length = 508
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 110/220 (50%), Gaps = 34/220 (15%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+ +S P + + + + Q +IA A+ RL+P+++ + + E+ R++ GTF+
Sbjct: 298 MEEISLEPYIVVYHDILPDKDMQQLIALAEPRLRPTEVF--EEDKSEARTSDRSALGTFL 355
Query: 61 SASE-DKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---- 113
+ + +G +L+ + ++ T + H FN+++Y G +Y +++D FN
Sbjct: 356 PFKDMNPSGGPLLDRLTQRMRDITGIQIRHENTFNIIKYGFGSQYATNFDFFNGTNSEME 415
Query: 114 -YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
YG R+A+ L YL+D GG T+FP I +KV RG L +
Sbjct: 416 GYG----DRMATVLFYLNDAPNGGATVFP---------------RIDVKVTAERGKVLFW 456
Query: 173 YSLFPNG---TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++L NG ++ +LH +CPV +G KWV WI + +Q
Sbjct: 457 HNL--NGETHDVEPNTLHAACPVFQGSKWVMAAWIHEYDQ 494
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 103/210 (49%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + S ++ I +++K ++ PS+ + E K + S F S
Sbjct: 322 EILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVWFDS 380
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS 119
+ + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 381 DANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVNGYI 437
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L G
Sbjct: 438 DRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLHTEG 482
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H CPVI G KWV +KWI D+ Q
Sbjct: 483 MLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 512
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 103/210 (49%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + S ++ I +++K ++ PS+ + E K + S F S
Sbjct: 321 EILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVWFDS 379
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS 119
+ + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 380 DANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVNGYI 436
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L G
Sbjct: 437 DRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLHTEG 481
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H CPVI G KWV +KWI D+ Q
Sbjct: 482 MLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 511
>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 193
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 103/209 (49%), Gaps = 21/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS PRA NF + + I+ +K+ +++ T TRTSS T++
Sbjct: 1 VKALSCAPRAFQVENFLTDVEADHIVGLVQKKND-----MQRSSTNGHISETRTSSTTWL 55
Query: 61 SASEDKTGILELIEHKIARA-----TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
+ D +++ I ++A ML + E ++ Y +GQ+Y +H+D F +
Sbjct: 56 ARHSDP--VIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHD-FGYPKGD 112
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
P R +F +YL+DV GG+T FP + + L V P++G ++FY +
Sbjct: 113 PGSPSRSINFCMYLNDVPAGGQTSFP--------RWRNAETNGALNVVPKKGTAMIFYMV 164
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P+G +D + H + PVI+GEK+ + WI
Sbjct: 165 NPDGNLDDLTHHAALPVIEGEKFFSNLWI 193
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 95/208 (45%), Gaps = 25/208 (12%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKP--SQLALRQGETVESTKGTRTSSGTFISASE 64
+P+ N S + + I A+ RL+P +Q G + S R S ++ E
Sbjct: 320 KPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSY---RISKNAWLYYWE 376
Query: 65 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQR 121
+ ++ ++ ++ AT L E V+ Y IG Y+ H+D E P R
Sbjct: 377 HR--LINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDR 434
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ L Y+SDVE GG T+FP +G +V P +G G +Y+L +G
Sbjct: 435 IATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNLLKSGEG 479
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H CPV+ G KWV+ WI ++ Q
Sbjct: 480 DMLTEHAGCPVLVGSKWVSNMWIHERGQ 507
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 92.8 bits (229), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL+ + + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLR--RATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 92.8 bits (229), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 83/156 (53%), Gaps = 29/156 (18%)
Query: 61 SASEDKTGILELIEHKIARATML-PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----- 114
SAS DK + +++ AT L + + E F V Y IG Y+ H+D F+ +Y
Sbjct: 8 SASADK------LSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFD-FSKVKYFTNPV 60
Query: 115 -GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 173
QM R+A+F++YL+DVE GG T+FP N L ++P + + ++
Sbjct: 61 LNEQMGDRIATFMIYLNDVEAGGRTVFPRLN---------------LVIEPIKNSAVFWH 105
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+L +G D ++HG+CPV+ G KWVA KWI + Q
Sbjct: 106 NLLDDGQQDDRTIHGACPVVLGRKWVANKWIHEYGQ 141
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 92.8 bits (229), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 76/149 (51%), Gaps = 6/149 (4%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ +SW PRA + NF S +C + KR+ S L + RTS G
Sbjct: 8 IETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRS-LVVDSKTGQSKLDDIRTSYGAAF 66
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS 119
ED ++ +E +IA T LP +GE +LRY GQKYD+H+D F +P + +
Sbjct: 67 GRGEDP--VIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAYLH 124
Query: 120 Q--RLASFLLYLSDVEEGGETMFPFENGI 146
+ R A+ LLYLS VE GGET P + I
Sbjct: 125 EGNRYATVLLYLSGVEGGGETNLPLADPI 153
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 92.8 bits (229), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + N S + + + AK RL+ + ++ +E T R S ++ A E
Sbjct: 395 PHIVRYHNIVSEKDMEKVKELAKPRLRRATISNPVTGVLE-TAHYRISKSAWLGAYEHP- 452
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 119
+++ I I T L E V Y +G +Y+ H+D AF G
Sbjct: 453 -VVDKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTG---- 507
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A++LLY++DV+ GG T+F IG VKP++G + +Y+L+P+G
Sbjct: 508 NRIATWLLYMTDVQAGGATVF---------------TDIGAAVKPKKGTAVFWYNLYPSG 552
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 553 EGDYRTRHAACPVLLGNKWVSNKWIHERGQ 582
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 92.8 bits (229), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 102/210 (48%), Gaps = 21/210 (10%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++LS P + + S + I +++K ++ PS+ + E K + S F S
Sbjct: 304 EILSVDPFVILLHDMVSPTEGALIRSSSKNQILPSE-TVNAANEFEVAKFRTSKSVWFDS 362
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS 119
+ + T L+L + ++ AT L H E F V+ Y IG ++SH+D E +
Sbjct: 363 DANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVNGYI 419
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
RLA+ L YL+DV +GG T FP + + V P+ G L++Y+L G
Sbjct: 420 DRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLHTEG 464
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H CPVI G KWV +KWI D+ Q
Sbjct: 465 LLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 494
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 92.4 bits (228), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 117
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/162 (35%), Positives = 78/162 (48%), Gaps = 24/162 (14%)
Query: 51 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 110
RTS + +D + + IE +IAR P HGE VLRY G +Y HYD F+
Sbjct: 4 AARTSDSMCLRVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFD 61
Query: 111 PAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 165
P G + QR+AS ++YL+ E GG T FP + L V
Sbjct: 62 PDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAV 106
Query: 166 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+G+ + F P+ SLH PV+ GEKWVATKW+R++
Sbjct: 107 KGNAVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLRER 146
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 96/208 (46%), Gaps = 23/208 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P + + + E+ +++ A R K + + +E+ K R S F+ E
Sbjct: 346 KPLLVIYHDVIFDEEIETVKKLAHPRFKRTTVMNSATGKLETAK-YRISKAAFLKNKEHH 404
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-----GPQMSQR 121
+L++ ++ T L + E V Y IG Y+ H+D E R
Sbjct: 405 H-VLKM-SRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNR 462
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A++L Y+SDVE GG T+FP + + + P++G +Y+LFPNG
Sbjct: 463 IATWLFYMSDVEAGGATVFP---------------ALNVALWPQKGSAAFWYNLFPNGEG 507
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ + H +CPV+ G KWVA KWI ++ Q
Sbjct: 508 NELTRHAACPVLTGSKWVANKWIHEKNQ 535
>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
Length = 421
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/229 (29%), Positives = 98/229 (42%), Gaps = 37/229 (16%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V S PR L +F + E+C +I++AK + S ++ V + +RTSS ++
Sbjct: 195 KVRSISPRVLEVEDFLTPEECHELISSAKPLMSRSTVSAEGDSAVSLQESSRTSSTAWLP 254
Query: 62 ASE--------DKTGILELI-----EHKIARATMLPQTHG----EAFNVLRYEIGQKYDS 104
D+ L I EH + G A+ VLRYE+ Q Y
Sbjct: 255 PHSHTLANKLYDRVSSLVGIDFRKHEHVVVEDLQAIDKRGGSSVTAWQVLRYEVNQHYHI 314
Query: 105 HYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-G 159
H+D F+P + + R + YL+DVE G DY C G
Sbjct: 315 HHDYFDPVLHRGFLQGDGRNRFITAFFYLTDVERGDPRPIT-----------DYSDCNRG 363
Query: 160 LKVKPRRGDGLLFYSLFPNGT----IDRTSLHGSCPVIKGEKWVATKWI 204
L+V P+RG ++FYSL +G +D S HG C V G KW A WI
Sbjct: 364 LRVPPKRGKAIIFYSLLADGQRSGGLDVASWHGGCDVHNGTKWAANYWI 412
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 95/194 (48%), Gaps = 30/194 (15%)
Query: 25 IIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASEDKTGILELIEHKIARATML 83
++ TA R+ + + +T + +T R S +++A E +++ I +I T L
Sbjct: 338 VLETAHYRISKRRATVHDPQTGKLTTAQYRVSKSAWLAAYEHP--VVDRINQRIEDITGL 395
Query: 84 PQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMSQRLASFLLYLSDVEEG 135
E V Y +G +Y+ H+D AF G R+A++L Y+SDV G
Sbjct: 396 NVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG----NRIATWLFYMSDVAAG 451
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T+FP +G VKP +G + +Y+LFP+G D ++ H +CPV+ G
Sbjct: 452 GATVFPE---------------VGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVG 496
Query: 196 EKWVATKWIRDQEQ 209
KWV+ KWI ++ Q
Sbjct: 497 NKWVSNKWIHERGQ 510
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
Length = 422
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 91/198 (45%), Gaps = 30/198 (15%)
Query: 18 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE-----L 72
SA++C+ ++ A+ L+ SQ+ + T RTS G + ILE
Sbjct: 242 SADECRLLMLLARPHLRASQVVDPNDASTHRTP-IRTSRGATLDP------ILEDFAARA 294
Query: 73 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLASFLLYL 129
+ ++A LP TH EA +VL Y G+ Y +H D P P RL + +YL
Sbjct: 295 AQARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACVYL 354
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
+DV+ GGET FP G++V+PR G + F +L +G D SLH
Sbjct: 355 NDVDAGGETEFPVA---------------GIRVQPRAGSVVCFDNLQADGCPDPDSLHAG 399
Query: 190 CPVIKGEKWVATKWIRDQ 207
PV G KW+ T W R Q
Sbjct: 400 LPVTTGSKWLGTLWFRQQ 417
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + SA + +I A + +K +++ + V RT+ G ++ ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 118
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTD 486
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP + + + S + + + AK RL+ + ++ +E T R S +++ +D
Sbjct: 415 RPYIVRYLDIISDAEIERVKQLAKPRLRRATISNPITGVLE-TASYRISKSAWLTEYDDP 473
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 474 --MIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 528
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 529 -NRIATWLFYMSDVSAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFAS 572
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 573 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 603
>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
Length = 478
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 103/249 (41%), Gaps = 58/249 (23%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGT 58
+ LS RP+ F + + +I K R+KPS++ L R G+ TRTS+
Sbjct: 150 VTTLSMRPQVFRISQFMMGHETEKLIERNKPRIKPSEVGLVGRSGDK------TRTSTNA 203
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHG-----EAFNVLRYEIGQKYDSHYDAFNPAE 113
+ +AS + I RA L + + VL YE Q Y H D F
Sbjct: 204 WDTASP-------VARDVIGRAFRLLKIDAHRKLEDGLQVLHYERPQWYKPHVDYFTSRN 256
Query: 114 YG----------------PQMSQRLASFLLYLSDVEEGGETMFP-------FENGIFLDS 150
G + R A+ LYL++ GGET+FP ++ G +
Sbjct: 257 AGGGGASEDAFSNAIPTANNGTNRFATVFLYLNNAGSGGETVFPLSTTHEIYQGGRLTQA 316
Query: 151 GYDYK---------------KCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G + K L+V PR GD +LFYS + ++D SLHGSCP+ G
Sbjct: 317 GTNRTPGFIRDADAAWVCDTKSEALRVTPRTGDSVLFYSQRGDASLDGYSLHGSCPMGDG 376
Query: 196 EKWVATKWI 204
EKW A W+
Sbjct: 377 EKWAANLWV 385
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 23/165 (13%)
Query: 51 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 110
RTS TFI + K +L I+ ++A T L + E Y IG Y H D F
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 111 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 164
+ P+M R+A+ L YLSDV +GG T FP + +KP
Sbjct: 431 QTTFDAGLVSSPEMGNRIATVLFYLSDVAQGGGTAFP---------------QLRTLLKP 475
Query: 165 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++ +++L +G D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 476 KKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIRENDQ 520
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 28/211 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + SA + +I A + +K +++ + V RT+ G ++ ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 118
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L YL+DVE+GG T +F D GY V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTD 486
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 23/165 (13%)
Query: 51 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 110
RTS TFI + K +L I+ ++A T L + E Y IG Y H D F
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 111 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 164
+ P+M R+A+ L YLSDV +GG T FP + +KP
Sbjct: 431 QTTFDAGLVSSPEMGNRIAAVLFYLSDVAQGGGTAFP---------------QLRTLLKP 475
Query: 165 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++ +++L +G D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 476 KKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIRENDQ 520
>gi|228999322|ref|ZP_04158902.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
gi|229006877|ref|ZP_04164509.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228754370|gb|EEM03783.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228760519|gb|EEM09485.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
Length = 195
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 97/218 (44%), Gaps = 39/218 (17%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
VL P + + +CQ +I +KK ++P+Q GE R S T++
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 114
G++ + IA A LP H E RYE+G K+D+H D + N E
Sbjct: 59 YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
G QRL + +LYL+ V GGET FP + L V P G L+F +
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGKLLVFEN 158
Query: 175 LFPNGTID--RTSLHGSCPVIKGEKWVATKWIRDQEQH 210
GT + SLH C V +GEKW+ T W R++ Q+
Sbjct: 159 C-KRGTNEPHPLSLHEGCAVHEGEKWIVTLWFREKPQY 195
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 94/209 (44%), Gaps = 26/209 (12%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LS P + + + + I R+ + + L TV + RTS TFI+ +
Sbjct: 324 LSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVTLTNQSTVSNV---RTSQITFIAKT 380
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQ 117
E + +L+ I+ ++A T L + E Y IG Y H D F + +
Sbjct: 381 EHE--VLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTE 438
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
M R+A+ L YLSDV +GG T FP+ + ++P++ +++L
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPY---------------LKQHLRPKKYAAAFWHNLHA 483
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
G D + HG+CP+I G KWV +WIR+
Sbjct: 484 AGRGDARTQHGACPIIAGSKWVLNRWIRE 512
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 96/212 (45%), Gaps = 29/212 (13%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P + + SA + ++ A + +K +++ Q E +T RT+ G ++ ++
Sbjct: 325 KPYVVLYHEVLSAREISMLMGKAAQNMKNTRV---QSEKAVNTNRERTAKGYWLKKESNE 381
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQ 117
+ I +I T E F V+ Y IG Y H+D F A +
Sbjct: 382 --MTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIV 439
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+A+ L YL+DVE+GG T+F +G V P+ G + +Y+L
Sbjct: 440 LGDRIATVLFYLTDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDT 484
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H SCPV+ G KWV T+WI + Q
Sbjct: 485 DGNGDPLTRHASCPVVVGSKWVMTEWIHEARQ 516
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 26/208 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + + + I + RLK + + G RTS TFI S K
Sbjct: 302 PLLVLYHDVIYQSEIDVIRKLTENRLKRATVT---GHNESVVSNVRTSQFTFIPVSAHK- 357
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQMSQR 121
+L I+ ++A T L + E Y IG Y H D F P+M R
Sbjct: 358 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNR 416
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ L YLSDV +GG T FP + +KP++ +++L +G
Sbjct: 417 IATVLFYLSDVSQGGGTAFP---------------QLRTLLKPKKYAAAFWHNLHASGVG 461
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + HG+CP+I G KWV +WIR+ +Q
Sbjct: 462 DVRTQHGACPIIAGSKWVQNRWIREVDQ 489
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 103/207 (49%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
PR + + + S E+ + I AK RL ++ +R +T V + R S ++ +D
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTVANYRVSKSAWLEEYDDP 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
++ + ++ T L + E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 --VIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTEGNRLA 450
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
++L Y+SDVE GG T+FP D+ G + PR+G + +Y+LF +G D
Sbjct: 451 TYLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFRSGEGDY 495
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQH 210
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 RTRHAACPVLVGSKWVSNKWFHERGQE 522
>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 294
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 96/202 (47%), Gaps = 31/202 (15%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELI 73
+F S +C ++IA A + S + GE ES RTSS F+ A ED L +
Sbjct: 108 DFFSGPECDALIALAGNYMIVSPVVGAGAGEVSES----RTSSSCFL-ARED----LPTV 158
Query: 74 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLY 128
HK+ T P H E V RY QKY +H+DAF + + QR+ + L+Y
Sbjct: 159 CHKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAFDLNTEDGRRFAQNGGQRVCTVLVY 218
Query: 129 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
L+DV GG T FP +G+KV+PR+G ++F+ +G +D LH
Sbjct: 219 LNDVPSGGCTAFPQ---------------LGMKVQPRKGMAVVFFPATLDGVLDSRLLHA 263
Query: 189 SCPVIKGEKWVATKWIRDQEQH 210
+ P I KWV+ WIR H
Sbjct: 264 AEPAID-TKWVSQIWIRQGAYH 284
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 95/209 (45%), Gaps = 24/209 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 269 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 327
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 328 --VAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 385
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 386 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 430
Query: 184 TSLHGSCPVIKGEKWVATKWI--RDQEQH 210
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 431 KTRHAACPVLTGSKWVANKWLHERGQEFH 459
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 20/160 (12%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 112
RTS G + S+ T + + +A + L + E + Y IG Y+ H+D+F
Sbjct: 372 RTSQGASFNYSQYATT--QRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEH 429
Query: 113 EYGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 169
P+ RLA+ + YLSDV GG T FPF + L V P RG
Sbjct: 430 HEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGSL 474
Query: 170 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 475 LFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 83/173 (47%), Gaps = 28/173 (16%)
Query: 45 TVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 99
TV KG+ RTS TFI + K +L+ I+ ++A + L + E Y IG
Sbjct: 359 TVIGAKGSEVSKVRTSQFTFIPKTRHK--VLQTIDQRVADMSNLNMDYAELHQFANYGIG 416
Query: 100 QKYDSHYDAF------NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD 153
Y H D F N P+M R+A+ L YLSDV +GG T FP +
Sbjct: 417 GHYAQHNDWFGQDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQL------- 469
Query: 154 YKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
++P++ +++L +G D +LHG+CP+I G KWV +WIR+
Sbjct: 470 --------LQPKKYAAAFWHNLHASGVGDLRTLHGACPIIAGSKWVQNRWIRE 514
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 20/160 (12%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 112
RTS G + S+ T + + +A + L + E + Y IG Y+ H+D+F
Sbjct: 372 RTSQGASFNYSQYATT--QRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEH 429
Query: 113 EYGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 169
P+ RLA+ + YLSDV GG T FPF + L V P RG
Sbjct: 430 HEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGSL 474
Query: 170 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 475 LFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 94/210 (44%), Gaps = 28/210 (13%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+ L P + F + S + I A+ R R+ + G + I
Sbjct: 15 MEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRF-------RRATVHDPATGELVPAHYRI 67
Query: 61 SAS----EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 116
S S ++++ ++ + ++A T L T E V+ Y IG YD H+D E
Sbjct: 68 SKSAWLKDEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAF 127
Query: 117 QM--SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
+ R+A+ L Y+SDV +GG T+F +GL V PRRG + + +
Sbjct: 128 EKFNGNRIATVLFYMSDVAQGGATVF---------------TELGLSVFPRRGSAVFWLN 172
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
L P+G D + H +CPV++G KWV KWI
Sbjct: 173 LHPSGEGDLATRHAACPVLRGSKWVCNKWI 202
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 101/194 (52%), Gaps = 25/194 (12%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
+F ++CQ++I ++ +PS + + + RTSS + +D ++ I+
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTIT-----SENPDQQFRTSSTCHLGNMQDP--VIRKID 155
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSD 131
+I + + ++ E Y++GQ++ H D F P A YG QR +F++YL++
Sbjct: 156 LQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNE 215
Query: 132 VEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 191
VE+GG+T+FP + IG K K +G +++ ++ P+G+++ +LH P
Sbjct: 216 VEQGGDTVFP-------------ELAIGFKAK--KGMAVIWNNINPDGSVNYQTLHQGMP 260
Query: 192 VIKGEKWVATKWIR 205
V KGEK + TKW R
Sbjct: 261 VQKGEKLIITKWFR 274
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 107/226 (47%), Gaps = 35/226 (15%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++VLS PR + FP+F + +C+ + ++++L +++ L G RT+ ++
Sbjct: 51 IEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYL-GGPPEGGFSLRRTNKVAWM 109
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH--YDAFNPAEYGPQM 118
S +D +L + +IA AT L T E + V Y +G Y H Y F A+
Sbjct: 110 S--DDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQGDIYK 167
Query: 119 S--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
S RLA+ L+YL+DV GG T F + L VKP G L +Y+L
Sbjct: 168 SSGNRLATMLIYLADVAGGGATAF---------------INMRLAVKPTLGTALFWYNLK 212
Query: 177 P-NGTI------------DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P +G I D + H CPV+ G KW+ TKWI ++EQ
Sbjct: 213 PYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKWIVTKWIHEREQ 258
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + + + + I A+ RLK + + + +E R S ++ ED
Sbjct: 44 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHEDV- 101
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
++ + ++ T L E V+ Y +G YD HYD E S R+A
Sbjct: 102 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 160
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDV +GG T+FP+ +G+ ++P +G ++++L+P+G D
Sbjct: 161 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 205
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV++G KWV KW+ + Q
Sbjct: 206 RTRHAACPVLQGSKWVCNKWLHEAGQ 231
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 102/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + H++ T L E V Y +G +Y+ H+D + P + G + R+A+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVAT 456
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 457 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 501
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 502 TRHAACPVLVGCKWVSNKWFHERGQ 526
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + + + + I A+ RLK + + + +E R S ++ ED
Sbjct: 329 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHEDV- 386
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
++ + ++ T L E V+ Y +G YD HYD E S R+A
Sbjct: 387 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 445
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDV +GG T+FP+ +G+ ++P +G ++++L+P+G D
Sbjct: 446 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 490
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV++G KWV KW+ + Q
Sbjct: 491 RTRHAACPVLQGSKWVCNKWLHEAGQ 516
>gi|156405954|ref|XP_001640996.1| predicted protein [Nematostella vectensis]
gi|156228133|gb|EDO48933.1| predicted protein [Nematostella vectensis]
Length = 182
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 85/171 (49%), Gaps = 25/171 (14%)
Query: 55 SSGTFISASED-KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 113
SS ++ ED K IL I + + E + +Y++GQKY HYD+
Sbjct: 9 SSSLYLKNKEDSKITILRDIAQLAGKLSNTQWRFAEPVALTKYKVGQKYSLHYDS----- 63
Query: 114 YGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGI-------------FLDSGYDYKK 156
G M+Q R A+FL+YL+DV+ GGET+FP I LDS +
Sbjct: 64 -GFLMNQRRVKRTATFLVYLNDVKSGGETIFPLATNISSIQLKKENVDKPSLDSICGKEN 122
Query: 157 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+ +KV P LLF++ +D SLHGSCPV+ GEKW+A W+ ++
Sbjct: 123 NM-VKVSPEAQSCLLFWNHVDGDDVDAFSLHGSCPVVSGEKWIAQIWLHNE 172
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 90/204 (44%), Gaps = 22/204 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 66
PR + SA++C+ ++ A+ L+ S++ + + RTS G T ED
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLA 123
+ ++A LP H E +VL Y G++Y +H D P P R
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ +YL+DV GGET FP G++V+PR G + F +L +G D
Sbjct: 345 TVCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDA 389
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
SLH PV G KW+ T W R Q
Sbjct: 390 DSLHAGLPVTAGSKWLGTLWFRQQ 413
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 93/214 (43%), Gaps = 37/214 (17%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P+ N + + + I A+ RL+ ++ VES T G S K
Sbjct: 321 KPKLWVLHNILTDPEMEVIKKLAQPRLRRAR--------VESPT---TGEGELASYRISK 369
Query: 67 TGILELIEHKIAR--------ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YG 115
+ L EH++ R T L E V+ Y IG Y+ H+D E
Sbjct: 370 SAWLYDWEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALD 429
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
P R+A+ L Y+SDVE GG T+FP +G +V P +G G +Y+L
Sbjct: 430 PNEGDRIATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNL 474
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H CPV+ G KWV+ KWI ++ Q
Sbjct: 475 LKSGEGDMLTEHAGCPVLVGSKWVSNKWIHERGQ 508
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 90/204 (44%), Gaps = 22/204 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 66
PR + SA++C+ ++ A+ L+ S++ + + RTS G T ED
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLA 123
+ ++A LP H E +VL Y G++Y +H D P P R
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ +YL+DV GGET FP G++V+PR G + F +L +G D
Sbjct: 345 TVCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDA 389
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
SLH PV G KW+ T W R Q
Sbjct: 390 DSLHAGLPVTAGSKWLGTLWFRQQ 413
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 98/214 (45%), Gaps = 24/214 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTF 59
++ L P + + SAE+ + A+ L+ S + +L E + + R S GTF
Sbjct: 324 LEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHI--STNFRISQGTF 381
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM 118
E I++ + + + L E V Y IG Y+ H D+F+ YG
Sbjct: 382 FEYHEHP--IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINT 439
Query: 119 ---SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
+ R+A+ + YLS+VE GG T FPF + L V+P RG L +Y+L
Sbjct: 440 YMSTNRVATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGSLLFWYNL 484
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G +D + H CPV+ G KW+A WIR Q
Sbjct: 485 HRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQ 518
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + F + S + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ + +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDV GG T+FP +G V P++G + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 97/213 (45%), Gaps = 21/213 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTF 59
++ L P + N S + + A+ ++ SQ+ + E+ RTS G
Sbjct: 322 LEELHQDPFVVQVHNIVSQKDMNLLQKIARPNIQRSQVYAQDHNANETVAAAYRTSKGAT 381
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP-- 116
E ++ +EL+ +A + L E + Y IG Y+ H+D F + Y P
Sbjct: 382 FEYFEHRS--MELLSRHVADLSGLDMNSAELLQIANYGIGGHYEPHWDCFPDHHVYLPDD 439
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+ + YLS+VE GG T FPF + L V P RG + +Y+L
Sbjct: 440 RDGNRIATGIYYLSEVEAGGGTAFPF---------------LPLLVTPERGSLVFWYNLH 484
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV++G KW+A WIR Q
Sbjct: 485 RSGDQDYRTKHAACPVLQGSKWIANVWIRQSNQ 517
>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
Length = 201
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 106/215 (49%), Gaps = 25/215 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+L+ +P +P+ S+E CQS+I A+ +L P+ + + G V R S +
Sbjct: 6 QLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSHV---RISELAWFC 62
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YGPQ 117
+ ++ +++ I +IA P + E V Y G K+++H D ++ E +
Sbjct: 63 HNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFLEH 120
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
QRL + +LYL+DV GGET FP + ++V P G L+F + P
Sbjct: 121 SGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENCQP 165
Query: 178 NGTI-DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
+ +I D SLHGS + GEKW+ T W ++ Q++
Sbjct: 166 DTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 98/214 (45%), Gaps = 24/214 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTF 59
++ L P + + SAE+ + A+ L+ S + +L E + + R S GTF
Sbjct: 26 LEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHISTN--FRISQGTF 83
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM 118
E I++ + + + L E V Y IG Y+ H D+F+ YG
Sbjct: 84 FEYHEHP--IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINT 141
Query: 119 ---SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
+ R+A+ + YLS+VE GG T FPF + L V+P RG L +Y+L
Sbjct: 142 YMSTNRVATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGSLLFWYNL 186
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G +D + H CPV+ G KW+A WIR Q
Sbjct: 187 HRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQ 220
>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
Length = 201
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 106/215 (49%), Gaps = 25/215 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
Q+L+ +P +P+ S+E CQS+I A+ +L P+ + + G V R S +
Sbjct: 6 QLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSHV---RISELAWFC 62
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YGPQ 117
+ ++ +++ I +IA P + E V Y G K+++H D ++ E +
Sbjct: 63 HNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFLEH 120
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
QRL + +LYL+DV GGET FP + ++V P G L+F + P
Sbjct: 121 SGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENCQP 165
Query: 178 NGTI-DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
+ +I D SLHGS + GEKW+ T W ++ Q++
Sbjct: 166 DTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + + + + I A+ RLK + + + +E R S ++ ED
Sbjct: 347 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFA-DYRISKSAWLKEHEDV- 404
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
++ + ++ T L E V+ Y +G YD HYD E S R+A
Sbjct: 405 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 463
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDV +GG T+FP+ +G+ ++P +G ++++L+P+G D
Sbjct: 464 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 508
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV++G KWV KW+ + Q
Sbjct: 509 RTRHAACPVLQGSKWVCNKWLHEAGQ 534
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 103/215 (47%), Gaps = 25/215 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
M+ LS P + + N S + IA ++ +P ++ GE S K RT+ G +
Sbjct: 319 MEELSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAW 374
Query: 60 ISASEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 116
I +++ I +I T L HG+ +++Y G YD+H+D N +
Sbjct: 375 IPDKNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLPIT 434
Query: 117 Q-MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
Q + R+A+ L YL+DV+ GG T+FP + LKV RG L++Y++
Sbjct: 435 QALGDRMATVLFYLNDVKHGGSTVFP---------------VLKLKVPSERGKVLVWYNM 479
Query: 176 F-PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+D +LHGSCPVI G K V + WI + +Q
Sbjct: 480 HGETHDLDSRTLHGSCPVIDGAKTVLSCWIHEWDQ 514
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 77/158 (48%), Gaps = 21/158 (13%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFN 110
R S ++ +D+ I+ + +I T L T E VL Y +G +Y+ H+D
Sbjct: 126 RISQQAWLHDKDDE--IVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYMT 183
Query: 111 PAE--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 168
E +G + R+A+FL+YLSDV GG T+FP N + V +
Sbjct: 184 AEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVAN---------------VTVPVVKNA 228
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
GLLF L +G D SLH CPV+ G KW+A KWI +
Sbjct: 229 GLLFMDLLRSGRGDVNSLHAGCPVVIGSKWIANKWIHE 266
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 97/185 (52%), Gaps = 20/185 (10%)
Query: 21 QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 80
+C +++ + ++ S LA T G R SS I ED ++ IE +I+
Sbjct: 2 ECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI---EDI--VVSKIEDRISLW 47
Query: 81 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 140
+ LP+ +GE+ VL+Y + + + + RLA+ L+YLSDV++GGET+F
Sbjct: 48 SFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVF 102
Query: 141 PFENGIFLDSGYDY-KKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 199
P + +C G V+P +G+ +L ++L P+G D+ S + CPV++GEKW+
Sbjct: 103 PRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLEGEKWL 162
Query: 200 ATKWI 204
A K I
Sbjct: 163 AIKHI 167
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 343 PRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 401
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T L E V+ Y IG Y+ H+D E S R+A
Sbjct: 402 --VAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 459
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + PR+G +++L PNG D
Sbjct: 460 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWFNLKPNGEGDL 504
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 505 RTRHAACPVLTGSKWVANKWLHERGQ 530
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 83/154 (53%), Gaps = 18/154 (11%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 112
R S ++ + T +E +I+R T L + E + Y IG +Y+ HYD ++
Sbjct: 367 RVSKSAWLKDEDSDT--VEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYD-YSRR 423
Query: 113 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
E+ ++R+A++L YL+ VE+GG T+F +GL ++ +G + +
Sbjct: 424 EWDIYNNRRIATWLSYLTTVEQGGGTVF---------------TELGLHIRSIKGSAVFW 468
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
Y+L PNG+ D + H +CPV++G KWV+ KWI +
Sbjct: 469 YNLLPNGSGDERTRHAACPVLRGNKWVSNKWIHE 502
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 102/212 (48%), Gaps = 26/212 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+ + S P NF S ++C++ + K +++ +++ + E+ +RT+ ++
Sbjct: 10 VTLYSADPIVYVVNNFLSDDECEAFVEMGKGKMERAKV-ISDDES--EFHASRTNDFCWL 66
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
S + ++ + + + +P + E F ++ Y G +Y H+DAF+ Q +
Sbjct: 67 EHS--ASDVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNW 124
Query: 120 ----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
QR+ + L YL+DVEEGG T FP I + VKP +GD ++F++
Sbjct: 125 FPGGQRMVTALAYLNDVEEGGATDFP---------------KINVSVKPNKGDVVVFHNC 169
Query: 176 FPNGT-IDRTSLHGSCPVIKGEKWVATKWIRD 206
T I+ +LHG PV+ GEKW W R+
Sbjct: 170 IEGTTEINPQALHGGSPVVAGEKWAVNLWFRE 201
>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
Length = 487
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 65/222 (29%), Positives = 104/222 (46%), Gaps = 15/222 (6%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+ +S P F ++ ++ + L PS + L+ G RTS+ ++
Sbjct: 268 METISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWL 327
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEY 114
+S +++ I+ + A +P +H E+ VLRYE Q YD H D F N A+
Sbjct: 328 ESSSHP--VVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRNSADV 385
Query: 115 GPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDG 169
++ R+ + Y+SDV +GG T F G L K C GL V P++
Sbjct: 386 LKKIEHGYKNRMITVFWYMSDVAKGGHTNFARAGG--LPPPPTNKGCTQGLSVVPKKRKV 443
Query: 170 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
++FYS+ PNG D SLH CPV +G K KW+ ++ + +
Sbjct: 444 VVFYSMLPNGEGDPMSLHAGCPVEEGIKMSGNKWVWNKPRSD 485
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 95/205 (46%), Gaps = 20/205 (9%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + + + S E+ ++I A+ R + + + ++ E ++ R + ++ E
Sbjct: 345 KPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSR-YRIAKSAWLKHEEHD 403
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ--RLAS 124
+ I ++ T L E V Y IG Y+ HYD E R+A+
Sbjct: 404 --YVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQQDFGWGGRIAT 461
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
+L Y+SDVE GG T+FP N L + P++G +++L+PNG +
Sbjct: 462 WLFYMSDVEAGGATVFPKLN---------------LSLWPQKGSAAFWFNLYPNGEGNEM 506
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H CPV+ G KWVA WI ++ Q
Sbjct: 507 TQHAGCPVLTGSKWVANYWIHERGQ 531
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 101/209 (48%), Gaps = 25/209 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ + +P + + +IA AK RL+ S+ + +RTSS T++
Sbjct: 319 LEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---DGPPSRTSSNTWL 375
Query: 61 SASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPAEY 114
+ + + + ++ + T+ + E + + Y IG Y H+D F P++
Sbjct: 376 NDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHHDYFEEFQTPSK- 434
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
G + R+A+ ++Y+SDVEEGG T+FP +G++V P++GD + +++
Sbjct: 435 GNRFGNRVATLMIYMSDVEEGGATVFP---------------SLGVRVSPKKGDAVFWWN 479
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKW 203
+ + + + H CPV+ G KW+A KW
Sbjct: 480 IMSSWEGEMLTWHAGCPVLYGSKWIANKW 508
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 135
++ AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T FPF N VKP+ G+ L +Y++ + +D + H CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKG 498
Query: 196 EKWVATKWIRDQEQ 209
KW+ WI + Q
Sbjct: 499 SKWIGNVWIHEATQ 512
>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
Length = 210
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 94/212 (44%), Gaps = 29/212 (13%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+LS P Y N + ++C II +LKP AL G + RT + ++S
Sbjct: 4 HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKP---ALVSGNSRGFLSTGRTGTNCWLS 60
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 119
D+ I I KI P + E F VL Y QKY+ HYDAF P + +
Sbjct: 61 HKNDE--ITFNIALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAF-PIDNSEKAKRC 117
Query: 120 -----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
QRL + L+YL++V +GGET F K + +K+ P+ G L+F +
Sbjct: 118 LKKGGQRLLTALIYLNNVTKGGETEF---------------KNLNIKITPKIGRILVFEN 162
Query: 175 LFPNGTIDR-TSLHGSCPVIKGEKWVATKWIR 205
N SLH VI+GEK+V W R
Sbjct: 163 TLQNSLNKHPDSLHSGKQVIEGEKYVINLWFR 194
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 135
++ AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T FPF N VKP+ G+ L +Y++ + +D + H CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKG 498
Query: 196 EKWVATKWIRDQEQ 209
KW+ WI + Q
Sbjct: 499 SKWIGNVWIHEATQ 512
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 16/134 (11%)
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-QMSQRLASFLLYLSDVEEG 135
++ AT L T+ E V Y +G Y+ H+D F +++ P + R+A+ + YLSDVE+G
Sbjct: 398 LSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFYLSDVEQG 457
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T FPF N V+P+ G+ L +Y+L + +D + H CPV+KG
Sbjct: 458 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKG 502
Query: 196 EKWVATKWIRDQEQ 209
KW+A WI + Q
Sbjct: 503 SKWIANIWIHEATQ 516
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 95/209 (45%), Gaps = 24/209 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + N E+ ++I A+ R K + + + +E R S ++ E K
Sbjct: 208 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 267 --VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + PR+G +++L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWHNLKPNGEGDF 369
Query: 184 TSLHGSCPVIKGEKWVATKWI--RDQEQH 210
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQEFH 398
>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
Length = 543
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 97/210 (46%), Gaps = 19/210 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFI 60
++LS P L + ++ I A++K+ L S++ + E RTS +
Sbjct: 327 EILSLDPFVLLLHDMVRQKESTLIRASSKEHLLQSEITNTDASSSEDNVAIFRTSKSVWY 386
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS 119
S+ + T + I ++A AT L E F V+ Y +G + +H D + S
Sbjct: 387 SSDFNDTT--KKITERLADATGLDMHFTEYFQVINYGLGGFFATHLDMLLSDKTRFNGTS 444
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ + YL+ V +GG T FP N L V P+ G L +Y+L G
Sbjct: 445 DRIATTVFYLNGVRQGGATHFPLLN---------------LTVFPQPGSALFWYNLDTKG 489
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
R+++H CPVI G KWV TKW+ DQ Q
Sbjct: 490 NDQRSTMHTGCPVIVGSKWVMTKWVGDQGQ 519
>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
Length = 173
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/173 (36%), Positives = 84/173 (48%), Gaps = 25/173 (14%)
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG-------EAFNVLRYEIGQKYDS 104
RTSSG F+S +D T + I R ++P G L + +K S
Sbjct: 12 VRTSSGMFLSP-DDST-------YPIVRVFVVPPMEGFWNSCGLSNSLCLFLQAIEKRIS 63
Query: 105 HYDAFNPAEYGPQM-------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 157
Y P E G + QR+A+ L+YLSD EGGET FP F G K
Sbjct: 64 VYSQV-PVENGELIQFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSGFCRCGG--KSV 120
Query: 158 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 210
GL V P +G+ +LF+S+ +G D S+HG C V+ GEKW ATKW+R + H
Sbjct: 121 RGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRSTH 173
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 89.4 bits (220), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 96/207 (46%), Gaps = 22/207 (10%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP F + + + +I A+ R K + + +E + R S ++ E K
Sbjct: 331 RPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQ-YRISKSAWLKEEEHK 389
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRL 122
+ + +++ T L + E V+ Y IG Y+ H+D E S R+
Sbjct: 390 H--IADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAFKSLGTGNRI 447
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ L Y+SDVE+GG T+FP I + + P++G +Y+L P+G D
Sbjct: 448 ATVLFYMSDVEQGGATVFP---------------SIQVSLWPQKGSAAFWYNLHPSGDGD 492
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 493 KMTRHAACPVLTGSKWVSNKWIHERGQ 519
>gi|302849869|ref|XP_002956463.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
nagariensis]
gi|300258161|gb|EFJ42400.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
nagariensis]
Length = 965
Score = 89.4 bits (220), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 97/206 (47%), Gaps = 38/206 (18%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGT 58
++VL+ P + F SA +C I+ +A +K S + + G V+ T+ RTSS
Sbjct: 744 LRVLNIDPPVITVEGFLSAPECDGIVRSAADSGLMKQSGVGV-SGYQVKDTENVRTSSTL 802
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 118
+A +T A LPQ V RY+ GQ + +H DAF A G +
Sbjct: 803 AATAEPGQT------------AFELPQ-------VARYQPGQHFLTHEDAFPAAVVGSKG 843
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
QR A+ L+YL+D E+GG T F + + V+PR+G LLF+ F N
Sbjct: 844 YQRRATLLVYLNDCEQGGATKF---------------DILDIAVQPRKGTALLFFPAFAN 888
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWI 204
G DR +LH + + EKWV W+
Sbjct: 889 GMPDRRTLHTAQDAVS-EKWVTQLWL 913
>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 535
Score = 89.4 bits (220), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 106/254 (41%), Gaps = 48/254 (18%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATA------KKRLKPSQLALRQGETVESTKGTRT 54
++ +S PR NF S E+ +I +L+ S + + + RT
Sbjct: 181 IESISESPRTFRLHNFFSGEEADKLIKRTLEIDDPSNKLQQSTVGANDNKNKKKKSKHRT 240
Query: 55 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD------- 107
S F + SE I + + + + +LRY+ Q Y +H D
Sbjct: 241 SENAFDTVSEAAVDIRKRV-FDVLSLGEFQADMADGLQLLRYQQKQAYIAHEDYFPVGAA 299
Query: 108 ---AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEN---GIFL------DSGYDY- 154
F+P + G S R A+ LYLSDV GG+T+FP G+ +S DY
Sbjct: 300 KDFNFDPHKGG---SNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNSAQDYE 356
Query: 155 -----------------KKC-IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE 196
+KC L P +G +LFYS PNG +D SLHG CPV++G
Sbjct: 357 AIGAELFEPGSWEMDMVRKCSTKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCPVLEGT 416
Query: 197 KWVATKWIRDQEQH 210
KW A W+ ++ +H
Sbjct: 417 KWGANLWVWNRRRH 430
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 89.4 bits (220), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 100/227 (44%), Gaps = 41/227 (18%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPS---------------QLALRQGETVESTKG 51
+P L F NF + + + I A RLK + +++ R+ G
Sbjct: 309 KPEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHPVTG 368
Query: 52 T------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 105
R S ++ ED+ +++ I +++ + L T E V+ Y IG Y+ H
Sbjct: 369 KLEFANYRISKSGWLRDEEDE--LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPH 426
Query: 106 YDAFNPAE---YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 162
YD E R+A+FL YLSDVE GG T+F +G V
Sbjct: 427 YDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFT---------------RVGATV 471
Query: 163 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P++GD +Y+L +G D ++ H +CPV+ G KWVA KWI + Q
Sbjct: 472 WPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIHEVGQ 518
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 106/213 (49%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VLS +P + + NF + + + I A+ L+ S +A GE ++T R S ++
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA--SGEN-QATVEYRISKSAWLK 369
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
SE + I+ ++ +I+ T L + E V+ Y IG Y+ H+D A +P+ +
Sbjct: 370 GSE--SCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + + +++L
Sbjct: 428 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVLKKAAIFWWNLH 472
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG D +LH CPV+ G+KWVA KW+ + Q
Sbjct: 473 RNGRGDAETLHAGCPVLIGDKWVANKWVHEYGQ 505
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 107/231 (46%), Gaps = 38/231 (16%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKG----TRTSS 56
+V+++ PR F + S + + + A K S + L G T G R S
Sbjct: 361 EVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQ 420
Query: 57 GTFISASEDKTGILELIEHKIARATMLP------QTHGEAFNVLRYEIGQKYDSHYD--- 107
+++ D+ L +E++I T L ++H E F VL Y +G Y HYD
Sbjct: 421 TSWLGT--DEYPELSRLENRIKLTTGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTG 478
Query: 108 -----AFNPAEYGPQMS--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 160
NP + + +R+A+++ YL+DV+ GG T+FP +
Sbjct: 479 YMLGIPSNPLDSDDIRTSGERMATWMFYLNDVKAGGATVFP---------------EVKT 523
Query: 161 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
++ +G +Y++ P+G D +LHG CPV+ G KWV+ KWIR++ Q +
Sbjct: 524 RIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKWVSNKWIREEGQMD 574
>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 188
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 95/208 (45%), Gaps = 29/208 (13%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
VL+ P NF + +C+ +I A+ P+ + G+ +RTSS ++S
Sbjct: 1 VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVV---GKGAGEVSPSRTSSTCYLSR 57
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQ 117
+ L + K++ T P H E V RY Q+Y HYDAF+ +
Sbjct: 58 ED-----LPDLMRKVSSLTGKPIEHCELPQVGRYFPSQQYLQHYDAFDLGTEDGLRFAAN 112
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
QR + LLYL+DV GG T FP N L V+PR+G L+F+
Sbjct: 113 GGQRTITVLLYLNDVARGGATRFPALN---------------LDVQPRQGMALVFFPATI 157
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G +DR +LH + P + K+V+ WIR
Sbjct: 158 DGMLDRMALHAAMPAVD-TKYVSQVWIR 184
>gi|451927223|gb|AGF85101.1| 4-hydroxylase [Moumouvirus goulette]
Length = 239
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 95/199 (47%), Gaps = 30/199 (15%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
NF + E+C I+ + +L S++ + K R S ++S + +++ +
Sbjct: 61 NFINKEKCGEIMNNTQSKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 129
KI++ +P + E V+RY GQ Y+ H+DA E+ + QR + L+YL
Sbjct: 112 QKISQQFNIPIQNAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLIYL 171
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-IDRTSLHG 188
++ EGG T F K +GLKVKP GD ++FY L N + SLH
Sbjct: 172 NNEFEGGHTFF---------------KNLGLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 189 SCPVIKGEKWVATKWIRDQ 207
PV GEKW+A W R++
Sbjct: 217 GMPVTNGEKWIANLWFRER 235
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 135
++ AT L T E V Y +G Y+ H+D F + + P R+A+ + YLSDVE+G
Sbjct: 394 VSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 453
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T FPF N V+P+ G+ L +Y+L + +D + H CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKG 498
Query: 196 EKWVATKWIRDQEQ 209
KW+A WI + Q
Sbjct: 499 SKWIANIWIHEATQ 512
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 76/146 (52%), Gaps = 16/146 (10%)
Query: 65 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-MSQRLA 123
+ T +++ + ++ T L +A ++ Y +G YD HYD+ N +E + R+A
Sbjct: 351 NDTAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHNYSEANRLILGDRIA 410
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+ +V+ GG T FP+ I + V P++G +L+Y+L G ++
Sbjct: 411 TVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSAVLWYNLDNAGQMNP 455
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
++H CPVI G K+V TKWI + Q
Sbjct: 456 KAIHAGCPVIVGSKYVLTKWINEIPQ 481
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 87/169 (51%), Gaps = 28/169 (16%)
Query: 46 VESTKGTRTSSGTFISASEDKTGI--LELIEHKIARATMLPQT--HGEAFNVLRYEIGQK 101
++ RTS+ F+ ++TGI LE I + A T L T E V+ Y +G +
Sbjct: 361 IDQADVDRTSNSVFM----EETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQ 416
Query: 102 YDSHYDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 160
Y H D F+ AE G RLA+ L YL+DV++GG T+FPF + L
Sbjct: 417 YTPHCDYFDENAENG----DRLATVLFYLTDVQQGGATVFPF---------------LRL 457
Query: 161 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P++G L+F +L + D+ S H +CPV+ G KWVATKWI +Q
Sbjct: 458 SYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIYHFDQ 506
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 24/215 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE---TVESTKGT-RTSS 56
++LS P + + S E+ + +K + PS+ A L E E G+ RTS
Sbjct: 325 EILSIDPFIVLLHDMVSVEEGALLRTFSKNMISPSETAELSDSEEKSIFEFEVGSFRTSK 384
Query: 57 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEY 114
++ ++ + + ++ AT L +H E F V+ Y IG ++SH+D + +
Sbjct: 385 SVWLDNDANEATLK--LTQRLGDATGLDISHSEPFQVINYGIGGIFESHFDTSLQDENRF 442
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
RLA+ L YL+DV +GG T FP N + V P+ G L +Y+
Sbjct: 443 LDGYMDRLATTLFYLNDVPQGGATHFPGLN---------------ITVFPKFGTALFWYN 487
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
L G + ++H CPVI G KWV +KWI D+ Q
Sbjct: 488 LDTKGLLRLRTMHTGCPVIVGSKWVVSKWIDDKGQ 522
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 68/134 (50%), Gaps = 16/134 (11%)
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 135
+ AT L T+ E V Y +G Y+ H+D F + + P R+A+ + YLSDVE+G
Sbjct: 393 VGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 452
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T FPF N V+P+ G+ L +Y+L + D + H CPV+KG
Sbjct: 453 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKG 497
Query: 196 EKWVATKWIRDQEQ 209
KW+A WI + Q
Sbjct: 498 SKWIANIWIHEATQ 511
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 346
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 347 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 406
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 407 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 451
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 452 TRHAACPVLVGCKWVSNKWFHERGQ 476
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 20/146 (13%)
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AFNPAEYGPQMSQRLA 123
+++ ++++I T L +A V Y IG YD HYD + +E + R+A
Sbjct: 393 VVDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIA 452
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FLLY++DV+ GG T+FP I ++V P++G + +Y+L +G
Sbjct: 453 TFLLYMTDVDAGGATVFPI---------------IDVRVLPKKGTAVFWYNLRRSGKGIM 497
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KWIR + Q
Sbjct: 498 ETRHAACPVLVGTKWVSNKWIRTRGQ 523
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 101/204 (49%), Gaps = 24/204 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK RL ++ +R +T V + R S +++ ED
Sbjct: 342 PHIVRYYDVMSDEEIEKIKQLAKPRL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 397
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
++ + ++ T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 398 DPVVAKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 456
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 457 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 501
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
+ H +CPV+ G KWV+ KW ++
Sbjct: 502 RTRHAACPVLVGCKWVSNKWFHER 525
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/230 (26%), Positives = 105/230 (45%), Gaps = 46/230 (20%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPS---------------QLALRQGETVESTKG 51
RP + + + S ++ + + AK RL+ + +++ R+ + G
Sbjct: 373 RPYIVRYLDIISDKEIELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTG 432
Query: 52 TRTSSGTFISASEDKTG----ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 107
T++ +S S TG ++E I +I T L E V Y +G +Y+ H+D
Sbjct: 433 KLTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGGQYEPHFD 492
Query: 108 --------AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 159
AF G R+A++L Y+SDV GG T+FP +G
Sbjct: 493 FGRKDEPDAFKELGTG----NRIATWLFYMSDVAAGGATVFP---------------DVG 533
Query: 160 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
V P++G + +Y+LF +G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 534 AAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 583
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 135
++ T L T+ E V Y +G Y+ H+D F NP Y + R+A+ + YLS+VE+G
Sbjct: 396 LSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYYLSEVEQG 455
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T FPF N V+P+ G+ L +Y+L + +D + H CPV+KG
Sbjct: 456 GATAFPFLN---------------FAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKG 500
Query: 196 EKWVATKWIRDQEQ 209
KW+ WI + Q
Sbjct: 501 SKWIGNVWIHEVTQ 514
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + N E+ ++I A+ R K + + + +E R S ++ E +
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + N E+ ++I A+ R K + + + +E R S ++ E +
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + N ++ ++I A+ R K + + + +E R S ++ E K
Sbjct: 208 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 267 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 369
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 103/205 (50%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V +T R S +++ ++D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTTASYRVSKSSWLEETDDP 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 --VVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ I ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 422
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 423 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 467
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 468 TRHAACPVLVGCKWVSNKWFHERGQ 492
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 336 PHIVRYYEVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
++ + H++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 --VVARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFD-FSRRPFDITLKTEGNRLA 450
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 451 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 495
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
+ H +CPV+ G KWV+ KW ++
Sbjct: 496 RTRHAACPVLVGCKWVSNKWFHER 519
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 484 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 529 TRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 474
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 475 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 519
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 520 TRHAACPVLVGCKWVSNKWFHERGQ 544
>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
Length = 535
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 85/179 (47%), Gaps = 25/179 (13%)
Query: 39 ALRQGETVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 93
L++ E T G+ RTS GT D+ I+E + + + L E +
Sbjct: 354 VLQRSEVYSPTNGSTAATFRTSQGTVFEY--DEHPIIEKLSQHMTLISGLDMGFAEPLQI 411
Query: 94 LRYEIGQKYDSHYDAFNPA-EYGPQM--SQRLASFLLYLSDVEEGGETMFPFENGIFLDS 150
Y IG Y+ H D+F + +Y Q + R+A+ + YLS+VE GG T FPF
Sbjct: 412 ANYGIGGHYEPHMDSFPESFDYSLQRFKTNRIATGIFYLSNVEAGGATAFPF-------- 463
Query: 151 GYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ L VKP +G L +Y+L +G D + H CPV++G KW+A WIR Q
Sbjct: 464 -------LPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKWIANVWIRLSHQ 515
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 484 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 529 TRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 96/213 (45%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++L + P A++F + + E+ I A RL+ + + +E T RTS ++
Sbjct: 325 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKSAWL 383
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
E + I+ I +I T L Q E V Y IG YD H+D E S
Sbjct: 384 KDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSL 441
Query: 120 ---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
RLA+ L Y++ E GG T+F + V P + D L +Y+L
Sbjct: 442 NTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWYNLL 486
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 487 RSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 519
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 493
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 494 FLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 538
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 539 TRHAACPVLVGCKWVSNKWFHERGQ 563
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 96/213 (45%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++L + P A++F + + E+ I A RL+ + + +E T RTS ++
Sbjct: 326 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKSAWL 384
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
E + I+ I +I T L Q E V Y IG YD H+D E S
Sbjct: 385 KDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSL 442
Query: 120 ---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
RLA+ L Y++ E GG T+F + V P + D L +Y+L
Sbjct: 443 NTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWYNLL 487
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 488 RSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 520
>gi|359400227|ref|ZP_09193216.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
gi|357598467|gb|EHJ60196.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
Length = 193
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 95/198 (47%), Gaps = 28/198 (14%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
+F QC ++IA + +PS +A G+ V RTSS +S G + +
Sbjct: 16 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPD---VGAVAALA 67
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 129
K+ + + H E RYE+GQ++ +H D F P +Y QR +F++YL
Sbjct: 68 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 127
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
+DV+ GG T F K I ++P RG + + + P+G+++ +LH +
Sbjct: 128 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 172
Query: 190 CPVIKGEKWVATKWIRDQ 207
V +G K+V T+W R++
Sbjct: 173 MKVRQGRKYVVTQWFRER 190
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 97/210 (46%), Gaps = 19/210 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFI 60
++LS P L F + S ++ I +++K+ + PS E T RTS +
Sbjct: 331 EILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKSVWY 390
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS 119
S++ + T + I ++ AT L E F V+ Y +G +++H D +
Sbjct: 391 SSTSNDTT--KRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFNGTR 448
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
RLA+ L YL++V +GG T FP N L V P+ G L +Y+L G
Sbjct: 449 DRLATTLFYLNEVRQGGGTHFPRLN---------------LTVFPQPGSALFWYNLDTRG 493
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++LH CPVI G KWV +KW+ D Q
Sbjct: 494 NDHTSTLHTGCPVIVGSKWVMSKWVEDAGQ 523
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP G + P++G + +Y+LF +G D
Sbjct: 453 FLNYMSDVEAGGATVFP---------------DFGATIWPKKGTSVFWYNLFRSGEGDYR 497
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 TRHAACPVLVGSKWVSNKWFHERGQ 522
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
PR + + N ++ ++I A+ R K + + + +E R S ++ E K
Sbjct: 330 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 388
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 123
+ + ++ T + E V+ Y IG Y+ H+D E S R+A
Sbjct: 389 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 446
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 447 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 491
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 492 KTRHAACPVLTGSKWVANKWLHERGQ 517
>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
Length = 239
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 95/199 (47%), Gaps = 30/199 (15%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
NF + E+C+ I+ + +L S++ + K R S ++S + +++ +
Sbjct: 61 NFINKEKCKEIMNNTQNKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 129
KI++ +P + E V+RY GQ Y+ H+DA E+ + QR + L+YL
Sbjct: 112 QKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVYL 171
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-IDRTSLHG 188
++ EGG T F K + LKVKP GD ++FY L N + SLH
Sbjct: 172 NNEFEGGHTFF---------------KNLNLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 189 SCPVIKGEKWVATKWIRDQ 207
PV GEKW+A W R++
Sbjct: 217 GMPVTSGEKWIANLWFRER 235
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 100/211 (47%), Gaps = 22/211 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+++S P+ F N S + + ++ A+ RL+ +++ + +E R S ++
Sbjct: 319 MEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVD-YRISQIAWL 377
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
S S+ I+ I ++ T L GE V Y +G Y+ H+D E P S
Sbjct: 378 SDSDGD--IVRRINRRVGFITGLNTNTGECLQVNNYGVGGHYEPHFDHSLDMENSPIASL 435
Query: 120 ---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+F+ YLS+VE GG T+F G+K P +G + +Y+L
Sbjct: 436 GQGNRIATFMFYLSEVEAGGSTVFI---------------KTGVKTNPFKGGAVFWYNLK 480
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
+G D SLH CPV+ G KWVA KW+ +
Sbjct: 481 KSGEGDWDSLHAGCPVLIGNKWVANKWLHEH 511
>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 96/209 (45%), Gaps = 21/209 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + + + A++ S++ +R + E S RT+ ++
Sbjct: 312 MELLSLDPYVVLYHDVL-ADREMSLLKLMAQRDLVRAVTYNATEKKHSEDPNRTTKAGWL 370
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
S + + ++ ++ L E F VL Y IG Y H D F + P++
Sbjct: 371 DPSHNLIRRMGILTEDMSN---LDLERSEDFQVLNYGIGGHYAVHPDFFEGS--NPELPD 425
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ L YLSDV GG T+FP + L V P++G L++Y+L G
Sbjct: 426 RVATLLFYLSDVPLGGATVFPL---------------LDLSVFPKKGAVLMWYNLDHKGQ 470
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++H +CPV+ G +WV TKW+ Q Q
Sbjct: 471 GMEKTIHSACPVVVGSRWVMTKWVNQQPQ 499
>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
Length = 229
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 100/210 (47%), Gaps = 25/210 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+ LS P + F + + ++ + LK S + Q V ++K F+
Sbjct: 27 MEELSHDPYMVLFHDVVYESEIDFLLNATQ--LKASLVGQYQYSPVRTSKEQH-----FV 79
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-MS 119
++ T +++ + ++ T L + ++ Y +G YD HYD+ N +E +
Sbjct: 80 EYND--TAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYDVHYDSHNYSEANRLILG 137
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L Y+ +V+ GG T FP+ I + V P++G +L+Y+L +G
Sbjct: 138 DRIATVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSAVLWYNLDNSG 182
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++ ++H CPVI G K+V TKWI + Q
Sbjct: 183 QMNPKAIHAGCPVIVGSKYVLTKWINEIPQ 212
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 100/211 (47%), Gaps = 32/211 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ + H++ T L E V Y +G +Y+ H+D AF G
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTG--- 453
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +
Sbjct: 454 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 497
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 528
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 64/121 (52%), Gaps = 16/121 (13%)
Query: 89 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL 148
E + Y +G Y H+D F + R+AS L YLSDV +GG+T +F+
Sbjct: 5 EFLQIANYGLGGHYLPHFD-FTRDVATHKNGNRIASMLFYLSDVAKGGDT-------VFI 56
Query: 149 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 208
D+G K+KP +G + +Y+LF NG +D + H SCPVI G KWVA W+ +
Sbjct: 57 DAG--------AKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKWVANMWMHEHG 108
Query: 209 Q 209
Q
Sbjct: 109 Q 109
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 99/211 (46%), Gaps = 30/211 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P + + N + ++ +++ A+ R K + + +E R S ++ SE+
Sbjct: 343 KPLIVIYHNVINDDEIETVKKMAQPRFKRATVQNSVTGNLEPA-NYRISKSAWLK-SEEH 400
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
+ + + ++ T L E V+ Y IG Y+ H+D AF +G
Sbjct: 401 DHVFK-VTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWG--- 456
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+S+VE GG T+FP + L + P++G +Y+L PN
Sbjct: 457 -NRVATWLFYMSEVEAGGATVFP---------------KLNLALWPQKGSAAFWYNLHPN 500
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G + + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 501 GEGNELTRHAACPVLTGSKWVSNKWIHERNQ 531
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 451
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 452 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 511
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 512 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 556
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 557 TRHAACPVLVGCKWVSNKWFHERGQ 581
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 91/194 (46%), Gaps = 28/194 (14%)
Query: 25 IIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP 84
+I A + +K +++ + V RT+ G ++ ++ + + I +I T
Sbjct: 2 LIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE--LTKRITRRIMDMTGFD 57
Query: 85 QTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQMSQRLASFLLYLSDVEEG 135
E F V+ Y IG Y H D F+ A Y + R+A+ L YL+DVE+G
Sbjct: 58 LADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQG 117
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T +F D GY V P+ G + +Y+L +G D + H +CPVI G
Sbjct: 118 GAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVG 162
Query: 196 EKWVATKWIRDQEQ 209
KWV T+WIR++ Q
Sbjct: 163 SKWVMTEWIREKRQ 176
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 97/213 (45%), Gaps = 21/213 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 59
M+VL P + + ++ + II AK L+ + + + G+ + + R S T+
Sbjct: 326 MEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYA--DYRVSKNTW 383
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPAEYGP 116
I+ ED I I ++ T L + E V Y I +Y+ H+D P +
Sbjct: 384 IA--EDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRPKHFDR 441
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ LLYLSDV+ GG T+F G+ P +G G+ +Y+L
Sbjct: 442 WGGNRIATMLLYLSDVDWGGRTVFT-------------NTAPGVGTDPIKGAGVFWYNLL 488
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG + + H CPV+ G+KWVA WI + Q
Sbjct: 489 RNGKSNPKTQHAGCPVVLGQKWVANLWIHEHGQ 521
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 103/215 (47%), Gaps = 29/215 (13%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + F + + + + TA+ L S + E+V S RT+ G F+
Sbjct: 284 MELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVKKSNNESVVSK--VRTAKGAFM 341
Query: 61 SA---SEDKTGILELIEHKIARATMLPQTHGEAFN---VLRYEIGQKYDSHYDAFNPAEY 114
A S + +++ ++ ++ + L E +N L Y+ G Y H D FN +
Sbjct: 342 HADRLSPESAQVVQRLKQRMGDLSDL-NIKREGYNEMQYLNYDFGDHYLLHMDYFNIS-- 398
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
M+ R+A+FL+YL+DV GG T+FP + V P +G +L+Y+
Sbjct: 399 ---MNDRIATFLIYLNDVTRGGGTIFP---------------QVKQAVHPEKGKLILWYN 440
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ N + SLHG+CPV+ G K WIR+ +Q
Sbjct: 441 MNSNLDYELASLHGACPVLIGRKIAIVYWIREHDQ 475
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 453 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 497
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 TRHAACPVLVGCKWVSNKWFHERGQ 522
>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
ATCC 15444]
Length = 298
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 104/207 (50%), Gaps = 29/207 (14%)
Query: 8 PRA-LY-FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 65
P+A LY +PNF + E C ++IA +RL+ S + RTS + I +
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRASTTT-----DAFADPKIRTSRSSDI-GTMG 153
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQ 120
+++L E IA A + ++ +A RY++ Q+Y +HYD F P Q+ Q
Sbjct: 154 HNLVMQLDE-LIAEALGIHWSYSDATQTQRYDVNQEYKAHYDYFTPGTRDYQVHCQFTGQ 212
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R +F++YL+DVEEGG T F + + + P +G +++ +L P+G+
Sbjct: 213 RTWTFMIYLNDVEEGGGTRF---------------RRLEKTIMPEKGKAVIWNNLNPDGS 257
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
++ ++H V G K+V TKW R++
Sbjct: 258 VNPYTIHHGMKVRSGAKYVITKWFRER 284
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
++ + ++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 393 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 451
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 452 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 496
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
+ H +CPV+ G KWV+ KW ++
Sbjct: 497 RTRHAACPVLVGCKWVSNKWFHER 520
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 256
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 257 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 314
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 315 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 359
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 360 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 392
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
++ + ++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 452
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 453 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 497
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQ 207
+ H +CPV+ G KWV+ KW ++
Sbjct: 498 RTRHAACPVLVGCKWVSNKWFHER 521
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 103/210 (49%), Gaps = 30/210 (14%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 67
P + + + AS ++ +++ AK RL+ + + Q + +T R S ++ + E
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKL-TTAQYRVSKSAWLGSHEHP- 381
Query: 68 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 119
I++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 382 -IVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTG---- 436
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A++LLY+SDV+ GG N +F D IG V P++G + +Y+L +G
Sbjct: 437 NRIATWLLYMSDVQAGG-------NTVFTD--------IGAVVWPKKGTAVFWYNLHRSG 481
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 482 EGDYRTRHAACPVLVGNKWVSNKWIHERGQ 511
>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
Length = 454
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 97/210 (46%), Gaps = 19/210 (9%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + + + + + + KRL+ ++ AL Q + RTS T++
Sbjct: 254 MEILSLNPYIVLCHDVILPSEQEFLKTQSSKRLEGAR-ALDQVKNEVVFNFIRTSKATWL 312
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQMS 119
+ D + + H I + L G+ + ++ Y +G +++H D E +
Sbjct: 313 KKNSD--NVTRRLSHWIEDVSNLDSNIGDLYQIINYGVGGLFEAHSDTMRKDEDRWKVLY 370
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+F+ YL DV +GG T+F + L V P+ G L +++L G
Sbjct: 371 DRIATFIFYLQDVPQGGATLF---------------NNLNLTVFPKAGAALFWFNLDNAG 415
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D ++H CPVI G KW+ TKW+ D Q
Sbjct: 416 DTDLFTVHTGCPVIVGSKWIMTKWVYDLGQ 445
>gi|159474434|ref|XP_001695330.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275813|gb|EDP01588.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1887
Score = 87.4 bits (215), Expect = 3e-15, Method: Composition-based stats.
Identities = 59/211 (27%), Positives = 94/211 (44%), Gaps = 25/211 (11%)
Query: 5 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 64
S PR L F C ++ A A RL +R + + +R S TF +
Sbjct: 1687 SLSPRVLVVDGFLPPGLCDALCAVAAPRL------IRSRVSTGAETPSRVSQSTFFTGDS 1740
Query: 65 DKTGILELIEHKIARATMLPQT---------HGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
+ + +E ++ P+ EA V+ Y++G Y HYD + G
Sbjct: 1741 ARLPEVVAVEARLQALMERPEVTAGGRPTLVKSEALQVVSYDVGGFYSEHYDN----KTG 1796
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
+S R A+ ++YL D + GG T FP + + GL+V P +G L+F+S
Sbjct: 1797 GVIS-RAATIIIYLQDTQAGGSTHFPNQQLRLMRVARP-----GLRVYPAKGRALIFWSR 1850
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
P+G+ D SLH + PV G KW+ T+W ++
Sbjct: 1851 LPDGSEDLASLHSAEPVRAGSKWICTRWFKE 1881
>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
Length = 263
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 96/203 (47%), Gaps = 31/203 (15%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 72
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 73 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 125
++++I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDNRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
++YL+D +GG T F + + VKP++G L + +L P+G + S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNS 240
Query: 186 LHGSCPVIKGEKWVATKWIRDQE 208
+H + PV +G K V TKW R +
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 71/141 (50%), Gaps = 15/141 (10%)
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 128
+++ I +++ + L T E V+ Y IG Y+ HYD R+A+FL Y
Sbjct: 13 LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIATFLSY 72
Query: 129 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
LSDVE GG T+F +G V P++GD +Y+L +G D ++ H
Sbjct: 73 LSDVEAGGGTVFTR---------------VGATVWPQKGDAAFWYNLKRSGDGDSSTRHA 117
Query: 189 SCPVIKGEKWVATKWIRDQEQ 209
+CPV+ G KWVA KWI + Q
Sbjct: 118 ACPVLVGSKWVANKWIHEVGQ 138
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 26/209 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
PR + + N S E+ I AK +L ++ +R +T V S R S ++ ++D
Sbjct: 338 PRIVRYLNALSDEEIAKIKELAKPKL--ARATVRDPKTGVLSVANYRVSKSAWLEENDDP 395
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
++ + ++ T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 396 --VIARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTDGNRLA 452
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 453 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 497
Query: 184 TSLHGSCPVIKGEKWVATKWIRDQEQHED 212
+ H +CPV+ G KW KW Q+ H D
Sbjct: 498 RTRHAACPVLVGSKW--GKWTHTQDHHFD 524
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 97/218 (44%), Gaps = 32/218 (14%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF- 59
+++L + P A+ F N S + + I A +LK + TV+++K T+
Sbjct: 318 VEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRA--------TVQNSKTGELEHATYR 369
Query: 60 ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
IS S D +++ + +I T L Q E V Y +G YD H+D E
Sbjct: 370 ISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARKEEKN 429
Query: 116 P----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 171
R+A+ L Y+S E GG T+F +G V P + D L
Sbjct: 430 AFKTLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDALF 474
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+Y+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 475 WYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQ 512
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 65/118 (55%), Gaps = 18/118 (15%)
Query: 89 EAFNVLRYEIGQKYDSHYDAFNPAEY--GPQMSQRLASFLLYLSDVEEGGETMFPFENGI 146
E NV Y +G + HYD + P Y G M L + L Y+SD+++GG T+FP
Sbjct: 409 EELNVANYGLGTIFGPHYD-YTPENYDIGWFMGGPLGTILFYVSDLQQGGATIFP----- 462
Query: 147 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
I + V PR+G LL+++L+ +G D +LH SCPVI+G++W TKW+
Sbjct: 463 ----------SINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWTLTKWV 510
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 497
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 498 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 530
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 497
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 498 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 530
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 103/222 (46%), Gaps = 31/222 (13%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++L+ +P + F + S + +++ A +L+ + +A + + S R S +++
Sbjct: 310 EMLNRKPHIVLFHDVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKRISQVSWLW 369
Query: 62 ASEDKTGILELIEHKIARATMLPQT-------HGEAFNVLRYEIGQKYDSHYDAF----- 109
I +L ++A T L QT E F +L Y IG +Y+ H D F
Sbjct: 370 DDHANKTIHQL-SRRVADITGL-QTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHS 427
Query: 110 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 167
+ E+ RLA+F+ YL+DV GG T+FP K +G + P +
Sbjct: 428 HSSLPEHVRASGNRLATFMFYLNDVHAGGATVFP-------------KLKVG--IPPTKN 472
Query: 168 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+Y++ NG +D + H CPV+ G+KWVA KWI + Q
Sbjct: 473 GAAFWYNIGLNGDVDPLTEHAGCPVLLGQKWVANKWIHEHGQ 514
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 406 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 451 TRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 406 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 451 TRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 79/158 (50%), Gaps = 20/158 (12%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD----A 108
R S ++S E +++ +E +IA T L E F V Y + +YD H+D
Sbjct: 339 RISKNCWLSGREHGE-VIDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDFSRDL 397
Query: 109 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 168
N + R+A+ L+++S VE GG T+FP+ +G ++ P++GD
Sbjct: 398 ANSSLGSLGTGNRIATVLVWMSQVESGGATVFPY---------------VGARILPQKGD 442
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 206
+ +++L +G D + H CPV+ G KWVA KWI +
Sbjct: 443 AVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWIHE 480
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 368 EVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 424
Query: 62 ASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L H E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 425 DTADP--VLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 482
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 483 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAALFWWNLH 527
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 528 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 560
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 95/213 (44%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++L + P A+ F + + E+ I A RL+ + + +E T RTS ++
Sbjct: 325 VEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGELE-TASYRTSKSAWL 383
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
E + ++ I +I T L Q E V Y IG YD H+D E S
Sbjct: 384 KDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSL 441
Query: 120 ---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
RLA+ L Y++ E GG T+F + V P + D L +Y+L
Sbjct: 442 NTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWYNLL 486
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 487 RSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASEDK 66
P + + + S E+ + I +K +L S+ +R +T R S +++ ED
Sbjct: 337 PHIVRYYDVLSDEEIEKIKEISKPKL--SRATVRDPKTGHLIVVSYRISKSSWLK--EDD 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
I+ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 393 DPIIAQVNRRMQYITGLSVKTAELLQVSNYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G D
Sbjct: 453 FLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTSVFWYNLFRSGECDYR 497
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 TRHAACPVLVGSKWVSNKWFHERGQ 522
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 99/217 (45%), Gaps = 30/217 (13%)
Query: 5 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 64
S+ P F + S E+ ++I AK L S + + G E + RTS ++ E
Sbjct: 316 SFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVS-NVRTSKTAWLP--E 372
Query: 65 DKTGILELIEHKIARATMLP----QTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYG--- 115
+L + +I T L + E V Y IG Y H+D + A++
Sbjct: 373 GLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMKDKADFEYMH 432
Query: 116 ---PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
Q R+A+F+ YL+DVE GG T FP G+ VKP +G +
Sbjct: 433 HRELQAGDRIATFMFYLNDVERGGSTAFP---------------RAGVAVKPVKGGAAFW 477
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++L +G D +LHG+CPV+ G KWV+ KWIR+ Q
Sbjct: 478 FNLKRSGKPDPLTLHGACPVLLGHKWVSNKWIRETAQ 514
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 340 EVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEK-QLQVDYRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 455 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 500 RNGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|345324764|ref|XP_001505668.2| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
[Ornithorhynchus anatinus]
Length = 495
Score = 87.0 bits (214), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 88/195 (45%), Gaps = 34/195 (17%)
Query: 44 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---HGEAFNVLRYEIGQ 100
+ V+ + R S T++ E ++ I+ ++ R T LPQ H E V+RY+ G
Sbjct: 259 QKVKMSDLVRNSQHTWLYQGEGAHQVMRSIQQRVLRLTRLPQEIVEHSEPLQVVRYDQGG 318
Query: 101 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP----- 141
Y +H D+ F E P + S R + L YL++V GGET FP
Sbjct: 319 HYHAHMDSGPVFPETACSHTKFITNETAPFETSCRYVTVLFYLNNVTGGGETTFPVADNR 378
Query: 142 -------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-----IDRTSLHGS 189
+N I L + L+VKP++G + +Y+ +G +D SLHG
Sbjct: 379 TYDEMSLIQNDIDLRDTRKHCDKGNLRVKPKQGTAVFWYNYLSDGQGWVGDLDEYSLHGG 438
Query: 190 CPVIKGEKWVATKWI 204
C V +G KW+A WI
Sbjct: 439 CLVTQGTKWIANNWI 453
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 95/213 (44%), Gaps = 22/213 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++L + P A+ F + + E+ I A RL+ + + +E T RTS ++
Sbjct: 325 VEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGELE-TASYRTSKSAWL 383
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS- 119
E + ++ I +I T L Q E V Y IG YD H+D E S
Sbjct: 384 KDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSL 441
Query: 120 ---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
RLA+ L Y++ E GG T+F + V P + D L +Y+L
Sbjct: 442 NTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWYNLL 486
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 487 RSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V R L +++L
Sbjct: 455 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 500 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 279 EVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 335
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 336 DTADP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 393
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 394 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAALFWWNLH 438
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 439 RSGEGDSDTLHAACPVLVGDKWVANKWIHEYGQ 471
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 16/134 (11%)
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 135
+ AT L T+ E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+G
Sbjct: 397 LKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAIFYLSEVEQG 456
Query: 136 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 195
G T FPF + + VKP+ G+ L +Y+L + D + H CPV+KG
Sbjct: 457 GATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKG 501
Query: 196 EKWVATKWIRDQEQ 209
KW+ WI + Q
Sbjct: 502 SKWIGNVWIHEVTQ 515
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + + +F S + Q+I A+ L+ S +A GE + R S ++
Sbjct: 274 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 330
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 331 DTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 388
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 389 NSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 433
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 434 RSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 466
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VL P + + +F S + Q I A L+ S +A GE + + R S ++
Sbjct: 355 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRISKSAWLK 411
Query: 62 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 412 DTVDP--MLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRM 469
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 470 NSGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVVKNAALFWWNLH 514
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 515 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 547
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 104/213 (48%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+++S +P + + +F + + + I + A L+ S +A GE ++T R S ++
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVA--AGEK-QATADYRISKSAWLK 369
Query: 62 ASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
S I+ ++ +I+ T L H GE V+ Y IG Y+ H+D A +P+ +
Sbjct: 370 GS--AQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + +++L
Sbjct: 428 KTGNRVATFMIYLSPVEAGGSTAFIYAN---------------FSVPVVEKAAIFWWNLH 472
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 473 RNGEGDDDTLHAGCPVLIGDKWVANKWIHEYGQ 505
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + + +F S + Q+I A+ L+ S +A GE + R S ++
Sbjct: 340 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 455 NSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 500 RSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 532
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V R L +++L
Sbjct: 455 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 500 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 102/215 (47%), Gaps = 25/215 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
M+ LS P + + N S + IA ++ +P ++ GE S K RT+ G +
Sbjct: 325 MEELSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAW 380
Query: 60 ISASEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 116
I +++ I +I T L G+ +++Y G YD+H+D N +
Sbjct: 381 IPDENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLPIT 440
Query: 117 Q-MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
Q + R+A+ L YL+DV+ GG T+FP + LKV RG L++Y++
Sbjct: 441 QALGDRMATVLFYLNDVKHGGSTVFP---------------VLQLKVPSERGKVLVWYNM 485
Query: 176 F-PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+D +LHGSCPVI G K V + WI + +Q
Sbjct: 486 HGETHDLDSRTLHGSCPVIDGAKTVLSCWIHEWDQ 520
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+ LS +P + + +F S + + I A+ L+ S +A R + T R S ++
Sbjct: 81 ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQV---TAEYRISKSAWLK 137
Query: 62 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
S + ++ +I+ T L HGE V+ Y IG Y+ H+D A +P+ +
Sbjct: 138 GSAQSA--VSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 195
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+ ++YLS VE GG T F + N V + + +++L
Sbjct: 196 KTGNRVATVMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 240
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
NG D +LH CPV+ G+KWVA KWI + Q
Sbjct: 241 RNGRGDPDTLHAGCPVLIGDKWVANKWIHEYGQ 273
>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
pv. vasculorum NCPPB 702]
Length = 301
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 89/194 (45%), Gaps = 22/194 (11%)
Query: 18 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDKTGILELIEHK 76
SA++C+ ++ A+ L+ SQ+ + + RTS G T ED + + +
Sbjct: 121 SADECRLLMLLARPHLRDSQV-IDPNDASTQRAPVRTSRGATLDPIIEDFAA--RVAQAR 177
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSDVE 133
+A L TH E +VL Y G++Y +H D P A P R + +YL+ V+
Sbjct: 178 LAACAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNVVD 237
Query: 134 EGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVI 193
GGET FP G++V+PR G + F +L +G + SLH PV
Sbjct: 238 AGGETEFPLA---------------GVRVQPRPGALVCFDNLHADGRPNADSLHAGLPVT 282
Query: 194 KGEKWVATKWIRDQ 207
G KW+ T W R Q
Sbjct: 283 AGSKWLGTLWFRQQ 296
>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
Length = 263
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 95/203 (46%), Gaps = 31/203 (15%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 72
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 73 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 125
++ +I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
++YL+D +GG T F + + VKP++G L + +L P+G + S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNS 240
Query: 186 LHGSCPVIKGEKWVATKWIRDQE 208
+H + PV +G K V TKW R +
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263
>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 169
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 71/110 (64%), Gaps = 3/110 (2%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 60
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 110
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFS 165
>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 504
Score = 86.3 bits (212), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 97/211 (45%), Gaps = 56/211 (26%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR + + + S E+ + AK RL+ + + S
Sbjct: 330 KPRIVRYHDIISDEEISKVKELAKPRLRRATI------------------------SNPI 365
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
TG+LE +++I++ + + V Y +G +Y+ H+D AF G
Sbjct: 366 TGVLETAQYRISKRWAIME-----LEVANYGMGGQYEPHFDFARKDEPDAFKELGTG--- 417
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A++L Y+SDVE GG T+FP +G V P++G + +Y+LF +
Sbjct: 418 -NRVATWLFYMSDVEAGGATVFP---------------EVGAAVYPKKGTAVFWYNLFES 461
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 462 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 492
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 86.3 bits (212), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 102/205 (49%), Gaps = 22/205 (10%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLEEEDDP 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 124
++ + ++ + T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 --VVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 125 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 184
FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSGEGDYR 496
Query: 185 SLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
Length = 542
Score = 86.3 bits (212), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 95/210 (45%), Gaps = 19/210 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFI 60
++LS P L + S ++ I ++K+ + PS E+ T RTS +
Sbjct: 326 EILSVDPFVLLLHDMISQKESTLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKSVWY 385
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS 119
S+ + T + I ++ AT L E + V+ Y +G +++H D +
Sbjct: 386 SSDFNDTT--KKITERLGDATGLDTNFTEFYQVINYGLGGFFETHLDMLLSEKNRFNGTR 443
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L YL++V +GG T FP I L V P+ G L +Y+L NG
Sbjct: 444 DRIATTLFYLNEVRQGGGTYFPR---------------INLTVFPQPGSALFWYNLDTNG 488
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
SLH CPVI G KWV +KWI D Q
Sbjct: 489 NDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 518
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 20/211 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + N + ++ I +K +L S + G + + RTS +I
Sbjct: 333 EVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRTSKSAWIE 392
Query: 62 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQM 118
E ++ + + + T L E F V+ Y IG Y+ H+D P E + P++
Sbjct: 393 DEEHP--MIRRVSERTSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIATFDPEV 450
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+ + + Y++ E GG T+FP +G+K+ P +G ++++L N
Sbjct: 451 GNRIITVIFYVAAPEAGGATVFP---------------DLGVKLWPEKGSCAVWWNLMRN 495
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H CP I G KW+A KW ++ Q
Sbjct: 496 GEGDYRTKHAGCPTITGSKWIANKWYHERGQ 526
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 95/210 (45%), Gaps = 26/210 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 66
PR + + + ++ ++I A+ R K + + + GE R S ++ E K
Sbjct: 349 PRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGEL--EIANYRISKSAWLQEHEHK 406
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRL 122
+ + ++ T + E V+ Y IG Y+ H+D E S R+
Sbjct: 407 H--VRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRI 464
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ L Y+SDVE+GG T+F I + + P++G +Y+L PNG D
Sbjct: 465 ATVLYYMSDVEQGGGTVFT---------------KINISLWPKKGSAAFWYNLKPNGEGD 509
Query: 183 RTSLHGSCPVIKGEKWVATKWI--RDQEQH 210
+ H +CPV+ G KWVA KW+ R QE H
Sbjct: 510 YKTRHAACPVLTGSKWVANKWLHERGQEFH 539
>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
Length = 557
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 112/241 (46%), Gaps = 46/241 (19%)
Query: 3 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 62
+S P F NF +C+ + A K LK S++ T RTSS F+
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRV------TDGKLSNGRTSSSCFLIG 371
Query: 63 SEDKTGILELIEHKI---ARATMLPQTH---------GEAFNVLRYEIGQKYDSHYDAFN 110
++ K +++ IE ++ R+T + T E ++RY +KY SH+D N
Sbjct: 372 AKGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYGKNEKYTSHFD--N 429
Query: 111 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----------YKKCI-- 158
A +R+A+F+ YLSD EGG T FP +FL+ +D KK +
Sbjct: 430 KA----GSFRRVATFMCYLSDQCEGGCTNFPKAEPLFLEPSFDEHGAFKPFGRKKKTVAS 485
Query: 159 ---GLKVKPRRGDGLLFYSL----FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 211
G+K+ P+ G +LF+S+ F + SLH V KGEK++ TKW+ E+ E
Sbjct: 486 EQHGVKIHPKLGRAILFFSISEEPFRENPL---SLHEGQTVRKGEKFICTKWLTRTEESE 542
Query: 212 D 212
+
Sbjct: 543 N 543
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 96/218 (44%), Gaps = 32/218 (14%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF- 59
+++L + P A+ F N + + I A +LK + TV+++K T+
Sbjct: 318 VEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRA--------TVQNSKTGELEHATYR 369
Query: 60 ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
IS S D +++ + +I T L Q E V Y +G YD H+D E
Sbjct: 370 ISKSAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEEKN 429
Query: 116 P----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 171
R+A+ L Y+S E GG T+F +G V P + D L
Sbjct: 430 AFKTLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDALF 474
Query: 172 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+Y+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 475 WYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQ 512
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 102/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 310 EVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 366
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 367 DTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 424
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 425 NSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 469
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH +CPV+ G+KWVA KWI + Q
Sbjct: 470 RSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 502
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 324 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 380
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 381 DTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 438
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V R L +++L
Sbjct: 439 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAALFWWNLH 483
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 484 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 516
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V R L +++L
Sbjct: 455 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 500 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 101/217 (46%), Gaps = 30/217 (13%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++V+ RP F + S ++ Q++I + RLK + + + +E R S ++
Sbjct: 319 LEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELE-VANYRISKSAWL 377
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPA 112
+ + ++E + + T L E V+ Y IG Y++H+D AF
Sbjct: 378 KNHDHE--VVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKDAFKQL 435
Query: 113 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
G R+A+++ Y+SDV+ GG T+FP +GL V P +G +
Sbjct: 436 GTG----NRIATWINYMSDVKAGGATVFPR---------------LGLTVWPEKGSAAFW 476
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++L +G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 477 WNLHRSGEGDILTRHAACPVLAGSKWVSNKWFHERGQ 513
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 313 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 369
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 370 DTVDPK--LVALNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 427
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V R L +++L
Sbjct: 428 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAALFWWNLH 472
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 473 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 505
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 95/207 (45%), Gaps = 22/207 (10%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
RP + + + S + + I A+ R + + + + +E R S ++ +ED+
Sbjct: 348 RPYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFA-NYRISKSAWLKDAEDE 406
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRL 122
++ I ++ T L E V+ Y IG Y+ H+D E S R+
Sbjct: 407 --MIRTISQRVEDMTGLTMETAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRI 464
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+ L Y+SDV +GG T+FP + L + PR+G +++L +G D
Sbjct: 465 ATVLFYMSDVTQGGATVFP---------------SLNLALWPRKGTAAFWFNLHASGRGD 509
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 510 YATRHAACPVLTGTKWVSNKWIHERGQ 536
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 97/195 (49%), Gaps = 22/195 (11%)
Query: 18 SAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHK 76
S E+ + I AK +L ++ +R +T V + R S +++ ED ++ + +
Sbjct: 2 SDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRR 57
Query: 77 IARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLASFLLYLSDVEE 134
+ T L E V Y +G +Y+ H+D + P + G + RLA+FL Y+SDVE
Sbjct: 58 MQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEA 117
Query: 135 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 194
GG T+FP +G + P++G + +Y+L +G D + H +CPV+
Sbjct: 118 GGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLV 162
Query: 195 GEKWVATKWIRDQEQ 209
G KWV+ KW ++ Q
Sbjct: 163 GCKWVSNKWFHERGQ 177
>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
Length = 491
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 66/128 (51%), Gaps = 22/128 (17%)
Query: 89 EAFNVLRYEIGQKYDSHYDAFNPAE----YGPQMSQ---RLASFLLYLSDVEEGGETMFP 141
EA V+ Y IG +Y+ H D + E P + R+++FL YLS V GG T+FP
Sbjct: 364 EAMQVVNYGIGGQYEPHLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLSRVHLGGATVFP 423
Query: 142 FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVAT 201
N ++V P + +Y+ PNG D+ +LH CPV+ GEKWVA
Sbjct: 424 KLN---------------VRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKWVAN 468
Query: 202 KWIRDQEQ 209
KWIR++ Q
Sbjct: 469 KWIRERGQ 476
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 80/161 (49%), Gaps = 22/161 (13%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AF 109
R S ++ ED ++ I + + T L T E V+ Y IG +Y+ H+D
Sbjct: 380 RISKSGWLRDEEDP--LIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFDFSRRS 437
Query: 110 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 169
P + R+ + + Y++DVE GG T +FLD+G +KV P +G
Sbjct: 438 EPTAFEKWRGNRILTVIYYMTDVEAGGAT-------VFLDAG--------VKVYPEKGSA 482
Query: 170 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI--RDQE 208
++++L P+G D + H +CPV+ G KWVA KW RDQE
Sbjct: 483 AVWHNLLPSGEGDMRTRHAACPVLTGSKWVANKWFHERDQE 523
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 344 EVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 400
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P Y
Sbjct: 401 DTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRM 458
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 459 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAALFWWNLH 503
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 504 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 536
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 92/203 (45%), Gaps = 32/203 (15%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDK 66
+ + S +C+ +IA LKPS + V+ G RTS I +
Sbjct: 181 YESILSEYECRYLIAKFSALLKPSMV-------VDPVTGRGKIDSVRTSYVAVIEPTHCD 233
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
I ++ I++ T + +GEA N+LRY GQ+Y HYD N QR+
Sbjct: 234 -WITRKLDKIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDALMFKDGKQRIK 292
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L+YL+ + EGGET+FP + +++ P+ G ++F + NG +
Sbjct: 293 TALVYLNTINEGGETLFPK---------------LDIRIAPKSGTMVVFSNSDENGKLLL 337
Query: 184 TSLHGSCPVIKGEKWVATKWIRD 206
S H P + KW+ TKWIR+
Sbjct: 338 NSYHAGAPTVSENKWLVTKWIRE 360
>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
Length = 446
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 19/157 (12%)
Query: 53 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 112
R+ FI +K +++ IE ++ + L + +++ Y IG Y H+D+F+
Sbjct: 296 RSGKNVFIEL--EKGELVKTIEMRVTDMSGLSMEGSDDLSLINYGIGGHYIPHHDSFSEE 353
Query: 113 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
E + R+A+ L YLSDVE GG T FP N L + P +G +L+
Sbjct: 354 E--NKTEDRIATALFYLSDVELGGATTFPLLN---------------LTISPEKGTAVLW 396
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
++L +GT ++H +CPVI G K+V TKWI + +Q
Sbjct: 397 HNLKDSGTPHPKTVHAACPVIVGSKYVMTKWIYNMDQ 433
>gi|323456313|gb|EGB12180.1| hypothetical protein AURANDRAFT_61447 [Aureococcus anophagefferens]
Length = 317
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 97/217 (44%), Gaps = 33/217 (15%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSSGTFI 60
+VLS P A +FA+ +C IIA A RL AL G+ E +R++ ++
Sbjct: 103 EVLSTAPLAFCVRDFATGAECDRIIAEATPRL---SAALVAGDGAGEQAGSSRSAQVAWV 159
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---------NP 111
S D + ++A +P +H E+ V++Y G +Y H+DAF
Sbjct: 160 PRSPDD----PWLARRVAELIDVPLSHAESLQVVKYGAGGEYKPHFDAFPLDAARGRRAA 215
Query: 112 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 171
QR + +LYL+DVE+GG T F E V+PRRG +
Sbjct: 216 VRGRTYAGQRRVTAILYLNDVEKGGGTAFHSETP------------AEFVVRPRRGSLFV 263
Query: 172 FYSLFPNGTIDR--TSLHGSCPVIK-GEKWVATKWIR 205
FY+ + + T DR SLH PV G KW+A W R
Sbjct: 264 FYNCYEDST-DRHPMSLHAGLPVAPGGTKWIANIWWR 299
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 100/207 (48%), Gaps = 28/207 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + N S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 355 PHIVRYYNVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 412
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 122
++ + ++ T L E V Y +G +Y+ H+D E P +RL
Sbjct: 413 --VVAKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEE--PDAFKRLGTGN 468
Query: 123 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G
Sbjct: 469 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 513
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D + H +CPV+ G KWV+ KW ++
Sbjct: 514 GDYRTRHAACPVLVGCKWVSNKWFHER 540
>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
Length = 263
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 95/203 (46%), Gaps = 31/203 (15%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 72
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 73 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 125
++ +I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKTHCLSRGQRTWTC 195
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
++YL+D +GG T F + + V+P++G L + +L P+G + S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVRPKKGMALFWNNLLPSGDPNLNS 240
Query: 186 LHGSCPVIKGEKWVATKWIRDQE 208
+H + PV +G K V TKW R +
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 67/138 (48%), Gaps = 19/138 (13%)
Query: 73 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YGPQMSQRLASFLLY 128
I +I+ T L E V Y +G +Y H+D E Q +R+A+FL+Y
Sbjct: 381 ITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQDGERIATFLIY 440
Query: 129 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 188
LSDVE GG T F G+ KP +G + +Y++FP+G D + HG
Sbjct: 441 LSDVEVGGRTAFV---------------NAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHG 485
Query: 189 SCPVIKGEKWVATKWIRD 206
+CPV G KW KWIR+
Sbjct: 486 ACPVAFGNKWAGNKWIRE 503
>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
Length = 533
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 101/215 (46%), Gaps = 24/215 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++LS P F + A + +I + LK + + RT++G++I
Sbjct: 316 LELLSKDPYIAVFHDVIYASEIAELIRIGEPMLKRTAVQNITQNVDTYISKDRTATGSWI 375
Query: 61 ---SASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
+ ++ + ++ I+ +I T L T + +L Y G Y SHYD FN +
Sbjct: 376 LNGNLTKLERNMIWRIQRRIEDMTGLLITGFSEQDLQLLNYVFGGHYQSHYDFFNCPSFP 435
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
R+A+ L+YL+DV GG T+FP + L V+P RG L +Y++
Sbjct: 436 ---HDRIATTLIYLNDVVRGGATVFP---------------KLDLVVQPERGKVLHWYNM 477
Query: 176 FPNG-TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P+ DR SLHG CPV+ GEK T WI + +Q
Sbjct: 478 LPDTFDYDRRSLHGGCPVLIGEKLALTNWIYEWDQ 512
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 96/215 (44%), Gaps = 23/215 (10%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++VLS +P + + N + + + A LK + + + + R S ++
Sbjct: 345 VEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTAWL 404
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP------AEY 114
ED + + I I L E + Y IG Y+ H D +EY
Sbjct: 405 D-KEDHPAV-KRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALSEY 462
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
++ R+A+ L+YLS+VE GG T+FP G++V+PR+G +Y+
Sbjct: 463 TSRIGNRIATVLIYLSNVEAGGATVFP---------------KAGVRVEPRQGSAAFWYN 507
Query: 175 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ NG ++ S+H +CPV+ G KW A W R+ Q
Sbjct: 508 MHRNGEGNKLSVHAACPVLIGSKWAANLWFREVGQ 542
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VL P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 330 EVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 386
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 387 DTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 444
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V + L +++L
Sbjct: 445 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAALFWWNLH 489
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G KWVA KWI + Q
Sbjct: 490 RSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQ 522
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 80 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 138
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 139 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 198
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
Query: 199 VATKWIRDQEQ 209
+ WI + Q
Sbjct: 505 IGNVWIHEVTQ 515
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+VL P + + +F S + Q I A L+ S +A GE + + R S ++
Sbjct: 317 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRISKSAWLK 373
Query: 62 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D IL ++ +IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 374 DTVDP--ILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRM 431
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 432 NSGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVVKNAALFWWNLH 476
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 477 RSGQGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 509
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 80 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 138
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 139 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 198
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
Query: 199 VATKWIRDQEQ 209
+ WI + Q
Sbjct: 505 IGNVWIHEVTQ 515
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 96/209 (45%), Gaps = 33/209 (15%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFI 60
++LS P + + + + + ++ +K +K + + V RTS+ ++
Sbjct: 231 ELLSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSNSVWL 290
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 120
++ E+ ++E +E ++ T + E + ++ Y IG Y H D F PQ
Sbjct: 291 TSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TPQ--- 341
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
LSDV +GG T+FP N + V+PR+GD LL+Y+L G
Sbjct: 342 --------LSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLNDRGQ 378
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ ++H SCP+IKG KW KWI + Q
Sbjct: 379 GEIGTVHTSCPIIKGSKWALVKWIDELSQ 407
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 80 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 138
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 139 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 198
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504
Query: 199 VATKWIRDQEQ 209
+ WI + Q
Sbjct: 505 IGNVWIHEVTQ 515
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 102/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ RP + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 305 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 361
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 362 DTVDP--MLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 419
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N V + L +++L
Sbjct: 420 KSGNRVATFMIYLSAVEAGGATAFIYAN---------------FSVPVVKNAALFWWNLH 464
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 465 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 497
>gi|334140935|ref|YP_004534141.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
gi|333938965|emb|CCA92323.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
Length = 209
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
+F QC ++IA + +PS +A G+ V RTSS +S L
Sbjct: 32 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPDVPAVAALA--- 83
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 129
K+ + + H E RYE+GQ++ +H D F P +Y QR +F++YL
Sbjct: 84 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 143
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
+DV+ GG T F K I ++P RG + + + P+G+++ +LH +
Sbjct: 144 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 188
Query: 190 CPVIKGEKWVATKWIRDQ 207
V +G K+V T+W R++
Sbjct: 189 MKVRQGRKYVVTQWFRER 206
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 80 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 138
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 330 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 389
Query: 139 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 198
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 390 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 434
Query: 199 VATKWIRDQEQ 209
+ WI + Q
Sbjct: 435 IGNVWIHEVTQ 445
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 80 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 138
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 50 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 109
Query: 139 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 198
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 110 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 154
Query: 199 VATKWIRDQEQ 209
+ WI + Q
Sbjct: 155 IGNVWIHEVTQ 165
>gi|332187533|ref|ZP_08389270.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
gi|332012462|gb|EGI54530.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
Length = 228
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 98/200 (49%), Gaps = 28/200 (14%)
Query: 12 YFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE 71
Y +F + QC ++IA +PS L + G RTS ++ + ++
Sbjct: 47 YQADFLTPAQCDALIAMIDANRRPSTLL-----SDRPDYGFRTSESCDMNRWSPE---VQ 98
Query: 72 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASFL 126
I+ IA+ +P GE RY GQ++ +H+D F+ +E Y ++ QR + +
Sbjct: 99 PIDESIAQLLGIPPEQGETMQGQRYAPGQQFRAHHDYFHESESYWEKVKVHGGQRTWTAM 158
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+YL+DV EGG T FP G++V PRRG L + ++ +G+ + +L
Sbjct: 159 IYLNDVPEGGATWFP---------------QAGIRVAPRRGLLLAWNNMLLDGSPNDATL 203
Query: 187 HGSCPVIKGEKWVATKWIRD 206
H PV++G K+V TKW R+
Sbjct: 204 HEGMPVVEGVKYVITKWFRE 223
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 101/207 (48%), Gaps = 28/207 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 340 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 395
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 122
++ + ++ + T L E V Y +G +Y+ H+D E P +RL
Sbjct: 396 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDE--PDAFKRLGTGN 453
Query: 123 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G
Sbjct: 454 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 498
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D + H +CPV+ G KWV+ KW ++
Sbjct: 499 GDYRTRHAACPVLVGCKWVSNKWFHER 525
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRV 452
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP G + P++G + +Y+LF +G D
Sbjct: 453 ATFLNYMSDVEAGGATVFP---------------DFGATIWPKKGTSVFWYNLFRSGEGD 497
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWFHERGQ 524
>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
Length = 521
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 85/185 (45%), Gaps = 24/185 (12%)
Query: 31 KRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE 89
KRL P Q G TK T ++ ++ + T LE + +I T
Sbjct: 344 KRLSPQMQNGYIHGYKANQTKVTDIAAR--VNWLVENTPFLERMNQRITDMTGFDLKEFP 401
Query: 90 AFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEN 144
+ V + IG +++HYD + G + RLAS + Y SDV GG T+FP
Sbjct: 402 SVQVANFGIGNNFEAHYDYIFGKRVRKEDVG-DLGDRLASIIFYSSDVPLGGATVFP--- 457
Query: 145 GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
I + V+P++G+ LL+Y+LF +GT D SLH CPV+ G +W TKW+
Sbjct: 458 ------------DIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRWTLTKWL 505
Query: 205 RDQEQ 209
Q
Sbjct: 506 HTSPQ 510
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 93/208 (44%), Gaps = 23/208 (11%)
Query: 5 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS--A 62
S P F + S + + A R++ S + R G + + R S +++ A
Sbjct: 328 SLDPYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKS-AFRVSKNAWLAYEA 386
Query: 63 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQR 121
G+L + AT L T E V Y +G Y+ H+D F +P+ Y R
Sbjct: 387 HPTMAGMLR----DLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGNR 442
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 181
+A+ + YLS+VE+GG T FPF + VKP+ G+ L +Y+L +
Sbjct: 443 IATAIFYLSEVEQGGATAFPF---------------LDFAVKPQLGNVLFWYNLHRSLDK 487
Query: 182 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H CPV+KG KW+ WI + Q
Sbjct: 488 DYRTKHAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 331 EVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 387
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 388 DTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 445
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V + L +++L
Sbjct: 446 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAALFWWNLH 490
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 491 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 523
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 101/207 (48%), Gaps = 28/207 (13%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 339 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 394
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 122
++ + ++ + T L E V Y +G +Y+ H+D E P +RL
Sbjct: 395 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDE--PDAFKRLGTGN 452
Query: 123 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
A+FL Y+SDVE GG T+FP D+ G + P++G + +Y+LF +G
Sbjct: 453 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 497
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQ 207
D + H +CPV+ G KWV+ KW ++
Sbjct: 498 GDYRTRHAACPVLVGCKWVSNKWFHER 524
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 96/193 (49%), Gaps = 22/193 (11%)
Query: 20 EQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHKIA 78
E+ + I AK +L ++ +R +T V + R S +++ ED ++ + ++
Sbjct: 2 EEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRRMQ 57
Query: 79 RATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLASFLLYLSDVEEGG 136
T L E V Y +G +Y+ H+D + P + G + RLA+FL Y+SDVE GG
Sbjct: 58 HITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGG 117
Query: 137 ETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE 196
T+FP +G + P++G + +Y+L +G D + H +CPV+ G
Sbjct: 118 ATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGC 162
Query: 197 KWVATKWIRDQEQ 209
KWV+ KW ++ Q
Sbjct: 163 KWVSNKWFHERGQ 175
>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'Black Sea 11']
gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Black Sea 11']
Length = 354
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 94/204 (46%), Gaps = 25/204 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFISASED 65
P LY + S +C +I L+PS + L V++ RTS I+ S
Sbjct: 155 PVELYV-DVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNV---RTSYVAIIAPSYC 210
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRL 122
I ++ I++ T P+ +GEA N+LRY GQ+Y HYDA N G QR+
Sbjct: 211 D-WITRKLDKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGSMYKDGKQRI 269
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
+ L+YL+ V +GGET FP + + V P G+ ++F + +G +
Sbjct: 270 KTALVYLNTVRQGGETRFPK---------------LDISVSPTLGNMVVFSNSDESGKLL 314
Query: 183 RTSLHGSCPVIKGEKWVATKWIRD 206
S H P KW+ TKWIR+
Sbjct: 315 LNSYHLGAPTFSENKWLVTKWIRE 338
>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 99/209 (47%), Gaps = 26/209 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSSGT 58
+++ S P + + + + Q +I + ++R+ S Q +RQ E E RTS
Sbjct: 143 LEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSKEA 198
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH---YDAFNPAEYG 115
+ D +L+ I ++ T E ++L Y+ G +D H +D + EY
Sbjct: 199 PFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYEYH 257
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
P R AS + YL+DVE+GGET+FP + L + P +G L++++L
Sbjct: 258 P-FGDRQASVVFYLNDVEDGGETVFP---------------KLQLVIPPTKGSALMWHNL 301
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
P G D + H SCPV+ G K VA +WI
Sbjct: 302 RPWGEGDPRTQHASCPVLSGYKQVAIQWI 330
>gi|397644356|gb|EJK76358.1| hypothetical protein THAOC_01879, partial [Thalassiosira oceanica]
Length = 539
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 96/203 (47%), Gaps = 23/203 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
P + F NF + E+ ++ + +R A GE + TRTSS +
Sbjct: 336 PWVVVFDNFLTDEEVADLVKGGELEGYERSTDQGAANAYGEQEKVVSRTRTSSNAWCMHK 395
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
++ + KI T +PQ + E+F +L+Y+ GQ Y SH+D+ + + P R+
Sbjct: 396 CERLPGVRSASKKIEAVTGIPQVNYESFQLLKYDGGQFYRSHHDS-SSVDDSP-AGHRIL 453
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI-- 181
+F LYLSDVEEGGET F +G+ VKP++G L++ S+
Sbjct: 454 TFFLYLSDVEEGGETYF---------------SKLGIAVKPKKGRALVWPSVLDEDPTYW 498
Query: 182 DRTSLHGSCPVIKGEKWVATKWI 204
D+ H + VIKGEK A WI
Sbjct: 499 DKRMYHEAKDVIKGEKKAANHWI 521
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 78/162 (48%), Gaps = 29/162 (17%)
Query: 52 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 111
RTS ++I SE + ++ T + F+++ Y +G Y HYD
Sbjct: 318 VRTSKDSYIVDSES-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFH-- 368
Query: 112 AEYG----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 167
EY P+ R+A+ L YL +V+ GG T+FP I + V P++G
Sbjct: 369 -EYTNTTRPKQGDRIATVLFYLGEVDSGGATIFP---------------KINIAVTPKKG 412
Query: 168 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ +Y+L +G ++ SLH +CPVI G K+V TKWI + Q
Sbjct: 413 SAVFWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 454
>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
[Meleagris gallopavo]
Length = 518
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 100/204 (49%), Gaps = 34/204 (16%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 65
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 66 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 125
+ ++ I +I T L + E QK + DAF G R+A++
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEEL--------QKDEP--DAFKELGTG----NRIATW 437
Query: 126 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 185
L Y+SDV GG T+FP +G V P++G + +Y+LFP+G D ++
Sbjct: 438 LFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDYST 482
Query: 186 LHGSCPVIKGEKWVATKWIRDQEQ 209
H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 483 RHAACPVLVGNKWVSNKWLHERGQ 506
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 101/212 (47%), Gaps = 25/212 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSSGTF 59
++ LS P YF + S ++ + II K ++ S++ G+T ST RTS T+
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSDIRTSQNTW 394
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQ 117
+ + L I+ ++ T L E ++ Y IG +Y+ H+D + AE +G +
Sbjct: 395 LWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFGWK 452
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
RL + L YL+DV GG T FPF + L V P +G L++Y+L
Sbjct: 453 -GNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYNLHR 496
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ D + H CPV+KG KW+ +W + Q
Sbjct: 497 SLHKDFRTKHAGCPVLKGSKWICNQWFHEAAQ 528
>gi|198449641|ref|XP_002136935.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
gi|198130697|gb|EDY67493.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 95/212 (44%), Gaps = 27/212 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M++LS P + + + + + + + A+K L + + S RT+ ++
Sbjct: 312 MELLSLDPYVVLYHDVLADREMSLLKSMAQKDLVRAS-TYDVMDKKHSEDPNRTTKARWL 370
Query: 61 SASED---KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 117
S + GIL T L E F VL Y IG D H D + + P+
Sbjct: 371 DPSHSLIRRMGIL------TEDMTNLDLERLEDFQVLNYGIGGHDDIHPDYYEGS--NPE 422
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
+ R+A+ L YLSDV GG T+FP + L V P+RG L++Y+L
Sbjct: 423 LPDRVATLLFYLSDVPLGGATVFPL---------------LDLSVFPKRGAVLMWYNLDH 467
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G ++H +CPV+ G +WV TKW+ Q Q
Sbjct: 468 KGQGIEKTVHSACPVVVGSRWVMTKWVNQQPQ 499
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 98/208 (47%), Gaps = 27/208 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
++ LS P + + + A + + I+ AK L+ + + + + R +
Sbjct: 298 LEQLSLDPYMVLYHDVVQANEREHIMQLAKPHLRRALVGAARAHS------QRFAMNAGF 351
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPAEYGP 116
S ++ + G + + ++ + T+ VL Y IG +Y HYD + + A+
Sbjct: 352 SYNDSRQG--QRLRQRLEDMSGFDLTNSGQLAVLNYGIGGQYYMHYDCWFSQDDAAQVAS 409
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
R+A+ LLYL+DV+ GG T FP +GL V+P G L+++++
Sbjct: 410 IKDNRIATILLYLTDVQLGGLTSFP---------------ALGLAVQPSPGSALIWHNMN 454
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
DR +LH +CP++ G +WVAT+WI
Sbjct: 455 NAAECDRRTLHAACPLLLGTRWVATQWI 482
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 80 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 138
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 149 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 208
Query: 139 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 198
FPF + + VKP+ G+ L +Y+L + D + H CPV+KG KW
Sbjct: 209 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 253
Query: 199 VATKWIRDQEQ 209
+ WI + Q
Sbjct: 254 IGNVWIHEVTQ 264
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 99/236 (41%), Gaps = 50/236 (21%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF- 59
+++L + P A+ F N S + + I A +LK + TV+++K T+
Sbjct: 334 VEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRA--------TVQNSKTGELEHATYR 385
Query: 60 ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 115
IS S D ++E + +I T L Q E V Y +G YD H+D A YG
Sbjct: 386 ISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGGHYDPHFDFARIANYG 445
Query: 116 P----------------------QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD 153
R+A+ L Y+S E GG T+F
Sbjct: 446 LGGHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF------------- 492
Query: 154 YKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G V P + D L +Y+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 493 --NHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQ 546
>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
pisum]
Length = 518
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 87/188 (46%), Gaps = 25/188 (13%)
Query: 33 LKPSQLALR--QGETVESTKGT------RTSSGTFISASE-DKTGILELIEHKIARATML 83
LK LAL + TV+S G +T SG S+ D L+ ++ +I T
Sbjct: 334 LKIKTLALENMKDATVKSVDGKGDSLIEKTRSGQVYWISKVDAVEYLDALDTRIESFTGF 393
Query: 84 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 143
E + ++ Y +G Y H+D+F A Q RL + L YL+DV+ G T FP
Sbjct: 394 STKTAEQYQIVNYGLGGHYLPHHDSFAKAINCLQFGNRLVTVLFYLTDVQNDGYTSFPLL 453
Query: 144 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL-FPNGTIDRTSLHGSCPVIKGEKWVATK 202
N I +G L++ +L NG SLHGSCP++KG KW+ T+
Sbjct: 454 NII---------------APAEKGAALVWNNLHMSNGQKFYESLHGSCPLLKGNKWIMTR 498
Query: 203 WIRDQEQH 210
W+ ++ QH
Sbjct: 499 WLYEEGQH 506
>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 531
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 91/200 (45%), Gaps = 31/200 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS-ASE 64
P L F + E + I A RL+PS++ + Q T R S F A E
Sbjct: 342 PDVLVFHEMITEEVAEKIRDVANPRLRPSEVIDPIIQKHV---TASYRVSKNVFFDDAFE 398
Query: 65 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------EYGPQM 118
++ I + + AT L E V Y +G +Y+ H D +P E+G
Sbjct: 399 EELEISRKLRPLVEDATDLNDDFSEQLQVNNYGLGGQYEFHVDFGDPGSPLDKHEHG--- 455
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+ L+YLSDVE GG+T+F +GL +KP+ GD +++L+ N
Sbjct: 456 -NRIATLLIYLSDVERGGDTVFT---------------RLGLSLKPKLGDAAFWHNLYKN 499
Query: 179 GTIDRTSLHGSCPVIKGEKW 198
G+ + H SCPV+ G KW
Sbjct: 500 GSGIYATEHASCPVVSGSKW 519
>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 490
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 96/213 (45%), Gaps = 32/213 (15%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKK----RLKPSQLALRQGETVESTKGTRTSSGTF 59
+S P + F NF + E+C +I K R K G RTS +
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKGRTSENAW 343
Query: 60 ISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---Y 114
S + T +LI +I+ T +P H E F +L+YE GQ Y SH+D E
Sbjct: 344 CSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRRC 403
Query: 115 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 174
GP R+ +F LYLSDVEEGG+T FP + + VKP++G +L+ S
Sbjct: 404 GP----RVLTFFLYLSDVEEGGDTNFPK---------------LSIAVKPKKGSAVLWPS 444
Query: 175 LF---PNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+ P+ RT H + V+ G K+ A W+
Sbjct: 445 VLDSNPSMKDPRTD-HEAQEVVNGTKFGANAWL 476
>gi|397644755|gb|EJK76534.1| hypothetical protein THAOC_01697 [Thalassiosira oceanica]
Length = 475
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 26/206 (12%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRL--KPSQLALRQGE-TVESTKGT-RTSSGTFISAS 63
P + F NF + ++C +I +K + + Q + + +S + T RTS + S
Sbjct: 273 PWVITFENFLTEDECTHMIEQGRKAEYERSEDVGEVQADGSYDSVRSTGRTSENAWCSFR 332
Query: 64 ED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 121
+ I+EL+ +IA+ T + H E F +L+YE GQ Y H+D + + + R
Sbjct: 333 DGCRNDTIVELVHDRIAKVTGIGANHSEDFQILKYEPGQFYRQHHD-YIEHQRDRRCGPR 391
Query: 122 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF---PN 178
+ +F LYLSDVEEGG T FP +G+ VKP+ G LL+ S+ P
Sbjct: 392 VLTFFLYLSDVEEGGATNFPK---------------LGIAVKPKVGRALLWPSVLNSEPR 436
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWI 204
RT H + VI G K+ A WI
Sbjct: 437 NKDGRTD-HEAQDVIAGVKYGANAWI 461
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 32/211 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ + ++ T L E V Y +G +Y+ H+D AF G
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAFKHLGTG--- 448
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +
Sbjct: 449 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 492
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 493 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 32/211 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ I ++ T L E V Y +G +Y+ H+D AF G
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAFKRLGTG--- 448
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +
Sbjct: 449 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 492
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 493 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|399057802|ref|ZP_10744231.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
gi|398041550|gb|EJL34606.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
Length = 210
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 95/198 (47%), Gaps = 28/198 (14%)
Query: 15 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 74
NF +AEQC ++A + +PS +A G+ RTSS +S ++ +
Sbjct: 33 NFVAAEQCAELMALIEDSHRPSTIADYNGD-----DAFRTSSTCDLSTD---VPVVANLA 84
Query: 75 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 129
++R + + H E RYE+GQ++ +H D F P +Y QR +F++YL
Sbjct: 85 AALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPGNADYDKYCAVPGQRTWTFMIYL 144
Query: 130 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 189
++VE GG T F + I ++P G + + + P+GT + +LH +
Sbjct: 145 NEVEAGGATRF---------------RVIDKMIQPEIGKLIAWNNRRPDGTPNAATLHHA 189
Query: 190 CPVIKGEKWVATKWIRDQ 207
V KG K+V T+W R++
Sbjct: 190 MKVRKGYKYVITQWYRER 207
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 91/203 (44%), Gaps = 32/203 (15%)
Query: 13 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDK 66
+ + S +C+ +I LKPS + V+ G RTS I +
Sbjct: 181 YESILSEYECRYLITKFNALLKPSMV-------VDPVTGRGKIDSVRTSYVAVIEPAHCD 233
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 123
I ++ I++ T + +GEA N+LRY GQ+Y HYD N QR+
Sbjct: 234 -WITRKLDKTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDALMFKDGKQRIK 292
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 183
+ L+YL+ + EGGET+FP + +++ P+ G ++F + NG +
Sbjct: 293 TALVYLNTISEGGETLFPK---------------LDIRIAPKSGTMVVFSNSDENGKLLL 337
Query: 184 TSLHGSCPVIKGEKWVATKWIRD 206
S H P + KW+ TKWIR+
Sbjct: 338 NSYHAGAPTVSENKWLVTKWIRE 360
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 99/217 (45%), Gaps = 30/217 (13%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++L + P A+ F + S E+ + I A RLK + + + +E T R S ++
Sbjct: 322 VEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELE-TAAYRISKSAWL 380
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPA 112
+ + +++ I +I T L Q E + Y +G YD H+D AF
Sbjct: 381 KGGDHE--LIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAFESL 438
Query: 113 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
G RLA+ L YL++ E GG T+F + V P + L +
Sbjct: 439 GTG----NRLATVLFYLTEPEIGGGTVF---------------TELRTAVMPSKNGALFW 479
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
Y+L+ +G D + H +CPV+ G KWVA KWI ++ Q
Sbjct: 480 YNLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQ 516
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
++L P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 340 EILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V + L +++L
Sbjct: 455 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G KWVA KWI + Q
Sbjct: 500 RSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQ 532
>gi|313768105|ref|YP_004061536.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
gi|312599712|gb|ADQ91733.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
Length = 197
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 94/199 (47%), Gaps = 28/199 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR L N S ++C+ I A K+L+ S +++ + + + R S ++ ASED
Sbjct: 23 KPRVL--KNVLSEDECKHIQDIASKKLQTSTVSMSR----DIDEKIRKSETAWLKASED- 75
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 126
+++ + K T P + E VL+Y+ G Y H D F + ++R+ +F+
Sbjct: 76 -PVVDKLIRKCVSMTDRPLHNCEDLQVLKYKPGGFYKPHQDCFKNDK-----NKRMYTFI 129
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+ L+D EGGET FP I + + +GD L F +L + +L
Sbjct: 130 IALNDEYEGGETEFPN---------------IKRRYRLEKGDALFFNTLNNYECTTKQAL 174
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV GEKWV WIR
Sbjct: 175 HGGAPVKSGEKWVCNLWIR 193
>gi|397615311|gb|EJK63351.1| hypothetical protein THAOC_15991 [Thalassiosira oceanica]
Length = 463
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 91/184 (49%), Gaps = 30/184 (16%)
Query: 44 ETVESTKGTRTSSGTFISASEDKTGILELIEHK------IARATMLPQTHGE-------- 89
E + TRTS T++ +D I++ I + I A + P++ GE
Sbjct: 289 EARHDIRETRTSLNTWVYREKDL--IIDAIYRRAADLLRIDEALLRPRSAGEVPEMKNTR 346
Query: 90 ----AFNVLRYEIGQKYDSHYD-AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEN 144
A ++ YE+GQ+Y +H+D + P + Q + R A+ LLYL++ GGET FP
Sbjct: 347 GLAEALQLVHYEVGQEYTAHHDFGYAPFDRKDQPA-RFATLLLYLNEGMVGGETQFP--- 402
Query: 145 GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+ + GL V+P+ G +LFYS P+G +D S H + PV GEKW+ W+
Sbjct: 403 -----RWANAETRAGLDVEPKIGKAVLFYSQLPDGNMDDLSQHAARPVKIGEKWLMNLWV 457
Query: 205 RDQE 208
D E
Sbjct: 458 WDPE 461
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 453
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
Length = 197
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 98/199 (49%), Gaps = 28/199 (14%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+PR L N S ++C+ I A K+L+ S ++ ++ + + R S ++ ASED
Sbjct: 23 KPRVL--KNVLSEDECKHIQNIASKKLQTSTVS----KSRDIDESIRKSETAWLKASED- 75
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 126
+++ + K T P + E VL+Y+ G Y H D F P + ++R+ +F+
Sbjct: 76 -PVVDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDTF-PDD----KNKRMYTFI 129
Query: 127 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 186
+ L+D EGGET FP + KK L+ +GD L F +L I + +L
Sbjct: 130 IALNDEYEGGETEFP-----------NIKKSYRLE----KGDALFFNTLNNYECITKKAL 174
Query: 187 HGSCPVIKGEKWVATKWIR 205
HG PV GEKWV W+R
Sbjct: 175 HGGTPVKSGEKWVCNLWVR 193
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
Length = 544
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 100/213 (46%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
+ + L + H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DTVNPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N L V R L +++L
Sbjct: 455 KSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 500 RSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 95/193 (49%), Gaps = 29/193 (15%)
Query: 26 IATAKKRLKPSQLALRQGETVESTK----GTRTSSGTFISASEDKTGILELIEHKIARAT 81
+ +A+ L+ Q + + + ++ +RT+S +I D + I+ + + +
Sbjct: 9 LMSARPLLRLDQARVERATVITDSEHQFHDSRTNSYAWIQ--HDASEIIHEVSKRFSILV 66
Query: 82 MLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM----SQRLASFLLYLSDVEEGG 136
+P + E F ++ Y G +Y H+DAF+ + E G QR+ + L YL+DVE+GG
Sbjct: 67 KMPINNAEQFQLVHYGPGTEYKPHFDAFDKSTEEGRNNWFPGGQRMVTALAYLNDVEDGG 126
Query: 137 ETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT--IDRTSLHGSCPVIK 194
T FP I + VKP +GD ++F++ +GT I+ SLHG PVI
Sbjct: 127 ATDFP---------------DIHVSVKPNKGDVVVFHNC-KDGTSDINPNSLHGGSPVIS 170
Query: 195 GEKWVATKWIRDQ 207
GEKW W R +
Sbjct: 171 GEKWAVNLWFRQE 183
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 493
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 494 ATFLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEGD 538
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 539 YRTRHAACPVLVGCKWVSNKWFHERGQ 565
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 422
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 423 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRV 422
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 423 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 474
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 475 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 519
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 520 YRTRHAACPVLVGCKWVSNKWFHERGQ 546
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 453
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 454 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 483
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 484 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 528
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 529 YRTRHAACPVLVGCKWVSNKWFHERGQ 555
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 101/212 (47%), Gaps = 25/212 (11%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 59
++ LS P YF + S ++ + II K ++ S++ G+T ST RTS T+
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSEIRTSQNTW 394
Query: 60 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQ 117
+ + L I+ ++ T L E ++ Y IG +Y+ H+D + AE +G +
Sbjct: 395 LWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFGWK 452
Query: 118 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 177
RL + L YL+DV GG T FPF + L V P +G L++Y+L
Sbjct: 453 -GNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYNLHR 496
Query: 178 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ D + H CPV+KG KW+ +W + Q
Sbjct: 497 SLHKDFRTKHAGCPVLKGSKWICNEWFHEAAQ 528
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 32/211 (15%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 118
++ + ++ T L E V Y +G +Y+ H+D AF G
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTG--- 450
Query: 119 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 178
R+A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +
Sbjct: 451 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 494
Query: 179 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
G D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 495 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 96/217 (44%), Gaps = 30/217 (13%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++L + P + F S + + I A +LK + + + +E R S ++
Sbjct: 318 VEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDLEYA-NYRISKSAWL 376
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPA 112
++ ++ I +I T L Q E Y IG YD H+D AF
Sbjct: 377 KGTDHPA--IDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFDFARKEDINAFKTL 434
Query: 113 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 172
G R+A+ L+Y+SDVE GG T+F +G V P + D L +
Sbjct: 435 NTG----NRIATILIYMSDVESGGATVF---------------NHLGNAVFPSKYDALFW 475
Query: 173 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
Y+L +G D + H +CPV+ G KWV+ KWI D+ Q
Sbjct: 476 YNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQ 512
>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 533
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 93/203 (45%), Gaps = 24/203 (11%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 66
+P + + +IA AK RL+ S+ + RTSS T++ +
Sbjct: 325 KPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---DGPPPRTSSNTWLDDDDAP 381
Query: 67 TG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD----AFNPAEYGPQMSQ 120
+ + ++ + T+ + E + + Y IG Y H+D + ++
Sbjct: 382 VAARVNQYLQSLLGLGTLYGKDEAEKYQLANYGIGGHYVPHHDYLEESLTSSKKHRLFGD 441
Query: 121 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 180
R+A+ ++Y+SDVEEGG T+FP +G++V PR+GD + ++++ +
Sbjct: 442 RVATLMIYMSDVEEGGATVFP---------------SLGVRVSPRKGDAVFWWNIKSSWE 486
Query: 181 IDRTSLHGSCPVIKGEKWVATKW 203
D + H CPV+ G KW+A KW
Sbjct: 487 GDVLTWHAGCPVLYGSKWIANKW 509
>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
pulchellus]
Length = 548
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 93/210 (44%), Gaps = 29/210 (13%)
Query: 7 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED- 65
+P + F + ++A A RL S GE T RTSS ++ +
Sbjct: 335 KPYIITFHDIIGDRDINDLLAYATPRLFRST---HYGEHGTETSLIRTSSTAWLGDQDAP 391
Query: 66 -KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ------- 117
T + +E + + + E + + Y +G +Y +H+D P
Sbjct: 392 VATRLNRFVESLLGLGSQYLKGEAEYYQLANYGVGGQYIAHHDFLADIYADPNRKLDDFE 451
Query: 118 --MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
R+A+ + YLSDVEEGG T+FP +G+++ P++G+ +++L
Sbjct: 452 RSAGDRIATLMFYLSDVEEGGATVFPH---------------LGVRLTPKKGNAAFWWNL 496
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 205
+G ++ + HG CPV+ G KW+A KW R
Sbjct: 497 NSDGEGEQLTKHGGCPVLYGSKWIANKWFR 526
>gi|87199403|ref|YP_496660.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
gi|87135084|gb|ABD25826.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
Length = 211
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 98/212 (46%), Gaps = 28/212 (13%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
M+V S R +F S +C +IA ++ +PS +A G+ T T
Sbjct: 20 MRVPSPRLEMFVVRDFLSQAECNGLIARIERDRRPSTIADANGDHYFRTSET-------C 72
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYG 115
D I+ L E K+ + + + GE RYE GQ++ +H D F+P +
Sbjct: 73 DLPMDDPEIVALDE-KLCALSGIGRPFGEPIQGQRYESGQEFKAHTDYFDPHGADFQRFC 131
Query: 116 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 175
QR +F++YL+DVE GG T F K I ++P RG + + +
Sbjct: 132 SVAGQRTWTFMVYLNDVEAGGATRF---------------KVIDKTIQPERGKLVCWNNR 176
Query: 176 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 207
P+GT++ +LH + V KG K+V TKW R++
Sbjct: 177 RPDGTVNPCTLHHAMKVRKGLKYVITKWYREK 208
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRV 452
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 453 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 497
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 498 YRTRHAACPVLVGCKWVSNKWFHERGQ 524
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 8 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 66
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 67 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 122
++ + ++ T L E V Y +G +Y+ H+D E R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRV 451
Query: 123 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 182
A+FL Y+SDVE GG T+FP +G + P++G + +Y+L +G D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 183 RTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|24651430|ref|NP_733378.1| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
gi|23172699|gb|AAF57061.2| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
Length = 542
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 95/210 (45%), Gaps = 19/210 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFI 60
++LS P + + S ++ I ++K+ + PS E+ T RTS +
Sbjct: 326 EILSIDPFVVLLHDMISQKESTLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWY 385
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS 119
S+ + T + I ++ AT L E + V+ Y +G +++H D + S
Sbjct: 386 SSDFNDTT--KKITERLGDATGLDMNSTEFYQVINYGLGGFFETHLDMLLSEKNRFNGTS 443
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L YL++V +GG T FP N L V P+ G L +Y+L G
Sbjct: 444 DRIATTLFYLNEVRQGGGTYFPRLN---------------LTVFPQPGSALFWYNLDTKG 488
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
SLH CPVI G KWV +KWI D Q
Sbjct: 489 NDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 518
>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 94/209 (44%), Gaps = 31/209 (14%)
Query: 4 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 63
LSW+PR + F S E+ +I+ K +T E T G
Sbjct: 61 LSWQPRVFLYRGFLSEEESDHLISLRK-------------DTSEVTSGDADGKTQL---- 103
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 123
++ IE KI+ T LP+ +G + V Y +K D F LA
Sbjct: 104 ---DPVVAGIEEKISAWTFLPRENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLRESLLA 159
Query: 124 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI---GLKVKPRRGDGLLFYSLFPNGT 180
+ +LYLS+ +GGE +FP +S KK G ++P +G+ +LF+S N +
Sbjct: 160 TVVLYLSNTTQGGELLFP-------NSEVKPKKSCSEDGNILRPVKGNAVLFFSRLLNAS 212
Query: 181 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+D TS H CPV+KGE VATK I ++Q
Sbjct: 213 LDETSTHLICPVVKGELLVATKLIYAKKQ 241
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 74/150 (49%), Gaps = 19/150 (12%)
Query: 64 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---- 119
+++ ++ + ++ T L T E V+ Y IG Y+ H+D E S
Sbjct: 395 DEEHSVVRTVGQRVEDMTGLTMTTAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTG 454
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L Y+SDV +GG T+FP I + ++P++G +Y+L +G
Sbjct: 455 NRIATVLFYMSDVSQGGATVFP---------------SIRVALRPKKGTAAFWYNLHASG 499
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 HGDYATRHAACPVLTGTKWVSNKWIHERGQ 529
>gi|400602974|gb|EJP70572.1| 2OG-Fe(II) oxygenase family Oxidoreductase [Beauveria bassiana
ARSEF 2860]
Length = 269
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 101/215 (46%), Gaps = 16/215 (7%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 60
+++LS P A+Y NF + + + ++A + KPS++A G V +T R+S F+
Sbjct: 47 VEILSIDPLAIYLNNFLNDAEIRYLLALGENIYKPSEVASHSGIIVNTT--VRSSESAFL 104
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG-QKYDSHYDAFNPAEYGP--- 116
ED LI + + H E+ +++Y G +Y H D A+
Sbjct: 105 L--EDDAVCNCLISRMKSLLGNVQHEHVESLQMVKYAAGGDRYRLHTDWSVAAKNNTDEA 162
Query: 117 ----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKCIGLKVKPRRGD 168
+ S+RL + +YL D GGET FP G+ D+ + K+ GL V+P+RG+
Sbjct: 163 SGKLRQSRRLGTIFVYLEDSCAGGETYFPLLTGVSDDADGEKFAVAKQGGGLLVRPKRGN 222
Query: 169 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 203
G+ + ++ NGT D +H P+ G K W
Sbjct: 223 GVFWNNIHSNGTGDDRVVHAGLPIKSGVKIGLNMW 257
>gi|211938649|gb|ACJ13221.1| FI08532p [Drosophila melanogaster]
Length = 543
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 95/210 (45%), Gaps = 19/210 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFI 60
++LS P + + S ++ I ++K+ + PS E+ T RTS +
Sbjct: 327 EILSIDPFVVLLHDMISQKESTLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWY 386
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS 119
S+ + T + I ++ AT L E + V+ Y +G +++H D + S
Sbjct: 387 SSDFNDTT--KKITERLGDATGLDMNSTEFYQVINYGLGGFFETHLDMLLSEKNRFNGTS 444
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L YL++V +GG T FP N L V P+ G L +Y+L G
Sbjct: 445 DRIATTLFYLNEVRQGGGTYFPRLN---------------LTVFPQPGSALFWYNLDTKG 489
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
SLH CPVI G KWV +KWI D Q
Sbjct: 490 NDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 519
>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
Length = 296
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 99/211 (46%), Gaps = 27/211 (12%)
Query: 1 MQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSSGT 58
+++ S P + + + + Q +I + ++R+ S Q +RQ E E RTS
Sbjct: 75 LEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSKEA 130
Query: 59 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA----FNPAEY 114
+ D +L+ I ++ T E ++L Y+ G +D H D ++P EY
Sbjct: 131 PFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWHPQEY 189
Query: 115 GPQ-MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 173
R AS + YL+DVE+GGET+FP + L + P +G L+++
Sbjct: 190 EYHPFGDRQASVVFYLNDVEDGGETVFP---------------KLQLVIPPTKGSALMWH 234
Query: 174 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 204
+L P G D + H SCPV+ G K VA +WI
Sbjct: 235 NLRPWGEGDPRTQHASCPVLSGYKQVAIQWI 265
>gi|20269814|gb|AAM18062.1|AF495540_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE2
[Drosophila melanogaster]
gi|19528175|gb|AAL90202.1| AT27756p [Drosophila melanogaster]
Length = 542
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 95/210 (45%), Gaps = 19/210 (9%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFI 60
++LS P + + S ++ I ++K+ + PS E+ T RTS +
Sbjct: 326 EILSIDPFVVLLHDMISQKESTLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWY 385
Query: 61 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS 119
S+ + T + I ++ AT L E + V+ Y +G +++H D + S
Sbjct: 386 SSDFNDTT--KKITERLGDATGLDMNSTEFYQVINYGLGGFFETHLDMLLSEKNRFNGTS 443
Query: 120 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 179
R+A+ L YL++V +GG T FP N L V P+ G L +Y+L G
Sbjct: 444 DRIATTLFYLNEVRQGGGTYFPRLN---------------LTVFPQPGSALFWYNLDTKG 488
Query: 180 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
SLH CPVI G KWV +KWI D Q
Sbjct: 489 NDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 518
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 73/142 (51%), Gaps = 16/142 (11%)
Query: 69 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLL 127
++ IE +I T L E F ++ Y IG Y HYD + +E + +R+ + L
Sbjct: 320 VMRNIEKRIKDMTGLSMDLSEDFMLINYGIGGTYKMHYDFYVYSEPLRFLRGERIVTVLF 379
Query: 128 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 187
YL DVE G T+FPF N + + P++G +++Y+L +G + + + H
Sbjct: 380 YLGDVELSGSTVFPFLN---------------ISITPKKGSAVMWYNLHNSGDVHQKTQH 424
Query: 188 GSCPVIKGEKWVATKWIRDQEQ 209
+CPV+ G K+V TKWI + Q
Sbjct: 425 CACPVVVGSKYVLTKWINELHQ 446
>gi|326928035|ref|XP_003210190.1| PREDICTED: WD repeat-containing protein 6-like [Meleagris
gallopavo]
Length = 900
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 89/195 (45%), Gaps = 34/195 (17%)
Query: 44 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---HGEAFNVLRYEIGQ 100
+ V+++ R S T++ E ++ I ++ R T LP H E V+RY+ G
Sbjct: 134 QKVKTSDAVRNSQHTWLYQGEGAHQVMRAIRQRVMRLTRLPPEIVEHSEPLQVVRYDQGG 193
Query: 101 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP----- 141
Y +H D+ E P + S R + L YL++V GGET+FP
Sbjct: 194 HYHAHMDSGPVFPETACSHTKLVANESAPFETSCRYVTVLFYLNNVTGGGETVFPIADNR 253
Query: 142 -FENGIFLDSGYDY----KKCI--GLKVKPRRGDGLLFYSLFPNGT-----IDRTSLHGS 189
+E + + D K C L+VKP++G + +Y+ +G +D +LHG
Sbjct: 254 TYEEMSLIQNDVDLRDTRKNCDKGNLRVKPQQGTAVFWYNYLSDGEGWVGELDDFALHGG 313
Query: 190 CPVIKGEKWVATKWI 204
C V +G KW+A WI
Sbjct: 314 CLVTQGTKWIANNWI 328
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 102/213 (47%), Gaps = 25/213 (11%)
Query: 2 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 61
+V+ P + + +F + + Q I A+ L+ S +A GE + R S ++
Sbjct: 340 EVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVA--SGEK-QLQVDYRISKSAWLK 396
Query: 62 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 116
S D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 397 DSVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 454
Query: 117 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 176
+ R+A+F++YLS VE GG T F + N + + L +++L
Sbjct: 455 KSGNRVATFMIYLSAVEAGGATAFIYAN---------------FSMPVVKNAALFWWNLH 499
Query: 177 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
+G D +LH CPV+ G+KWVA KWI + Q
Sbjct: 500 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 532
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 77/166 (46%), Gaps = 23/166 (13%)
Query: 49 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 108
T R S ++ ED+ ++E + + A T L E V+ Y IG Y+ H+D
Sbjct: 386 TANYRISKSAWLKTQEDR--VIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDF 443
Query: 109 FNPAEY----GPQMSQRLASFLLYLSDVEEGGETMF-PFENGIFLDSGYDYKKCIGLKVK 163
E G + R+A+ L Y+SDVE+GG T+F +F
Sbjct: 444 ARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHTALF---------------- 487
Query: 164 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 209
P++G + +L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 488 PKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQ 533
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,451,736,984
Number of Sequences: 23463169
Number of extensions: 140533168
Number of successful extensions: 270793
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1451
Number of HSP's successfully gapped in prelim test: 589
Number of HSP's that attempted gapping in prelim test: 265797
Number of HSP's gapped (non-prelim): 2272
length of query: 212
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 76
effective length of database: 9,168,204,383
effective search space: 696783533108
effective search space used: 696783533108
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)
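Note on the statistics above: the bit scores and Expect values in this report can be reproduced, to within rounding, from the Lambda and K values and the effective search space. The sketch below is a minimal illustration, not part of the BLAST output. It assumes the first Lambda/K pair (0.318, 0.135) is the ungapped set and the second (0.267, 0.0410) is the gapped set, which matches the standard BLOSUM62 values for gap costs 11/1. The function names are hypothetical. Because hits are scored with compositional matrix adjustment, per-hit values may differ slightly from this calculation.

```python
import math

# Values copied from the statistics footer of this report.
# Which pair is ungapped/gapped is an assumption (see the note above).
UNGAPPED = {"lam": 0.318, "k": 0.135}
GAPPED = {"lam": 0.267, "k": 0.0410}
SEARCH_SPACE = 696783533108  # "effective search space used"


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X drop-off value to bits: lambda*X / ln 2 (no K term)."""
    return lam * raw / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw score 207 is reported as 84.3 bits with Expect = 3e-14.
    bits = bit_score(207, **GAPPED)
    print(f"raw 207 -> {bits:.1f} bits, E = {expect(bits, SEARCH_SPACE):.0e}")
    # Thresholds from the footer: S1 uses the ungapped set, S2 the gapped set.
    print(f"S1 41 -> {bit_score(41, **UNGAPPED):.1f} bits")
    print(f"S2 73 -> {bit_score(73, **GAPPED):.1f} bits")
    # Drop-offs: X1 is ungapped, X2 and X3 are gapped.
    print(f"X1 16 -> {dropoff_bits(16, UNGAPPED['lam']):.1f} bits")
    print(f"X2 38 -> {dropoff_bits(38, GAPPED['lam']):.1f} bits")
```

The same relation explains why hits with raw score 206 are listed at 84.0 bits and hits with raw score 207 at 84.3 bits.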