BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027189
(227 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 337 bits (864), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 154/211 (72%), Positives = 185/211 (87%)
Query: 13 GLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQS 72
GLP +L+CS FF+AGFY ST +S+DVP I+P+LR LEV ++E +P G TG++ I+S
Sbjct: 14 GLPAVILVCSVFFVAGFYASTLISQDVPVIKPRLRMLEVTDEEKHQAMPRGVTGESYIES 73
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
IPFQVLSW+PRA+YFP+FA+ EQC++II AK RLKPS LALR+GET ESTKGTRTSSGT
Sbjct: 74 IPFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTRTSSGT 133
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F+SASED TG L+ IEHKIARATM+P++HGEAFN+LRYEIGQKYDSHYD+FNPAEYGPQM
Sbjct: 134 FLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQM 193
Query: 193 SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
SQR+ASFLLYLSDVE+GGETMFPFE ++IS
Sbjct: 194 SQRVASFLLYLSDVEKGGETMFPFENGVKIS 224
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 328 bits (842), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 152/209 (72%), Positives = 179/209 (85%), Gaps = 2/209 (0%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
KLGL T ++ CS FL GFYGST LS++VP ++P+LR LE+VE +E+ +PHG TG+
Sbjct: 12 KLGLATVIVFCSLCFLVGFYGSTLLSQNVPRVKPRLRMLEMVENGEEDTGSMPHGVTGEE 71
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
S+ SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRT
Sbjct: 72 SVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSGTFISASED TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFE 217
GPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFE 220
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 325 bits (834), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 154/209 (73%), Positives = 180/209 (86%), Gaps = 2/209 (0%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEK-ENESG-LPHGETGDA 68
KLGL T ++ CS FL GFYGST LS++VP ++P+LR LE+VE E E+G +PHG TG+
Sbjct: 12 KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLEMVENGEEEAGSMPHGVTGEE 71
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRT
Sbjct: 72 SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFE 217
GPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFE 220
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 325 bits (833), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 152/209 (72%), Positives = 179/209 (85%), Gaps = 2/209 (0%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
KLGL T ++ CS FL GFYGST LS++VP ++P+LR L++VE +E S +PHG TG+
Sbjct: 12 KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLDMVENGEEEASSMPHGVTGEE 71
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LALR+GET E+TKGTRT
Sbjct: 72 SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFE 217
GPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFE 220
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 323 bits (829), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 156/225 (69%), Positives = 182/225 (80%), Gaps = 9/225 (4%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR--------DVPSIRPKLRTLEVV 52
MK K KRS KLGLP A++LCS FFLAGFY S F DVP RP+LR LEV
Sbjct: 1 MKIKTKRSKRKLGLPAAIILCSLFFLAGFYSSIFFPHVSASLSACDVPVTRPRLRMLEVA 60
Query: 53 EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
E ++++ +PHG TG+AS++SIPFQVLSW+PRALYFP FA+ EQC+SII + +LKPS L
Sbjct: 61 EGDHDT-MPHGVTGEASVESIPFQVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTL 119
Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
ALR+GET ESTK TRTSSG+F+S SED+TG L+ IE KIA+ATM+PQ+HGEAFN+LRYEI
Sbjct: 120 ALRKGETAESTKDTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEI 179
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
GQKYDSHYDAFNP EYG Q SQR ASFLLYLS+VEEGGETMFPFE
Sbjct: 180 GQKYDSHYDAFNPDEYGQQSSQRTASFLLYLSNVEEGGETMFPFE 224
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 322 bits (824), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/223 (71%), Positives = 183/223 (82%), Gaps = 3/223 (1%)
Query: 1 MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
MKGK K KLGL + SFFFLAG +GS S+DV +R + R LE VE+ S
Sbjct: 1 MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQPRLLESVEEY--SP 58
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+PHGETG++S+ IPFQVLSW+PRALYFP FA+AEQCQSII AK L+PS LALRQGET
Sbjct: 59 MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 118
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 119 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 178
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE DL I
Sbjct: 179 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNI 221
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 310 bits (793), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 155/228 (67%), Positives = 181/228 (79%), Gaps = 6/228 (2%)
Query: 1 MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
MKGK+ RS+ +KLGLP + +LC F FLAGF+GST LS+DV RP+ R L+
Sbjct: 1 MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
Query: 56 NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
E L GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+ AK +L+PS LAL
Sbjct: 61 TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
R+GET ESTKG RTSSG F SASED++G L +IE KIARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE L +
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNM 228
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 307 bits (787), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 158/223 (70%), Positives = 180/223 (80%), Gaps = 11/223 (4%)
Query: 1 MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
MKGK K KLGL + SFFFLAG +GS S+ P+L LE VE+ S
Sbjct: 1 MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQ------PRL--LESVEEY--SP 50
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
+PHGETG++S+ IPFQVLSW+PRALYFP FA+AEQCQSII AK L+PS LALRQGET
Sbjct: 51 MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 110
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 111 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 170
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE DL I
Sbjct: 171 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNI 213
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 307 bits (787), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 154/228 (67%), Positives = 180/228 (78%), Gaps = 6/228 (2%)
Query: 1 MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
MKGK+ RS+ +KLGLP + +LC F FLAGF+GST LS+DV RP+ R L+
Sbjct: 1 MKGKSGRSNWSLRSKLGLPALIFVLCLFXFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60
Query: 56 NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
E L GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+ AK +L+PS LAL
Sbjct: 61 TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
R+GET ESTKG RTSSG F SASED++G L +IE K ARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQ 180
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE L +
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNM 228
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 305 bits (781), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 154/223 (69%), Positives = 176/223 (78%), Gaps = 2/223 (0%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
MKGK S KL LP L S FFLAG +GS S+DV +R R LE VE+E S +
Sbjct: 1 MKGKGGWSP-KLELPLLFLSWSLFFLAGLFGSMLFSQDVNGVRSPPRLLESVEEEYSS-M 58
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
PHGETG++S+ IPFQVLSW+PRA YFP+FA+AEQCQSII AK L PS L LR+GET
Sbjct: 59 PHGETGESSVDLIPFQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRKGETE 118
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
ESTKG RTSSGTFISASEDKTGIL+ IE KIA+ATM+P+ HGE FN+LRYEIGQ+Y+SHY
Sbjct: 119 ESTKGIRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHY 178
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
DA +PAEYG Q SQR+ASFLLYLSDVEEGGETMFPFE DL I+
Sbjct: 179 DAISPAEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNIN 221
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 302 bits (774), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 150/229 (65%), Positives = 177/229 (77%), Gaps = 7/229 (3%)
Query: 1 MKGKAKRSS----TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE- 55
MK K + +KLGLP L C FFFLAG + S +S++V + + R L+ V++E
Sbjct: 1 MKAKGSKGKWSIKSKLGLPVVFLSCLFFFLAGLFASNLISQNVNGDKNR-RQLQWVKEEI 59
Query: 56 -NESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
LP G+TGD + IPFQVLSW+PRALYFPNFA+AEQCQS+I AK L PS LAL
Sbjct: 60 IEYDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLAL 119
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
R+GET E+TKG RTSSG F+SASEDKTG+L+ IE KIARATMLP+ +GEAFN+LRYEIGQ
Sbjct: 120 RKGETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQ 179
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
KY+SHYDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE DL +
Sbjct: 180 KYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVD 228
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 296 bits (759), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 144/224 (64%), Positives = 175/224 (78%), Gaps = 2/224 (0%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTL-EVVEKENESG 59
MKGK K S KLG+PT +LC+ FF GF+ S L +D+ + P+ R L E V+KE E
Sbjct: 2 MKGKVKSSKLKLGVPTLFILCALFFFVGFFVSPLLFQDLDDVGPRSRILQESVKKEYEP- 60
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
L HGE+G+ + SIP Q+LSWRPRA++FPNF S E CQ II AK +L+PS+LALR+GET
Sbjct: 61 LEHGESGEPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGET 120
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTK TRTSSGTFISASEDK+GIL+L+E KIA+ TM+P+THGE FN+L+YE+GQKYDSH
Sbjct: 121 AESTKDTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSH 180
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
YDAFNP EYG SQR+ASFLLYLS+VE GGETMFP+E L I
Sbjct: 181 YDAFNPDEYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNID 224
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 296 bits (759), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 142/219 (64%), Positives = 171/219 (78%), Gaps = 4/219 (1%)
Query: 9 STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
+ KL P L+C FFFLAGF+GST S D +RP+ R L+ ++ + + GE
Sbjct: 15 TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK LKPS LALR+GET E+TK
Sbjct: 75 FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
G RTSSG F+SAS DKT LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y+SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFN 194
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE L +
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMD 233
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 295 bits (754), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 142/219 (64%), Positives = 170/219 (77%), Gaps = 4/219 (1%)
Query: 9 STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
+ KL P L+C FFFLAGF+GST S D +RP+ R L+ ++ + + GE
Sbjct: 15 TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK LKPS LALR+GET E+TK
Sbjct: 75 FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
G RTSSG F+SAS DKT LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFN 194
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE L +
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMD 233
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 285 bits (730), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 139/184 (75%), Positives = 158/184 (85%)
Query: 34 FLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASA 93
F +DVPS+RP+ R LE V +E+ LP GETG+ S+ SIPFQVLSW PRALYFPNFAS
Sbjct: 35 FSHQDVPSVRPRPRFLESVYQEDFDPLPIGETGEHSLISIPFQVLSWFPRALYFPNFASI 94
Query: 94 EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIAR 153
EQCQSII AK ++PS LALR GET E+TKG RTSSGTFISASEDKTGIL+LIE KIA+
Sbjct: 95 EQCQSIIKMAKANMEPSSLALRTGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIAK 154
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
ATM+P+THGEAFNVLRYEIGQ+Y SHYDAF+PA+YGPQ SQR ASFLLYLSDVEEGGET+
Sbjct: 155 ATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQRAASFLLYLSDVEEGGETV 214
Query: 214 FPFE 217
FP+E
Sbjct: 215 FPYE 218
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 285 bits (728), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 144/233 (61%), Positives = 172/233 (73%), Gaps = 14/233 (6%)
Query: 1 MKGKAKRSSTK------LGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIRPKLRTL 49
M+GKA + + + LG P +L CSFFF+AG + S L S D +R + R L
Sbjct: 1 MRGKACKENWRFYKKPNLGFPALILSCSFFFIAGLFASNLLLSQGTSSDERWLRARARQL 60
Query: 50 EVVEKENESG---LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
+ VE+E S LP GE+GD I IPFQVLSWRPRALY+P F +AEQCQ II AK
Sbjct: 61 QSVEEEIISKYDLLPSGESGDDFITLIPFQVLSWRPRALYYPGFITAEQCQHIINMAKPS 120
Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
L+PS LALR+GET E+TKG RTSSG F+ +SED+ G+L++IE KIARATM+P THGEAFN
Sbjct: 121 LQPSTLALRKGETAETTKGIRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFN 180
Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
VLRYEIGQKYD+HYDAFNPAEYGPQ SQR+A+FLLYLS+ EEGGET FP E D
Sbjct: 181 VLRYEIGQKYDAHYDAFNPAEYGPQTSQRVATFLLYLSNFEEGGETTFPIEND 233
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 283 bits (723), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 149/235 (63%), Positives = 171/235 (72%), Gaps = 12/235 (5%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIR----PKL-RTLE 50
++G K+ P+ LLC FFFLAGF+GST D +R P+ R LE
Sbjct: 6 VRGNWSWRMNKIAFPSVFLLCIFFFLAGFFGSTLFFQYHSQEDEHGLRQIQRPRTTRLLE 65
Query: 51 -VVEKENESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
+ EKE ES L P GETGD I +IPFQVLSW P ALYFPNFA+AEQC+SII TAK+ LK
Sbjct: 66 NLTEKETESHLLPAGETGDNFITTIPFQVLSWNPHALYFPNFATAEQCESIIETAKEGLK 125
Query: 109 PSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
PS L LR GET EST G RTSSG FISA EDKTG+L++IE KIARAT +P+THGEAFNVL
Sbjct: 126 PSTLVLRVGETDESTTGIRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVL 185
Query: 169 RYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
RY++GQKY SHYDA +P YGPQ SQR+ASFLLYLSDV EGGETMFPFE L +
Sbjct: 186 RYKVGQKYSSHYDALHPDIYGPQKSQRMASFLLYLSDVPEGGETMFPFENGLNMD 240
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 282 bits (722), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 146/226 (64%), Positives = 174/226 (76%), Gaps = 3/226 (1%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
+KG + KL P L+C FFFLAGF+G T S D +RP+ R L+ +
Sbjct: 6 VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII AK LKPS LALRQG
Sbjct: 66 NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
ET ++TKG RTSSG F+SASEDKT L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFE L +
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMD 231
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 282 bits (721), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 146/226 (64%), Positives = 174/226 (76%), Gaps = 3/226 (1%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
+KG + KL P L+C FFFLAGF+G T S D +RP+ R L+ +
Sbjct: 6 VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII AK LKPS LALRQG
Sbjct: 66 NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
ET ++TKG RTSSG F+SASEDKT L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFE L +
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMD 231
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 281 bits (720), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 140/223 (62%), Positives = 171/223 (76%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
MKGK +RS KLG+PT LC+ FF GF+ S L +D+ + P+ R L+ K+ L
Sbjct: 1 MKGKVRRSKLKLGVPTFFFLCALFFFVGFFVSPLLFQDLGDVGPRSRILQESVKKAYEPL 60
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
HGE+G+ + SIPFQ+LSWRPRA+YFPNF S E CQ II AK +L+PS+LALR+GET
Sbjct: 61 EHGESGEPFLNSIPFQILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETA 120
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
ESTK TRTSSGTFISASEDK+GIL+ +E KIA+ TM+P+THGE FN+L+YE+ QKYDSHY
Sbjct: 121 ESTKDTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHY 180
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
DAFNP EYG SQR+ASFLLYLS+VE GGETMFP+E L I
Sbjct: 181 DAFNPDEYGTVESQRIASFLLYLSNVEAGGETMFPYEGGLNID 223
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 280 bits (716), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 136/211 (64%), Positives = 166/211 (78%), Gaps = 7/211 (3%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P+ R LE E +P+GE+G
Sbjct: 22 TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLEAAWPE----MPYGESG 77
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+A+ IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG
Sbjct: 78 EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+SA+ED T L IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
+YGPQ +QR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYE 228
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 279 bits (713), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 135/211 (63%), Positives = 166/211 (78%), Gaps = 7/211 (3%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P+ R L+ E +P+GE+G
Sbjct: 22 TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLDAAWPE----MPYGESG 77
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+A+ IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG
Sbjct: 78 EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+SA+ED T L IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
+YGPQ +QR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYE 228
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 277 bits (708), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 146/226 (64%), Positives = 172/226 (76%), Gaps = 3/226 (1%)
Query: 1 MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
+KG + KL P L+C FFFLAGF+G T S D +RP+ R LE +
Sbjct: 6 VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLEPGNEAEY 65
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ +P + GD SI SIPFQVLSWRPRALYFPNFA+AEQC++II AK LKPS LALRQG
Sbjct: 66 NLMPVRDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQG 125
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
ET E+TKG RTSSG F+SAS DKTG L +IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEENTKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYN 185
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFE L +
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMD 231
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 276 bits (707), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 134/211 (63%), Positives = 165/211 (78%), Gaps = 7/211 (3%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P+ R LE E +P+GE+G
Sbjct: 22 TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLETAWPE----MPYGESG 77
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+A+ IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG
Sbjct: 78 EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+SA+ED T L IE KIARATM+P+ HGE FNVLRY IGQ+Y SHYDAF+P
Sbjct: 138 RTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPV 197
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
+YGPQ SQR+ASFLLYL++VEEGGETMFP+E
Sbjct: 198 QYGPQKSQRVASFLLYLTNVEEGGETMFPYE 228
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 275 bits (703), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 127/164 (77%), Positives = 145/164 (88%)
Query: 54 KENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
+E S +PHG TG+ SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II AK LKPS LA
Sbjct: 6 EEEASSMPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALA 65
Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
LR+GET E+TKGTRTSSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+G
Sbjct: 66 LRKGETAENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELG 125
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
QKYDSHYD FNP EYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 126 QKYDSHYDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFE 169
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 273 bits (697), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 135/214 (63%), Positives = 166/214 (77%), Gaps = 10/214 (4%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSI----RP--KLRTLEVVEKENESGLPHG 63
T+L LP LL CS FFLAGF GS ++D RP + R +E V E + +G
Sbjct: 72 TRLRLPVVLLSCSLFFLAGFLGSLLFTQDPQGEEDPERPLRRERVMEAVWPE----MAYG 127
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
E+GD + IP+Q+LSW+PRALYFP FA+AEQC++++ TAK RL+PS LALR+GE+ E+T
Sbjct: 128 ESGDPAPSLIPYQILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRKGESEETT 187
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
KG RTSSGTF+SA ED TG L IE KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF
Sbjct: 188 KGIRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAF 247
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
+PA+YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 248 DPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYE 281
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 271 bits (692), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/226 (65%), Positives = 173/226 (76%), Gaps = 6/226 (2%)
Query: 1 MKGKAK---RSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE 57
MK KAK R TKLGLP L C FFFLAG +GS L ++ S P R L + E+ +
Sbjct: 1 MKSKAKGKWRFGTKLGLPVVFLFCLFFFLAGLFGSGLLPQEFSSSEP--RRL-IREETDY 57
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
L HGE+G+ S+ SIPFQVLSWRPRALYFPNFA++EQCQSII AK L PS +ALR G
Sbjct: 58 DPLAHGESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVG 117
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E +T+G RTSSG FISASEDKTG L+LIE KIAR M+P+THGEAFNVLRYEIGQ+Y+
Sbjct: 118 EIRGNTEGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYN 177
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETMFPFE L +
Sbjct: 178 SHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMD 223
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 270 bits (691), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 131/211 (62%), Positives = 166/211 (78%), Gaps = 4/211 (1%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P +L T E+ E+ G+ +GE+G
Sbjct: 33 TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 91
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+ IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG
Sbjct: 92 EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 151
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+S+ ED TG L +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 152 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 211
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
+YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 212 QYGPQKSQRVASFLLYLTDVEEGGETMFPYE 242
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 270 bits (691), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 131/211 (62%), Positives = 166/211 (78%), Gaps = 4/211 (1%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
T+L LP LL CS FFLAGF+GS ++D P +L T E+ E+ G+ +GE+G
Sbjct: 3 TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 61
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+ IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG
Sbjct: 62 EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 121
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSGTF+S+ ED TG L +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 122 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 181
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
+YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 182 QYGPQKSQRVASFLLYLTDVEEGGETMFPYE 212
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 270 bits (689), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 132/214 (61%), Positives = 166/214 (77%), Gaps = 10/214 (4%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSI----RP--KLRTLEVVEKENESGLPHG 63
T+L LP LL CS FFLAGF+GS ++D RP + R +E V E + +G
Sbjct: 20 TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPQGEEELERPLSRERLMEAVWPE----MAYG 75
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
++GD + IP+Q+LSW+PRALYFP FA++EQC++++ TAK RL+PS LALR+GET E+T
Sbjct: 76 DSGDPAPSLIPYQILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEETT 135
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
KG RTSSGTF+SA ED T L +E KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF
Sbjct: 136 KGIRTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAF 195
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
+PA+YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 196 DPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYE 229
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 268 bits (686), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 132/207 (63%), Positives = 157/207 (75%), Gaps = 6/207 (2%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASI 70
KLG P LL C+FFFL GFYGS LS++ + ++ V E + HG TGD+S+
Sbjct: 17 KLGFPLVLLCCTFFFLVGFYGSNSLSKEEKHV-----VIDPVTNE-KLVFEHGRTGDSSV 70
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
IPFQVLSW+PRAL +PNFAS EQC++II A+ RL PS LALR+GE+ +TK RTSS
Sbjct: 71 TDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKEIRTSS 130
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
GTF+ ASEDKT L +E K+ARATM+P+ +GEAFNVLRY GQKYD HYD F+PAEYGP
Sbjct: 131 GTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEYGP 190
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFE 217
Q SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 191 QPSQRMASFLLYLSDVEEGGETMFPFE 217
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 267 bits (683), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 136/225 (60%), Positives = 162/225 (72%), Gaps = 11/225 (4%)
Query: 4 KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR------DVPSIRPKLRTLEVVEKENE 57
K R KL LP+ L+C FFFLAGF S+ S D+ ++EK +
Sbjct: 7 KGNRRMNKLDLPSVFLICIFFFLAGFSASSLFSHSQKHEYDLRLRPRPRPRARLLEKSTQ 66
Query: 58 SG-----LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
L G++GD + IPFQVLSW PRALYFPNFASAEQC+SII A+ LK S L
Sbjct: 67 EKTEYHLLKAGDSGDDYVTLIPFQVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTL 126
Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
ALR+GET ESTKG RTSSG F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+
Sbjct: 127 ALRKGETEESTKGIRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEV 186
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
GQKY+SHYDAF+ AEYGP SQR+ASFLLYL+DV EGGETMFP+E
Sbjct: 187 GQKYNSHYDAFDEAEYGPLQSQRVASFLLYLTDVPEGGETMFPYE 231
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 259 bits (662), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 134/218 (61%), Positives = 158/218 (72%), Gaps = 13/218 (5%)
Query: 10 TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPS--------IRPKLRTLE--VVEKENESG 59
KL LP+ L+C FFFLAGF S+ S PS + P+ R LE EK
Sbjct: 17 NKLDLPSVFLICIFFFLAGFSASSLFS---PSQEHEYDLRLIPRARLLEKSTHEKTEHHL 73
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
L G++GD I IPFQVLSW PRALYFPNF SAEQC++II A+ LKPS L LR+GET
Sbjct: 74 LKAGDSGDDYITLIPFQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRKGET 133
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
ESTKG RTS G F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+GQKY H
Sbjct: 134 EESTKGIRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPH 193
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
YDAF+ AE+GP SQR ASFLLYL+DV EGGET+FP+E
Sbjct: 194 YDAFDEAEFGPLQSQRAASFLLYLTDVPEGGETLFPYE 231
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 254 bits (650), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 135/214 (63%), Positives = 156/214 (72%), Gaps = 7/214 (3%)
Query: 9 STKLGLPTALLLC---SFFFLAGFYGSTFLSRDVPSIRPKLRTLE-VVEKENESGLPH-G 63
+ K LP+ LLC GF D +R +R LE V+ E E L H G
Sbjct: 14 TKKFSLPSVFLLCIFFFLAGFFGFSFFHHSQEDEYGVR--VRLLEKSVKDETEHRLLHAG 71
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
++GD I SIPFQVLSW PRALYFPNFASAEQC II AK L PS+L LR+GET E T
Sbjct: 72 KSGDNFITSIPFQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLMLREGETEEGT 131
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
KG RTSSG FISASEDKTG+LE+I+ KIARA +P+THG A+N+LRY++GQKY+SHYDAF
Sbjct: 132 KGIRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDAF 191
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
NPAEYGPQ SQR+ASFLLYL+DV EGGETMFPFE
Sbjct: 192 NPAEYGPQESQRVASFLLYLTDVPEGGETMFPFE 225
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 248 bits (634), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 126/216 (58%), Positives = 151/216 (69%), Gaps = 9/216 (4%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVP--SIRPKLRTLEVVEKENESG-------LP 61
KL LP LL C+ LAG+ S F + + +I R E E G L
Sbjct: 17 KLSLPQVLLWCTICLLAGYAASNFFPQKIEEEAIYQPYRKSAQQEGEFPFGEFSEKVVLD 76
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
HG TGD I IPFQVLSW+PRAL +P FAS EQC++I+ A+ RL PS LALR+GE+ +
Sbjct: 77 HGSTGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALALRKGESED 136
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
STK RTSSGTF+ A ED T LE +E K+A+ATM+P+ +GEAFNVL+Y +GQKYD HYD
Sbjct: 137 STKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYD 196
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
F+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 197 VFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFE 232
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 246 bits (627), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 124/225 (55%), Positives = 154/225 (68%), Gaps = 9/225 (4%)
Query: 1 MKGKAK--RSSTKLGLPTALLLCSFFFLAGFYGSTF------LSRDVPSIRPKLRTLEVV 52
MKG+++ ++ P L+LC FL G++ F +PS +
Sbjct: 1 MKGRSRGRVCGHQVSAPLVLVLCVLAFLVGYFIPEFQQVILVTKHSIPSFSSFANQRHEL 60
Query: 53 EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
E+ + HG TGD + IPFQVLSW PRAL FP FAS QC++II+ AK +L PS L
Sbjct: 61 -LEDVTVAEHGVTGDDQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSL 119
Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
ALR+GET T+ RTS G F+S+ +DKTG L +E K+A+ATM+P++HGEAFNVLRYEI
Sbjct: 120 ALRKGETATETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEI 179
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
GQKY+SHYD FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 180 GQKYNSHYDVFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFE 224
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 237 bits (604), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 109/156 (69%), Positives = 128/156 (82%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
HG TG+ + IPFQVLSW PRAL FP FAS QC++II+ AK +L PS LALR+GET
Sbjct: 8 HGVTGEDQLAFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETAT 67
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
T+ RTS G F+S+ +DKTG L +E K+A+ATM+P++HGEAFNVLRYEIGQKY+SHYD
Sbjct: 68 ETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYD 127
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 128 VFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFE 163
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/208 (52%), Positives = 141/208 (67%), Gaps = 12/208 (5%)
Query: 11 KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
KLG P L C FF + GF LS+ + ++ T V +E ES L HG S
Sbjct: 9 KLGFPYVFLACCFFLAIFGFCFFNLLSQIISF--SEISTRRSVNEETES-LDHG-----S 60
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+ +IPF LSW PR Y PNFA+ +QC+++I AK +LKPS LALR+GET E+T+ RT
Sbjct: 61 VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRTR 120
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
ED++GIL IE KIA AT +P + E+FN+LRY++GQKYDSHYDAF+PAEYG
Sbjct: 121 ---LKKTDEDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYG 177
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE 217
PQ+SQR+ +F+L+LS VEEGGETMFPFE
Sbjct: 178 PQISQRVVTFILFLSSVEEGGETMFPFE 205
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 197 bits (502), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 104/208 (50%), Positives = 136/208 (65%), Gaps = 10/208 (4%)
Query: 11 KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
KL P + C FF + GF S+ + S T V E +S L HG +S
Sbjct: 9 KLVFPYVFIACCFFLAIFGFCFFNLFSQGI-SFSEIPTTRRSVNDETDS-LDHG----SS 62
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+ +IPF LSW PR Y PNFA+ +QC+++I AK +LKPS LALR+GET E+T+ R+
Sbjct: 63 VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSL 122
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
ED++G+L IE KIA AT P+ + E+FN+LRY++GQKYDSHYDAF+ AEYG
Sbjct: 123 HQ---HTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYG 179
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE 217
P +SQR+ +FLL+LS VEEGGETMFPFE
Sbjct: 180 PLISQRVVTFLLFLSSVEEGGETMFPFE 207
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 195 bits (495), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 88/155 (56%), Positives = 120/155 (77%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G GD + + FQ+LS PR++ + NFAS C +I+ A+ RL S LAL++GET+E+
Sbjct: 116 GALGDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALKRGETLET 175
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
TK RTSSGTF+++ +++G L+ +E K+ARAT +P THGEA+N+LRYEIGQKYDSHYD
Sbjct: 176 TKNIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDM 235
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
F+P++YGPQ SQR+ASFLLYL+ +EGGET+FP E
Sbjct: 236 FDPSQYGPQRSQRVASFLLYLTTPDEGGETVFPLE 270
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/121 (72%), Positives = 97/121 (80%), Gaps = 9/121 (7%)
Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
AK +LKPS LALR+GET EST FI SEDKTG L+ IE KIA+ATM+PQ+HG
Sbjct: 2 AKSKLKPSTLALRKGETTEST---------FIGGSEDKTGTLDFIERKIAKATMIPQSHG 52
Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
EAFN+LRYEIGQKYDSHYDAFNP EYGPQ SQR+ASFLLYLS VEEGGETMFPFE +
Sbjct: 53 EAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAV 112
Query: 223 S 223
S
Sbjct: 113 S 113
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 84/155 (54%), Positives = 111/155 (71%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP E+G ++ IPFQ+LSW PR + FP F + + +I A K + PS LA R GET
Sbjct: 2 LPAAESGSDNVYVIPFQILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYRPGET 61
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
V+ ++ TRTS+GTF++A+ D G+L +E +IA AT+LP +GEAFNVL YE Q YDSH
Sbjct: 62 VDPSQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSH 121
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
YD F+P E+GPQ SQR+A+ LLYLS+V EGGET+F
Sbjct: 122 YDTFDPKEFGPQPSQRIATVLLYLSEVLEGGETVF 156
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 80/154 (51%), Positives = 103/154 (66%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
E+G+ P Q++SW PR + +P F E+C+ + AK RL PS LALR E + T
Sbjct: 98 ESGNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQET 157
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
+ RTS GTF+S +D G++ +E K A+ T LP +HGE FNVLRY+ GQ YDSHYD F
Sbjct: 158 ENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIF 217
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
P YGPQ SQR+A+ L YL+DVEEGGET+FP E
Sbjct: 218 EPESYGPQPSQRMATILFYLTDVEEGGETIFPLE 251
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 82/155 (52%), Positives = 107/155 (69%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP +G IPFQ+LSW PR + FP F + + I+ A K + PS LA R GE
Sbjct: 2 LPAAASGSDVTYIIPFQILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQ 61
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
VES++ TRTS+GTF+S+ D G+L +E +IA AT+LP +GEAFNVL YE Q YDSH
Sbjct: 62 VESSQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSH 121
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
D+F+P ++GPQ SQR+A+ LLYLS+V EGGET+F
Sbjct: 122 MDSFDPKDFGPQPSQRIATVLLYLSEVLEGGETVF 156
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 82/158 (51%), Positives = 102/158 (64%), Gaps = 12/158 (7%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
L HGE+G + P Q+LS PRA P F S +QC +IA A++RL PS LA + G+T
Sbjct: 116 LEHGESGHSFYTVQPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDT 175
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
E+T+ ED G+L IE K+A TM+P HGE FNVLRYE Q YDSH
Sbjct: 176 AENTR------------DEDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSH 223
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
YD+F+ EYGPQ SQR+A+ LLYL+DVEEGGET+F E
Sbjct: 224 YDSFSEEEYGPQFSQRIATVLLYLADVEEGGETVFLLE 261
>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 162
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 93/161 (57%), Positives = 120/161 (74%), Gaps = 9/161 (5%)
Query: 11 KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE-SG--------LP 61
KLGLP+ LC FFFL GF+GS+ +S+D + R + R LE E + +G LP
Sbjct: 2 KLGLPSVFFLCFFFFLLGFFGSSLISQDYANARLRARILESSNGEEDITGGSSFGFDLLP 61
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
G+TGD SI SIPFQVLSWRPR LYFP FA+A+ C++II+ A+ +LKPS+LALR+GET++
Sbjct: 62 SGDTGDDSISSIPFQVLSWRPRVLYFPKFATADHCETIISIARSQLKPSRLALRKGETLD 121
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
ST+ RTSSG FISA EDKTGIL+ I+ KIARATM+P+ +G
Sbjct: 122 STREIRTSSGMFISADEDKTGILDFIDEKIARATMIPRANG 162
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 84/130 (64%), Gaps = 33/130 (25%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE----------------------- 163
R SG FISASEDKTG L+LIE KIAR M+P+THGE
Sbjct: 91 RLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVMK 150
Query: 164 ----------AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
AFN+LRYEIGQ+Y+SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETM
Sbjct: 151 RFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETM 210
Query: 214 FPFEVDLQIS 223
FPFE L +
Sbjct: 211 FPFENGLNMD 220
>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
Length = 1036
Score = 135 bits (339), Expect = 2e-29, Method: Composition-based stats.
Identities = 68/150 (45%), Positives = 92/150 (61%), Gaps = 11/150 (7%)
Query: 47 RTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
RTL ++ + GL H + Q LSW PR Y PNFA+ +QC+++I AK +
Sbjct: 776 RTLITLDSSDSQGLVH-------VLMWNDQGLSWNPRVFYLPNFATKQQCEAVIDMAKPK 828
Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
LKPS LALR+ E+ ED++G+L IE KIA AT P+ + E+FN
Sbjct: 829 LKPSTLALRK----ETKHFQMQYRSLHQHTDEDESGVLAAIEEKIALATRFPKDYYESFN 884
Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+LRY++GQKYDSHYDAF+ AEYGP +SQR+
Sbjct: 885 ILRYQLGQKYDSHYDAFHSAEYGPLISQRV 914
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/161 (46%), Positives = 101/161 (62%), Gaps = 3/161 (1%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G++G+A ++IPFQ+LS PR FPNF + + IIA A K + PS LA R GE VE+
Sbjct: 203 GDSGEAFYRTIPFQILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEA 262
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
+ RTS GTF+ D + L +E KIA T +P+ +GE +NVL Y+ Q YDSH D+
Sbjct: 263 EQQVRTSKGTFLGG--DSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDS 320
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFEVDLQI 222
F+P EYG Q SQR+A+ ++ LSD GGET+F E I
Sbjct: 321 FDPKEYGQQYSQRIATVIVVLSDEGLVGGETVFKREGKANI 361
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 120/214 (56%), Gaps = 15/214 (7%)
Query: 4 KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR-PKLRTLEVVEKENESGLPH 62
K K S+ L P +L+ C FL F TFL + ++R PKL L + + H
Sbjct: 3 KFKHSNVGLRKP-SLITCWTLFLTLFVTFTFLILIILTLRIPKLNHL--------NSITH 53
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVE 121
T Q++SW PRA + NF + E+C+ +I AK + S+ + + G+++
Sbjct: 54 SNTLRNDDNKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLN 113
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S+ RTSSGTF+ D+ I+ IE +IA T +P HGE+FNVL YE+GQKY+ HYD
Sbjct: 114 SS--IRTSSGTFLDREGDE--IVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYD 169
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F QR+A+ L+YLSDVEEGGET+FP
Sbjct: 170 YFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFP 203
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 97/154 (62%), Gaps = 10/154 (6%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
G+ GD ++ I SW PRA + NF S E+C +I+ AK ++ S + + G+ VE
Sbjct: 32 GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 86
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
+ RTSSG F++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD
Sbjct: 87 DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 142
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F+ ++ QR+A+ L+YLSDVEEGGET+FP
Sbjct: 143 FFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFP 176
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 96/154 (62%), Gaps = 10/154 (6%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVE 121
G+ GD ++ I SW PRA + NF S E+C +I+ AK ++ S + + GE+V+
Sbjct: 67 GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVD 121
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG F++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD
Sbjct: 122 SR--VRTSSGMFLNRGQDK--IIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + QR+A+ L+YLSDVEEGGET+FP
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFP 211
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 93/141 (65%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+ +SW+PRA F NF S+E+C +I A+ +K S + Q G++ +S RTSSGTF+
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR--VRTSSGTFL 104
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D+ I+ IE +IA+ T +P+ HGE VL YE+GQKYD+H+D F+ Q
Sbjct: 105 RRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQ 162
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 163 RVATVLMYLSDVEEGGETVFP 183
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 93/141 (65%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+ +SW+PRA F NF S+E+C +I A+ +K S + Q G++ +S RTSSGTF+
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR--VRTSSGTFL 104
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D+ I+ IE +IA+ T +P+ HGE VL YE+GQKYD+H+D F+ Q
Sbjct: 105 RRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQ 162
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 163 RVATVLMYLSDVEEGGETVFP 183
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 95/154 (61%), Gaps = 10/154 (6%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
G+ GD ++ I SW PRA + NF S E+C +I+ AK ++ S + + G+ VE
Sbjct: 67 GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 121
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
+ RTSSG F++ +DK I+ IE +IA T +P HGE +L YE+GQKYD+HYD
Sbjct: 122 DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + QR+A+ L+YLSDVEEGGET+FP
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFP 211
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 93/141 (65%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
Q++SW PRA + NF + ++C+ +I TAK ++ S + + G++ +S+ RTSSGTF+
Sbjct: 68 QIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSS--VRTSSGTFL 125
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ I+ IE +IA T +P +GE+FNVLRYE+GQKYD H D F Q
Sbjct: 126 DRGGDE--IVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNTVNGGQ 183
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 184 RIATMLMYLSDVEEGGETVFP 204
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 126 bits (316), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 89/141 (63%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
Q++SW PRA + NF + E+C+ +I AK + S + + G V+S + RTSSG F+
Sbjct: 116 QIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRE--RTSSGAFL 173
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ I++ IE +IA T +P HGE FNVL YE+GQKY+ HYD F Q
Sbjct: 174 KRGSDR--IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQ 231
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 232 RIATMLMYLSDVEEGGETVFP 252
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 62/140 (44%), Positives = 87/140 (62%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C +I+ AK +K S + + ++ RTSSG F+
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSR-VRTSSGMFLR 155
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I++ IE +IA T +P HGE VL YE+GQKY+ H+D F+ QR
Sbjct: 156 RGQDK--IIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQR 213
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLSDVE+GGET+FP
Sbjct: 214 IATLLMYLSDVEDGGETVFP 233
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/141 (47%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I++ IE +IA T +P HGE VL YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RMATMLMYLSDVEEGGETVFP 213
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/141 (47%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I++ IE +IA T +P HGE VL YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RMATMLMYLSDVEEGGETVFP 213
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/140 (45%), Positives = 87/140 (62%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++SW PRA + NF S E+C+ +I+ AK +K S + + + ++ RTSSG F+
Sbjct: 78 EIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSR-VRTSSGMFLR 136
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
DK I+ IE +IA T +P HGE VL YE+GQKYD+HYD F QR
Sbjct: 137 RGRDK--IIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLSDVEEGGET+FP
Sbjct: 195 IATLLMYLSDVEEGGETVFP 214
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+VLSW PRA + NF S E+C+ +I+ AK + S + ET +S RTSSGTF+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I++ IE +IA T +P HGE +L YE GQKY+ HYD F Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RMATMLMYLSDVEEGGETVFP 213
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 65/146 (44%), Positives = 87/146 (59%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK R++ S + V+ST G RTS
Sbjct: 108 EVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV-------VDSTTGKSKDSRVRTS 160
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 161 SGMFLRRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 218
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 219 KNGGQRMATILMYLSDVEEGGETIFP 244
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 91/141 (64%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
++LSW PRA + NF S E+C+ +I AK ++ K S + + G++ ES RTSSG F+
Sbjct: 80 EILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESR--VRTSSGMFL 137
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I++ IE +IA T +P+ +GE +L YE+GQKY+ HYD F Q
Sbjct: 138 KRGKDK--IVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYFLDEFNTKNGGQ 195
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 196 RIATVLMYLSDVEEGGETVFP 216
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 72/166 (43%), Positives = 101/166 (60%), Gaps = 12/166 (7%)
Query: 67 DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
D S QSI P +V LSW+PRA + F SA +C ++ AK +L+ S +A + G++V
Sbjct: 30 DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 89
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG F+S +D+ ++ IE +IA T LP+ +GEA VLRYE G+KY+ HYD
Sbjct: 90 SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 145
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI---SW 224
F+ R+A+ L+YLSDV +GGET+FP D + SW
Sbjct: 146 YFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSW 191
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 87/141 (61%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+++SW PR + NF + E+C+ +I AK ++ S + G++V S+ RTSSGTFI
Sbjct: 71 EIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSS--ARTSSGTFI 128
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK IL IE +IA T +P HGE N+L YE+GQKYD H D F +
Sbjct: 129 DRGYDK--ILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHTDYFEDEVNTKHGGE 186
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 187 RIATMLMYLSDVEEGGETVFP 207
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK ++ S + ET S RTSSGTF+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVV--DSETGRSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S DK + IE +IA + +P HGE VL YE+GQKY+ H+D FN Q
Sbjct: 135 SRGRDKK--IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFNDEFNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RVATLLMYLSDVEEGGETVFP 213
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE KIA T +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YL+DVEEGGET+FP
Sbjct: 195 RIATVLMYLTDVEEGGETVFP 215
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 91/150 (60%), Gaps = 5/150 (3%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
D S +++SW PRA + NF + E+C+ +I AK ++ S + + G++ +S
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 127
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSGTF++ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F
Sbjct: 128 VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 185
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 186 EYNTRNGGQRIATVLMYLSDVEEGGETVFP 215
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/214 (38%), Positives = 116/214 (54%), Gaps = 16/214 (7%)
Query: 7 RSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRP---KLRTL-EVVEKENESGLPH 62
R S++ L A+L+ S F + LS VPS K L +V K + G+
Sbjct: 12 RKSSRSTLVFAVLIMSTFVILILLAFGILS--VPSNNAGSSKANDLTSIVRKTLQRGVE- 68
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVE 121
D S +++SW PRA + NF + E+C+ +I AK ++ S + + G++ +
Sbjct: 69 ----DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTD 124
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSGTF++ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD
Sbjct: 125 SR--VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYD 180
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F QR+A+ L+YLSDVEEGGET+FP
Sbjct: 181 YFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFP 214
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 89/146 (60%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C+ +I+ AK +K S + V+S G RTS
Sbjct: 96 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 148
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGTF+ +DK ++ IE +I+ T +P +GE VL YE+GQKY+ H+D F+
Sbjct: 149 SGTFLRRGQDK--VIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 206
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 207 KNGGQRIATLLMYLSDVEEGGETVFP 232
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 89/141 (63%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+++SW PRA + NF + E+C+ +I AK ++ S + + G++ +S RTSSGTF+
Sbjct: 79 EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F Q
Sbjct: 137 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 195 RIATVLMYLSDVEEGGETVFP 215
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 65/146 (44%), Positives = 89/146 (60%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C+ +I+ AK +K S + V+S G RTS
Sbjct: 90 EVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 142
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGTF+ +DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+
Sbjct: 143 SGTFLRRGQDK--IVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 200
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 201 KNGGQRIATVLMYLSDVEEGGETVFP 226
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 61/140 (43%), Positives = 87/140 (62%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLSDVEEGGET+FP
Sbjct: 228 IATLLMYLSDVEEGGETIFP 247
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 91/154 (59%), Gaps = 16/154 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ +IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
QR+A+ L+YLSDVEEGGET+FP + ++ +S
Sbjct: 209 KNGGQRMATLLMYLSDVEEGGETIFP-DANVNVS 241
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 61/140 (43%), Positives = 87/140 (62%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 12 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 70
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 71 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 128
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLSDVEEGGET+FP
Sbjct: 129 IATLLMYLSDVEEGGETIFP 148
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 61/140 (43%), Positives = 87/140 (62%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLSW PRA + NF S E+C+ +I+ AK +K S + + ++ RTSSG F+
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+DK I+ IE +I+ T +P +GE VL YE+GQKY+ H+D F+ QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLSDVEEGGET+FP
Sbjct: 228 IATLLMYLSDVEEGGETIFP 247
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 86/146 (58%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVE+GGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFP 233
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 84/146 (57%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFP 236
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 89/146 (60%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK ++ S + V+S+ G RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGTF++ +DK I+ IE +++ T LP HGE +L YE+GQKY+ HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFP 214
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 80 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE +IA + +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YL+DVEEGGET+FP
Sbjct: 196 RIATVLMYLTDVEEGGETVFP 216
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE KI+ T +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YL+DVEEGGET+FP
Sbjct: 195 RIATVLMYLTDVEEGGETVFP 215
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 62/141 (43%), Positives = 89/141 (63%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+V+SW PRA + NF S ++C+ +I AK ++ S + G++ +S RTSSGTF+
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSR--VRTSSGTFL 135
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ +DK I+ IE +++ T LP HGE +L YE+GQKY+ HYD F Q
Sbjct: 136 TRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNTKNGGQ 193
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 194 RMATVLMYLSDVEEGGETVFP 214
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 84/146 (57%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFP 236
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 94/142 (66%), Gaps = 5/142 (3%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 133
+V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S RTSSGTF
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSR--VRTSSGTF 137
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+ D+ I+E IE++I+ T +P +GE VL YE+GQKY+ H+D F +
Sbjct: 138 LKRGHDE--IVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVRKGG 195
Query: 194 QRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDV+EGGET+FP
Sbjct: 196 QRIATVLMYLSDVDEGGETVFP 217
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 96/154 (62%), Gaps = 9/154 (5%)
Query: 67 DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
D S QSI P +V LSW+PRA + F SA +C ++ AK +L+ S +A + G++V
Sbjct: 16 DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 75
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S RTSSG F+S +D+ ++ IE +IA T LP+ +GEA VLRYE G+KY+ HYD
Sbjct: 76 SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 131
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F+ R+A+ L+YLSD +GGET+FP
Sbjct: 132 YFHDKYNQALGGHRIATVLMYLSDAVKGGETVFP 165
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 85/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF S E+C+ +I AK R+ S + ET +S RTSSG F+
Sbjct: 105 EVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVV--DSETGKSKDSRVRTSSGMFL 162
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F Q
Sbjct: 163 QRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQ 220
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSD+EEGGET+FP
Sbjct: 221 RMATILMYLSDIEEGGETIFP 241
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 85/146 (58%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRMATLLMYLSDVEEGGETIFP 233
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 86/146 (58%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 154 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 206
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 207 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 264
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVE+GGET+FP
Sbjct: 265 KNGGQRIATLLMYLSDVEDGGETVFP 290
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 84/146 (57%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C +I AK + S + V+ST G RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFP 236
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 87/141 (61%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+++SW PRA + NF S E+C+ +IA AK + K + + + G + +S RTSSG F+
Sbjct: 79 EIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSR--VRTSSGMFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DK I+ IE +IA + +P HGE VL YE+GQKY++HYD F Q
Sbjct: 137 RRGRDK--IIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R A+ L+YLSDVEEGGET+FP
Sbjct: 195 RTATLLMYLSDVEEGGETVFP 215
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 95/142 (66%), Gaps = 5/142 (3%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 133
+V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S RTSSGTF
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSR--VRTSSGTF 137
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
++ D+ I+E IE++I+ T +P +GE VL YE+GQ+Y+ H+D F +
Sbjct: 138 LNRGHDE--IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGG 195
Query: 194 QRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDV+EGGET+FP
Sbjct: 196 QRIATVLMYLSDVDEGGETVFP 217
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S +C ++ AK RL+ S +A G++V S RTSSGTF++
Sbjct: 34 LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQ--VRTSSGTFLNK 91
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VL YE+GQKYD+H+D F+ R+
Sbjct: 92 HEDE--IISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKLGGHRV 149
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 150 ATVLMYLTDVKKGGETVFP 168
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 95/150 (63%), Gaps = 11/150 (7%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK-GTRTS 129
+S P + + RP+A F NF + E+C+ +IA AK +L PS + G+ +STK G RTS
Sbjct: 225 RSPPLVLSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGK--KSTKSGIRTS 282
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NP 185
+G F++ + T + ++E ++A A LP+ +GE +LRYE GQKYD HYD F NP
Sbjct: 283 AGMFLTKGQ--TPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINP 340
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ + QR+A+ L+YL D EEGGET+FP
Sbjct: 341 SPN--RGGQRMATMLIYLKDTEEGGETIFP 368
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 87/141 (61%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I+ AK ++ S + ET +S RTSSGTF+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DKT + IE +++ + +P HGE VL YE+GQKY+ H+D F Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RIATVLMYLSDVEEGGETVFP 213
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 88/141 (62%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
++LSW PRA + NF S E+C+ +I AK L K S + + G++ ES RTSSG F+
Sbjct: 70 EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESR--VRTSSGMFL 127
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I++ IE +IA T +P +GE VL Y +G+KY+ HYD F Q
Sbjct: 128 KRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 185
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 186 RVATVLMYLSDVEEGGETVFP 206
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/140 (42%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+SW+PRA + NF + +C+ +I AK R++ S + + +K RTSSGTF+
Sbjct: 81 EVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSK-VRTSSGTFLP 139
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
DK I+ IE +IA + +P HGE +L YE+GQ+Y+ H+D F QR
Sbjct: 140 RGRDK--IVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYFMDEYNTKNGGQR 197
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLSDVEEGGET+FP
Sbjct: 198 IATVLMYLSDVEEGGETVFP 217
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 87/141 (61%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I+ AK ++ S + ET +S RTSSGTF+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
DKT + IE +++ + +P HGE VL YE+GQKY+ H+D F Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RIATVLMYLSDVEEGGETVFP 213
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S ++C ++ AK R++ S +A G+++ S RTSSGTF+S
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+ IE ++A T LP+ + E+ +L YE+GQKYD+H+D F+ + R+
Sbjct: 98 HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 85/146 (58%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFP 233
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 88/139 (63%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S +C +I AK ++ S +A G+++ S RTSSG F++
Sbjct: 38 LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQ--VRTSSGAFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYEIGQKYD+H+D F+ QR
Sbjct: 96 HEDE--IVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKHGGQRF 153
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 154 ATVLMYLTDVKKGGETVFP 172
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 85/146 (58%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFP 233
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 89/141 (63%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VLSW PRA + NF + ++C+ +I AK + S + + G +++S RTSSG F+
Sbjct: 92 EVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSN--VRTSSGWFL 149
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ +DK I+ IE +IA + +P HGE +VL YE+ QKYD+HYD F+ Q
Sbjct: 150 NRGQDK--IIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQ 207
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R A+ L+YLSDVE+GGET+FP
Sbjct: 208 RGATMLMYLSDVEKGGETVFP 228
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 90/150 (60%), Gaps = 7/150 (4%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
D S +++SW PRA + NF E+C+ +I AK ++ S + + G++ +S
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 125
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSGTF++ DKT + IE +I+ T +P HGE VL YEIGQKY+ HYD F
Sbjct: 126 VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 183
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 184 EYNTRNGGQRIATVLMYLSDVEEGGETVFP 213
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/146 (42%), Positives = 85/146 (58%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S ++C+ +I AK + S + V+ST G RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +K ++ IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 150 SGMFLQRGRNK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFP 233
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 5/156 (3%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETV 120
+ + +++ + + +SW+PRA + F S E+C +I+ AK LK S +A G++
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
S RTSSG FI +D I+ IE KIA T LP+ +GE VLRYE GQKYD+HY
Sbjct: 84 LSE--VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHY 139
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
D F + R+A+ L+YLSDV +GGET+FP
Sbjct: 140 DYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPM 175
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
++LSW PRA + NF S E+C+ +I AK + K + + + G + +S RTSSG F+
Sbjct: 78 EILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSR--VRTSSGMFL 135
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ ++ IE +IA + +P HGE VL YE+GQKY++H+D F Q
Sbjct: 136 RRGRDR--VIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQ 193
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R A+ L+YLSDVEEGGET+FP
Sbjct: 194 RTATLLMYLSDVEEGGETVFP 214
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+SWRPRA + NF + E+C I AK +L+ S +A + G++VES RTSSG F
Sbjct: 65 ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESE--VRTSSGMFFRK 122
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ ++ +E +IA T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 123 AQDQ--VVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRV 180
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLSDVE+GGET+FP
Sbjct: 181 ATVLMYLSDVEKGGETVFP 199
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/144 (43%), Positives = 92/144 (63%), Gaps = 6/144 (4%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P + + +P+A F NF SAE+C ++ AK L PS + G +V ST RTS+G F
Sbjct: 278 PLVLSAVKPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAGGTSVPST--IRTSAGMF 335
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM 192
+ + DKT LE IE++IA A+ P+ +GE +LRY++GQKYD H+D F+ A P+
Sbjct: 336 LRKAADKT--LENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKR 393
Query: 193 -SQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YL + +EGGET+FP
Sbjct: 394 GGQRMATMLIYLENTKEGGETIFP 417
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 5/156 (3%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETV 120
+ + +++ + + +SW+PRA + F S E+C +I+ AK LK S +A G++
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
S RTSSG FI +D I+ IE KIA T LP+ +GE VLRYE GQKYD+HY
Sbjct: 84 LSE--VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHY 139
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
D F + R+A+ L+YLSDV +GGET+FP
Sbjct: 140 DYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPM 175
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 92/152 (60%), Gaps = 14/152 (9%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG----ETVESTKGTRTSS 130
+++SW+PRAL F + +C +I+ A+ RL+PS++ R G ++V + +G +SS
Sbjct: 15 IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGL-SSS 73
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-- 188
GTF++ +D ++ +E +I AT LP +H E VL+YE+GQKY +HYD E
Sbjct: 74 GTFLTKRQDS--VVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQ 131
Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFP 215
G Q R A+ L+YLSDVEEGGET FP
Sbjct: 132 LAIRRGEQGGSRYATMLMYLSDVEEGGETSFP 163
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 96/153 (62%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
G +S++ P +V LSW PRA + F S E+C +I AK +L+ S +A + G+++ S
Sbjct: 45 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMS 104
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F++ ++D+ I+ IE +IA T LP +GE+ +L YE GQKY+ H+D
Sbjct: 105 E--VRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDY 160
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F+ R+A+ L+YLSDVE+GGET+FP
Sbjct: 161 FHDKANQVMGGHRIATVLMYLSDVEKGGETIFP 193
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 90/141 (63%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+V+SW PRA+ + NF S E+C+ +I AK + K + + + G + +S RTSSGTF+
Sbjct: 81 EVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSR--VRTSSGTFL 138
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDV++GGET+FP
Sbjct: 197 RIATVLMYLSDVDDGGETVFP 217
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 90/143 (62%), Gaps = 5/143 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + NF S +C +I+ AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 9 LSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSE--IRTSSGMFLMK 66
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE +IA T LP+ +GEA VLRY+ G+KY+ H+D F+ R+
Sbjct: 67 GQDD--IISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRI 124
Query: 197 ASFLLYLSDVEEGGETMFPFEVD 219
A+ L+YLSDV +GGET+FP D
Sbjct: 125 ATVLMYLSDVVKGGETVFPSSED 147
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 95/153 (62%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
G +S++ P +V LSW PRA + F S E+C +I AK +L+ S +A G+++ S
Sbjct: 46 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMS 105
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F++ ++D+ I+ IE +IA T LP +GE+ +L YE GQKY+ H+D
Sbjct: 106 D--IRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDY 161
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F+ R+A+ L+YLSDVE+GGET+FP
Sbjct: 162 FHDKANQVMGGHRIATVLMYLSDVEKGGETIFP 194
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 96/143 (67%), Gaps = 9/143 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
+SWRPRA + NF S E+C+ + A+KRL K + + + G++++ST RTSSGTF++
Sbjct: 41 VSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDST--VRTSSGTFLAR 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---S 193
ED+ ++ IE +I+ TM+P+ +GEA +L+Y GQKY+ H D F+ +Y +
Sbjct: 99 GEDE--VVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH-DKYNSRTENGG 155
Query: 194 QRLASFLLYLSDVEEGGETMFPF 216
QR+A+ L+YLS EEGGET+FP+
Sbjct: 156 QRVATILMYLSTPEEGGETVFPY 178
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 57/140 (40%), Positives = 85/140 (60%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++SW PRA NF + ++C +I A ++ S + Q ++ RTSSG F++
Sbjct: 3 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 61
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE KIA+ T +P+ HGE VL YE GQKYD+H+D F QR
Sbjct: 62 RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 119
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YL+DVEEGGET+FP
Sbjct: 120 IATLLMYLTDVEEGGETVFP 139
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 91/141 (64%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+V+SW PRA+ + NF + E+C+ +I+ AK + K + + + G + +S RTSSGTF+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR--VRTSSGTFL 138
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDV++GGET+FP
Sbjct: 197 RIATVLMYLSDVDDGGETVFP 217
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 91/141 (64%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+V+SW PRA+ + NF + E+C+ +I+ AK + K + + + G + +S RTSSGTF+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR--VRTSSGTFL 138
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ ++E+IE +I+ T +P +GE VL Y++GQKY+ HYD F Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDV++GGET+FP
Sbjct: 197 RIATVLMYLSDVDDGGETVFP 217
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/139 (46%), Positives = 89/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F S E+C +I AK +L S +A + GE++ES + RTSSG FI
Sbjct: 21 LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQE--RTSSGMFIFK 78
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+ED+ I+ IE +IA T LP+ +GE +LRYE GQKY++H D F + R
Sbjct: 79 TEDE--IVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRA 136
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLSDV++GGET+FP
Sbjct: 137 ATVLMYLSDVKKGGETVFP 155
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 57/140 (40%), Positives = 85/140 (60%), Gaps = 3/140 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++SW PRA NF + ++C +I A ++ S + Q ++ RTSSG F++
Sbjct: 15 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 73
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE KIA+ T +P+ HGE VL YE GQKYD+H+D F QR
Sbjct: 74 RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 131
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YL+DVEEGGET+FP
Sbjct: 132 IATLLMYLTDVEEGGETVFP 151
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/189 (35%), Positives = 102/189 (53%), Gaps = 8/189 (4%)
Query: 32 STFLSRDVPSIRPKLRTLEVVEKENESGLPHG----ETGDASIQSIPF-QVLSWRPRALY 86
S +R PS ++L V + S + H E+GD ++ + + +SW PRA
Sbjct: 7 SRISTRKSPSSTLIRKSLNVHFPNDLSSIAHNSKIHESGDDEGKAEQWVEAISWEPRAFI 66
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ NF + +C +I AK ++ S + + ++ RTSSGTF+ DK I+
Sbjct: 67 YHNFLTKAECDYLINLAKPHMQKSMVVDSSSGKSKDSR-VRTSSGTFLPRGRDK--IIRD 123
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDV 206
IE +IA + +P HGE +L YE+GQKY+ H+D F QR+A+ L+YLSDV
Sbjct: 124 IEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFDYFMDDYNTENGGQRIATVLMYLSDV 183
Query: 207 EEGGETMFP 215
EEGGET+FP
Sbjct: 184 EEGGETVFP 192
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 90/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PRA + F S E+C +I A+ +L+ S +A + G+++ES RTSSG FI+
Sbjct: 52 LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESE--VRTSSGMFIAK 109
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ I+ IE +IA T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 110 AQDE--IVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRV 167
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 168 ATVLMYLSNVEKGGETVFP 186
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 87/138 (63%), Gaps = 3/138 (2%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA + F + E+C +I+ AK LK S +A + ++++ RTSSG FI +
Sbjct: 36 ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSE-VRTSSGMFIPKA 94
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRYE GQKY+ HYD F + RLA
Sbjct: 95 KDP--IVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDYFVDKVNIARGGHRLA 152
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YL++VE+GGET+FP
Sbjct: 153 TVLMYLTNVEKGGETVFP 170
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 91/151 (60%), Gaps = 3/151 (1%)
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
T + I + +SW+PRA + F + +C +I+ AK LK S +A + + ++
Sbjct: 29 TSSSIINPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE 88
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTSSG FI+ ++D I+ IE KIA T LP+ +GE VLRYE GQKYD HYD F+
Sbjct: 89 -VRTSSGMFITKAKDP--IVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFS 145
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ R+A+ L+YL+DVE+GGET+FP
Sbjct: 146 DKVNIARGGHRVATVLMYLTDVEKGGETVFP 176
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 97/154 (62%), Gaps = 6/154 (3%)
Query: 64 ETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
+TG +++ P +V LS +PRA + F SAE+CQ +I +AK +L S +A G++V
Sbjct: 47 KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVT 106
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S + RTS+G F+ ++D+ I+ IE +IA T LP +GE +LRYE GQKY+ H+D
Sbjct: 107 SKE--RTSTGMFLHKAQDE--IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFD 162
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F R+A+ L+YLS+VE+GGET+FP
Sbjct: 163 FFQDPGNIAIGGHRIATILMYLSNVEKGGETVFP 196
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 89/150 (59%), Gaps = 12/150 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVES---------TKG 125
+VLSW PRA+ + NF + E+C+ +I AK + K + + G++ +S
Sbjct: 82 EVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSR 141
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F++ +DKT + IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 142 VRTSSGMFLNRGQDKT--IRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLD 199
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVE+GGET+FP
Sbjct: 200 EFNTKNGGQRIATVLMYLSDVEKGGETVFP 229
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/161 (40%), Positives = 95/161 (59%), Gaps = 5/161 (3%)
Query: 57 ESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ 116
ES + + A++ + +SW+PRA + F + +C +++ A+ LK S++A
Sbjct: 21 ESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADND 80
Query: 117 -GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
G++ ST RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQK
Sbjct: 81 SGKSKLST--VRTSSGMFISKNKDP--IVSGIEDKISAWTFLPKENGEDIQVLRYEHGQK 136
Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y+SHYD F RLA+ L+YLS+V +GGET+FP
Sbjct: 137 YESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPL 177
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 91/155 (58%), Gaps = 5/155 (3%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETV 120
H E + ++ +V+SW+PRA + NF + E+C+ +I A ++ S +A Q G++V
Sbjct: 61 HIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSV 120
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
R S+G F+ +D+ I+ IE +IA T +P +GE V+ YE+GQ YD HY
Sbjct: 121 --VHDVRKSTGAFLDRGQDE--IVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHY 176
Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
D F QR+A+ L+YLS+VEEGGETMFP
Sbjct: 177 DYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFP 211
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 88/138 (63%), Gaps = 3/138 (2%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA + F + E+C +I+ AK LK S +A + + ++ RTSSG FIS +
Sbjct: 40 ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSE-VRTSSGAFISKA 98
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I++ IE K+A T LP +GE VLRYE GQKY++H+D F+ + R A
Sbjct: 99 KD--AIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIARGGHRYA 156
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YLS+VE+GG+T+FP
Sbjct: 157 TVLMYLSNVEKGGDTVFP 174
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 91/149 (61%), Gaps = 10/149 (6%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
++ P + + +P+A NF SAE+C ++ AK+ L PS + G++V S RTS+
Sbjct: 38 ETPPLVLSATQPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSD--IRTSA 95
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPA 186
G F+ +DK I++ IE +IAR + P +GE +LRY++GQKYD H+D F NPA
Sbjct: 96 GMFLRKGQDK--IVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPA 153
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ QRLA+ L+YL D ++GGET FP
Sbjct: 154 PK--RGGQRLATMLIYLVDTDKGGETTFP 180
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS RPRA + F S +C I++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 154 ATVLMYLTDVKKGGETVFP 172
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 87/141 (61%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+++SW PR + NF + E+C+ +I AK ++ S + + G ++ES RTSSGTF+
Sbjct: 86 EIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESR--VRTSSGTFL 143
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE++IA T +P +GE VL Y++G+KY H+D F
Sbjct: 144 ARGRDK--IVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGGD 201
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 202 RIATMLMYLSDVEEGGETVFP 222
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PRA + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T +P+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 175 ATVLMYLSNVEKGGETVFP 193
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 92/148 (62%), Gaps = 6/148 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F + E+C +I AK +L+ S +A + G+++ S RTSSG F+
Sbjct: 58 LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSE--VRTSSGMFLQK 115
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D ++ IE +IA T LP +GEA +L YE GQKY+ H+D F+ R+
Sbjct: 116 AQDD--VVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQLGGHRI 173
Query: 197 ASFLLYLSDVEEGGETMFP-FEVDLQIS 223
A+ L+YLS+VEEGGET+FP E LQ++
Sbjct: 174 ATVLMYLSNVEEGGETVFPNAEAKLQLA 201
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 95/152 (62%), Gaps = 7/152 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
G +S++ P +V LSW PRA + F S E+C +I AK +L+ S +A + G+++ S
Sbjct: 45 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMS 104
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F++ ++D+ I+ IE +IA T LP +GE+ +L YE GQKY+ H+D
Sbjct: 105 E--VRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDY 160
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
F+ R+A+ L+YLSDVE+GGET+F
Sbjct: 161 FHDKANQVMGGHRIATVLMYLSDVEKGGETIF 192
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 88/139 (63%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PRA + F S E+C ++ A+ +L+ S +A + G+++ES RTSSG FI
Sbjct: 49 LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESE--VRTSSGMFIGK 106
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
S+D+ I++ IE +IA T LPQ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 107 SQDE--IVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRV 164
Query: 197 ASFLLYLSDVEEGGETMFP 215
+ L+YLS+V +GGET+FP
Sbjct: 165 VTVLMYLSNVGKGGETVFP 183
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 85/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 175 ATVLMYLSNVEKGGETVFP 193
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 88/139 (63%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS RPRA + F S +C +++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV +GGET+FP
Sbjct: 154 ATVLMYLTDVNKGGETVFP 172
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 85/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 175 ATVLMYLSNVEKGGETVFP 193
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/139 (45%), Positives = 85/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG FI
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE--VRTSSGAFIHK 101
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE KIA T LP+ +GE VLRYE GQKYD+H+D F + R+
Sbjct: 102 AKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRM 159
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLSDVE+GGET+FP
Sbjct: 160 ATVLMYLSDVEKGGETVFP 178
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/140 (43%), Positives = 85/140 (60%), Gaps = 5/140 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 75 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 132
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 133 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 190
Query: 197 ASFLLYLSDVEEGGETMFPF 216
A+ L+YLS+VE+GGET+FP
Sbjct: 191 ATVLMYLSNVEKGGETVFPM 210
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/149 (42%), Positives = 88/149 (59%), Gaps = 5/149 (3%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
A I + +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 33 AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F
Sbjct: 91 RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ R+A+ L+YL++V +GGET+FP
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFP 177
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/149 (42%), Positives = 88/149 (59%), Gaps = 5/149 (3%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
A I + +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 33 AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F
Sbjct: 91 RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ R+A+ L+YL++V +GGET+FP
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFP 177
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 91/149 (61%), Gaps = 8/149 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA F S E+C+ IIA AK R+ K S + G++V+S RTS+G +++
Sbjct: 57 LSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSE--IRTSTGAWLAK 114
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
ED+ I+ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P P+ Q
Sbjct: 115 GEDE--IISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+ + L+YL+ VEEGGET+ P D ++S
Sbjct: 173 RVVTVLMYLTTVEEGGETVLP-HADQKVS 200
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 86/141 (60%), Gaps = 6/141 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFIS 135
LSW PRA + F S E+C +I AK +L+ S + GE+++S + RTSSG F++
Sbjct: 35 LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEE--RTSSGVFLT 92
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D + E R
Sbjct: 93 KRQDD--IVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHR 150
Query: 196 LASFLLYLSDVEEGGETMFPF 216
+A+ L+YLS+V +GGET+FP
Sbjct: 151 IATVLMYLSNVTKGGETVFPM 171
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 91/153 (59%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 26 GSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 85
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FIS ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 86 D--VRTSSGMFISKNKDP--IISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 141
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + R+A+ L+YL++V +GGET+FP
Sbjct: 142 FTDKVNIARGGHRIATVLMYLTNVTKGGETVFP 174
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+ + L+YL+ VEEGGET+ P
Sbjct: 141 RVVTMLMYLTTVEEGGETVLP 161
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 88/139 (63%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS RPRA + F S +C +++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED+ I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+ QR+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGE +FP
Sbjct: 154 ATVLMYLTDVKKGGEAVFP 172
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 17 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 74
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 75 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+ + L+YL+ VEEGGET+ P
Sbjct: 133 RVVTMLMYLTTVEEGGETVLP 153
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 85/140 (60%), Gaps = 5/140 (3%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
++SW+PR ++ F S ++C ++ K++LK S +A + G++V S RTSSG F+
Sbjct: 50 IISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSE--VRTSSGMFLD 107
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D ++ IE +IA T+LPQ + E +LRYE GQKYD H+D F Q R
Sbjct: 108 KQQDP--VVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQGGHR 165
Query: 196 LASFLLYLSDVEEGGETMFP 215
A+ L YLS VE+GGET+FP
Sbjct: 166 YATVLTYLSTVEKGGETVFP 185
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 16 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 73
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 74 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+ + L+YL+ VEEGGET+ P
Sbjct: 132 RVVTMLMYLTTVEEGGETVLP 152
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 45 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 102
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
ED ++ IE ++A+ TM+P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 103 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+ + L+YL+ VEEGGET+ P
Sbjct: 161 RVVTMLMYLTTVEEGGETVLP 181
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 91/153 (59%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 29 GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 88
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FI ++D I+ +E KI+ T+LP+ +GE VLRYE GQKYD HYD
Sbjct: 89 E--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDY 144
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + R+A+ L+YL+DV +GGET+FP
Sbjct: 145 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 177
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 91/153 (59%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 29 GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 88
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FI ++D I+ +E KI+ T+LP+ +GE VLRYE GQKYD HYD
Sbjct: 89 E--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDY 144
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + R+A+ L+YL+DV +GGET+FP
Sbjct: 145 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 177
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 88/139 (63%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 155 ATVLMYLSNVEKGGETIFP 173
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 25 GSASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 84
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FIS ++D I+ IE KI+ T LP+ +GE V RYE GQKYD HYD
Sbjct: 85 D--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDY 140
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + R+A+ L+YL+DV +GGET+FP
Sbjct: 141 FTDKVNIARGGHRIATVLMYLTDVAKGGETVFP 173
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 88/139 (63%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 155 ATVLMYLSNVEKGGETIFP 173
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +I AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 43 LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 100
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 101 KQDE--VVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 158
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 159 ATVLMYLSNVEKGGETIFP 177
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 85/139 (61%), Gaps = 3/139 (2%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW PRA +P+F S ++ +++ A+ LK S +A + ++ RTSSGTFIS
Sbjct: 54 ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSE-VRTSSGTFISKG 112
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R+A
Sbjct: 113 KDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHRVA 170
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ LLYL+DV EGGET+FP
Sbjct: 171 TVLLYLTDVAEGGETVFPL 189
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F S +C +I AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 RQDE--VVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 154 ATVLMYLSNVEKGGETIFP 172
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 94/152 (61%), Gaps = 3/152 (1%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVES 122
E G ++ + +V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S
Sbjct: 46 EAGLLRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKS 105
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D
Sbjct: 106 D--VRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDY 163
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
F+ + QR+A+ L+YL+D EGGET F
Sbjct: 164 FSDTFNLKRGGQRVATMLMYLTDGVEGGETHF 195
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 89/153 (58%), Gaps = 11/153 (7%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
+I + + LSW P A + F + +C+ I A LKPS + + +++ R
Sbjct: 19 GTIDAGAIERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTV-VDASTGGDASSEIR 77
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
TSSG F+ +ED ++E IE +IA T +P++HGE F VLRYE Q+Y +HYD F+
Sbjct: 78 TSSGMFLGRAEDD--VIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKF 135
Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
E G QR+ + L+YLSDVEEGGET+FP
Sbjct: 136 NVKREKG---GQRMGTVLMYLSDVEEGGETVFP 165
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 84/138 (60%), Gaps = 3/138 (2%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA + F + +C +I+ AK LK S +A + + ++ RTSSG FI+
Sbjct: 39 VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE-VRTSSGMFIAKG 97
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F + R+A
Sbjct: 98 KDP--IIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIARGGHRMA 155
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YLSDV +GGET+FP
Sbjct: 156 TVLMYLSDVVKGGETVFP 173
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/138 (44%), Positives = 84/138 (60%), Gaps = 5/138 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG FI
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE--VRTSSGAFIHK 101
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE KIA T LP+ +GE VLRYE GQKYD+H+D F + R+
Sbjct: 102 AKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRM 159
Query: 197 ASFLLYLSDVEEGGETMF 214
A+ L+YLSDVE+GGET+F
Sbjct: 160 ATVLMYLSDVEKGGETVF 177
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 29 GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 88
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FI ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 89 E--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 144
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + R+A+ L+YL+DV +GGET+FP
Sbjct: 145 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 177
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 96/171 (56%), Gaps = 32/171 (18%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGT------- 126
+V+SW PRA + NF + E+C+ +I+ AK + S++ ++ G++++S T
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVF 139
Query: 127 ----------------------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
RTSSGTF++ D+ I+E IE++I+ T +P +GE
Sbjct: 140 TFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDE--IVEEIENRISDFTFIPPENGEG 197
Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
VL YE+GQ+Y+ H+D F + QR+A+ L+YLSDV+EGGET+FP
Sbjct: 198 LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFP 248
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
LSWRPRA F +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 154 ATVLMYLSNVEKGGETIFP 172
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
LSWRPRA F +C +IA AK +L+ S +A + G++V+S RTSSG F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 154 ATVLMYLSNVEKGGETIFP 172
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 92/145 (63%), Gaps = 3/145 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 195 RLASFLLYLSDVEEGGETMFPFEVD 219
R+A+ L+YL+D GGET FP E++
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQEME 200
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 96/168 (57%), Gaps = 5/168 (2%)
Query: 49 LEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
+ ++ + + +G + + A I + +SW+PRA + F + +C +I+ AK LK
Sbjct: 15 VSLICESHGAGSSYAGSASAIIDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELK 74
Query: 109 PSQLALR-QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
S +A G++ S RTSSG FIS ++D I+ IE KI+ T LP+ +GE V
Sbjct: 75 RSAVADNLSGDSKLSD--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQV 130
Query: 168 LRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
LRYE GQKYD HYD F + R+A+ L+YL++V GGET+FP
Sbjct: 131 LRYEHGQKYDPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFP 178
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/140 (42%), Positives = 85/140 (60%), Gaps = 6/140 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS 135
LSW PRA + F S E+C +I AK +L+ S + + GE+ +S RTSSG F++
Sbjct: 35 LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLT 92
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D F + R
Sbjct: 93 KRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHR 150
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLS+V +GGET+FP
Sbjct: 151 IATVLMYLSNVTKGGETVFP 170
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 7/153 (4%)
Query: 66 GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
G AS P +V +SW+PRA + F + +C +I+ AK LK S +A GE+ S
Sbjct: 567 GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 626
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG FI ++D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD
Sbjct: 627 E--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 682
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
F + R+A+ L+YL+DV +GGET+FP
Sbjct: 683 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 715
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/56 (85%), Positives = 54/56 (96%)
Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
A+NVLRYE+GQKY+SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+E D
Sbjct: 1 AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYEND 56
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 96 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 194
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 96 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 194
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 96 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 194
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 90/154 (58%), Gaps = 11/154 (7%)
Query: 67 DASIQSIPFQVLS-WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
+A S P VLS +P+A NF S ++C ++ AK+ L PS + G +V S
Sbjct: 74 EAGFPSTPPLVLSSKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGDGGSSVASE-- 131
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-- 183
RTS+G F+ S+D T + IE +IAR + +P +GE +LRY+ GQKYD H+D F
Sbjct: 132 IRTSAGMFLRKSQDDT--VREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHD 189
Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
NPA + QR+A+ L+YL D EEGGET FP
Sbjct: 190 KVNPAPK--RGGQRVATVLIYLVDTEEGGETTFP 221
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 91/142 (64%), Gaps = 8/142 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
+SWRPRA + F + +C ++A A++ L+ S + RQ G++V S RTSSGTF++
Sbjct: 41 VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSE--VRTSSGTFLA 98
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG--PQMS 193
+D+ ++ IE +IA T+LPQ +GE+ VLRYE GQKY+ H D A G +
Sbjct: 99 KKQDQ--VVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHHSRGG 156
Query: 194 QRLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDV+ GGET+FP
Sbjct: 157 HRVATVLMYLSDVKMGGETVFP 178
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F S +C +I AK +L+ S +A G++V S RTSSG F+
Sbjct: 35 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 92
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ ++ +E +IA T+LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 93 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 150
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 151 ATVLMYLSNVEKGGETIFP 169
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)
Query: 63 GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
GE GD + ++ PF + +SWRPR + F S ++C ++ K++++ S +A
Sbjct: 30 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 89
Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
+ G++V S RTSSG F+ +D ++ IE +IA T LP+ + E +LRYE
Sbjct: 90 DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 145
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
GQKY+ H+D F+ R A+ L+YLS VE+GGET+FP
Sbjct: 146 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 188
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 113/221 (51%), Gaps = 27/221 (12%)
Query: 12 LGLPTALLLCSFFFLAGF-------YGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHG- 63
L TA ++ FF F YG+ F P + + ++ + LP G
Sbjct: 12 LAFVTAGMIVGAFFQLAFILKLEDSYGTKF---------PSFKRVRKLQSDAYLQLPRGI 62
Query: 64 ----ETGDASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LR 115
+A++ I + +++SW PR + +F S+E+C + A AK RL+ S + ++
Sbjct: 63 SHWDNDTEAAVLRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVK 122
Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
G+ +ES RTSSG F+S+ E +++ IE +I+ + +P +GE VLRYE Q
Sbjct: 123 TGKGIESK--VRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQY 180
Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y H+D F+ + QR+A+ L+YLSD EGGET FP
Sbjct: 181 YKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPM 221
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/146 (41%), Positives = 86/146 (58%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
+VL+W PR + F SAE+C +IA A RL S + R G +ES RTS
Sbjct: 61 EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 116
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
+G F+S + + +++ IE +IA +M+P +GE VLRYE Q Y H+D F+
Sbjct: 117 TGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 176
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
+ QR+A+ L+YLSDVEEGGET+FP
Sbjct: 177 KRGGQRVATVLMYLSDVEEGGETIFP 202
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLSDV +GGET+FP
Sbjct: 164 ATVLMYLSDVGKGGETIFP 182
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLSDV +GGET+FP
Sbjct: 164 ATVLMYLSDVGKGGETIFP 182
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 92/149 (61%), Gaps = 3/149 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD--VRTSSGMFV 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+A+ L+YL+D GGET FP D + S
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQAGDGECS 204
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/139 (44%), Positives = 83/139 (59%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW+PRA + F + +C +I+ AK LK S +A G++ S RTSSG FIS
Sbjct: 39 VSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSE--VRTSSGMFISK 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD HYD F + R+
Sbjct: 97 KKDP--IVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVRGGHRM 154
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYL++V GGET+FP
Sbjct: 155 ATVLLYLTNVTRGGETVFP 173
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA F + +C+ +I+ AK +L+ S +A + G++V S RTSSG F+
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+ R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLSDV +GGET+FP
Sbjct: 164 ATVLMYLSDVGKGGETIFP 182
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/146 (42%), Positives = 85/146 (58%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
+VL+W PR F SAE+C +IA A RL S + R G +ES RTS
Sbjct: 60 EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 115
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
+G F+S + + ++E IE +IA +M+P +GE VLRYE Q Y H+D F+
Sbjct: 116 TGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 175
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
+ QR+A+ L+YLSDVEEGGET+FP
Sbjct: 176 KRGGQRVATVLMYLSDVEEGGETIFP 201
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW+PRA + F S +C +I AK +L+ S +A G++V S RTSSG F+
Sbjct: 56 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 113
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ ++ +E +IA T+LP +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 114 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 171
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+VE+GGET+FP
Sbjct: 172 ATVLMYLSNVEKGGETIFP 190
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 91/149 (61%), Gaps = 3/149 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C + + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S+ E K +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+A+ L+YL+D EGGET FP D + S
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECS 208
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 98/164 (59%), Gaps = 7/164 (4%)
Query: 55 ENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
E + + G A ++ P +V LSW PRA + NF + E+C +I +K +L+ S +
Sbjct: 29 EEGNKITRGSVFGAKVKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMV 88
Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
A + G++++S RTSSG F++ +D+ I+ IE +IA T LP +GE+ VL Y
Sbjct: 89 ADNESGKSIQSE--VRTSSGMFLNKQQDE--IVSGIEARIAAWTFLPVENGESMQVLHYM 144
Query: 172 IGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
G+KY+ H+D F+ R+A+ L+YLS+VE+GGET+FP
Sbjct: 145 NGEKYEPHFDFFHDKANQRLGGHRVATVLMYLSNVEKGGETIFP 188
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 91/149 (61%), Gaps = 3/149 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C + + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S+ E K +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+ + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+A+ L+YL+D EGGET FP D + S
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECS 208
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSWRPRA + F S E+C +I AK +L+ S +A + G+++ S RTSSG F+
Sbjct: 57 LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLLK 114
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D+ I+ IE +IA T LP +GE+ +L YE G+KY+ H+D F+ R+
Sbjct: 115 AQDE--IVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLLGGHRI 172
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+ VEEGGET+FP
Sbjct: 173 ATVLMYLATVEEGGETVFP 191
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/139 (44%), Positives = 84/139 (60%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG FIS
Sbjct: 40 ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMFISK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD F Q RL
Sbjct: 98 NKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRL 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL++V +GGET+FP
Sbjct: 156 ATVLMYLTNVTKGGETVFP 174
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/139 (44%), Positives = 84/139 (60%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG FIS
Sbjct: 40 ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMFISK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD F Q RL
Sbjct: 98 NKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRL 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL++V +GGET+FP
Sbjct: 156 ATVLMYLTNVTKGGETVFP 174
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 87/141 (61%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+VLSW PRA + +F + E+C +I A+ L K + + G++ +S RTSSGTF+
Sbjct: 4 EVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSR--LRTSSGTFL 61
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D +++ IE +IA T +P GE VL+Y+ +KY+ HYD F+ A Q
Sbjct: 62 MRGQDP--VIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQ 119
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLS+VEEGGET+FP
Sbjct: 120 RIATVLMYLSNVEEGGETVFP 140
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 84/139 (60%), Gaps = 6/139 (4%)
Query: 79 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFISA 136
SW PRA + F S E+C +I AK +L+ S + + GE+ +S RTSSG F++
Sbjct: 1 SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLTK 58
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GEA +L YE GQKYD H+D F + R+
Sbjct: 59 RQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 116
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+V +GGET+FP
Sbjct: 117 ATVLMYLSNVTKGGETVFP 135
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 88/149 (59%), Gaps = 5/149 (3%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
S P + + P+A F NF + +C+ ++ AKK+L PS + +G +K RTS+G
Sbjct: 167 SPPLVLSNHEPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSK-IRTSAG 225
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
F+ +D T + IE +IA A+ LP+ +GE +LRYE GQKYD H+D F + P
Sbjct: 226 MFLGRGQDPT--VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSP 283
Query: 191 QM-SQRLASFLLYLSDVEEGGETMFPFEV 218
+ QR+A+ L+YL D EGGET+FP V
Sbjct: 284 RRGGQRMATMLIYLEDTTEGGETIFPNGV 312
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 92/158 (58%), Gaps = 10/158 (6%)
Query: 64 ETGDASIQSIPF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-G 117
+ GD + PF + +SW+PR + F S +C ++ AKK+++ S +A Q G
Sbjct: 28 DGGDVVAPAPPFNSSRVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSG 87
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
++V S RTSSG F++ +D ++ IE +IA T LPQ + E +LRYE GQKY+
Sbjct: 88 KSVMSE--VRTSSGMFLNKRQDP--VVSRIEERIAAWTFLPQENAENMQILRYEHGQKYE 143
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
H+D F+ + R A+ L+YLS V++GGET+FP
Sbjct: 144 PHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFP 181
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 87/145 (60%), Gaps = 7/145 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW+PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSD--VRTSSGTFLRK 111
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIRGGHRY 169
Query: 197 ASFLLYLSDVEEGGETMFPF--EVD 219
A+ LLYL+DV EGGET+FP EVD
Sbjct: 170 ATVLLYLTDVAEGGETVFPLAEEVD 194
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 86/147 (58%), Gaps = 11/147 (7%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS +P+A + F +C I AK +L+ S + + G++V S RTS G F
Sbjct: 8 LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSN--IRTSDGMFFDR 65
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA----FNPAEYGPQM 192
ED I+E IE +IA T +P +GE VLRYE+GQKY+ H DA FN E +
Sbjct: 66 HEDD--IIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSDKFNTEE--SKG 121
Query: 193 SQRLASFLLYLSDVEEGGETMFPFEVD 219
QR+A+ L+YLSDVEEGGET+FP VD
Sbjct: 122 GQRMATVLMYLSDVEEGGETVFPRSVD 148
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ S RTSSGTFIS
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ S RTSSGTFIS
Sbjct: 39 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 97 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 154
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYLS+V +GGET+FP
Sbjct: 155 ATVLLYLSNVTKGGETVFP 173
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ S RTSSGTFIS
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 84/139 (60%), Gaps = 3/139 (2%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PR + +F S ++ +++ A+ LK S +A E + RTSSGTFI S
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F+ + R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ L+YL+DV EGGET+FP
Sbjct: 178 TVLMYLTDVAEGGETVFPL 196
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 84/139 (60%), Gaps = 3/139 (2%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PR + +F S ++ +++ A+ LK S +A E + RTSSGTFI S
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ IE KIA T LP+ +GE VLRY+ G+KY+ HYD F+ + R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ L+YL+DV EGGET+FP
Sbjct: 178 TVLMYLTDVAEGGETVFPL 196
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ S RTSSGTFIS
Sbjct: 41 VSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 5/136 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LSW PR + F S E+C I AK +L+ S +A GE+VES RTSSG F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+ R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 197 ASFLLYLSDVEEGGET 212
A+ L+YLS+VE+GGET
Sbjct: 175 ATVLMYLSNVEKGGET 190
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +I+ AK+ L+ S +A GE+ S RTSSGTFIS
Sbjct: 41 VSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE K++ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 99 GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGGHRI 156
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 88/159 (55%), Gaps = 11/159 (6%)
Query: 64 ETGDASIQSIP------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
E G + P + +SW PR + F S +C ++ AKK+++ S +A +
Sbjct: 28 ENGGGGVAPAPPFNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNES 87
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G++V+S RTSSG F+ +D ++ IE +IA T LPQ + E VLRYE GQKY
Sbjct: 88 GKSVKSE--VRTSSGMFLDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKY 143
Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ H+D F+ + R A+ L+YLS V EGGET+FP
Sbjct: 144 EPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFP 182
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/136 (45%), Positives = 82/136 (60%), Gaps = 9/136 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL--ALRQGETVESTKGTRTSSGTFISASED 139
PRA + F + E+C I+A +K L S + A G T T RTS+GTFIS + D
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGST---TSDIRTSTGTFISRAHD 57
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
T + IE +I + +P HGEA VLRYE GQ+Y +H+D F G + + R+A+
Sbjct: 58 PT--ITAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYF--FHKGGKRNNRIATV 113
Query: 200 LLYLSDVEEGGETMFP 215
LLYLSDVEEGGET+FP
Sbjct: 114 LLYLSDVEEGGETVFP 129
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 88/159 (55%), Gaps = 11/159 (6%)
Query: 64 ETGDASIQSIP------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
E G + P + +SW PR + F S +C ++ AKK+++ S +A +
Sbjct: 28 ENGGGGVAPAPPFNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNES 87
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G++V+S RTSSG F+ +D ++ IE +IA T LPQ + E VLRYE GQKY
Sbjct: 88 GKSVKSE--VRTSSGMFLDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKY 143
Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ H+D F+ + R A+ L+YLS V EGGET+FP
Sbjct: 144 EPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFP 182
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 82/149 (55%), Gaps = 11/149 (7%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+V++W+PRAL F S +C II A L+ S + +G ++ RTSSG FI
Sbjct: 42 VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLDE--IRTSSGMFI 99
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D ++ +E ++A T LP +H E VLRYE+GQKY +H+D + E QM
Sbjct: 100 LKGHD--AVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRA 157
Query: 195 -------RLASFLLYLSDVEEGGETMFPF 216
R A+ L+YLSDVEEGGET FP
Sbjct: 158 KGVLGGLRTATLLMYLSDVEEGGETAFPH 186
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 91/149 (61%), Gaps = 3/149 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+V+SW+PR + NF SA++C +I A+ RL K + + G+ +ES RTS+G F+
Sbjct: 79 EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESK--VRTSTGMFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ ++ + ++ IE +IA +M+P +GE VLRYE Q Y +H+D F+ + Q
Sbjct: 137 NGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNLKRGGQ 196
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+A+ L+YL++ EGGET+FP D + S
Sbjct: 197 RVATMLMYLTEGVEGGETIFPQAGDKECS 225
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 84/141 (59%), Gaps = 7/141 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
LSW PRA NF S E+C I+ A+ K +K S + G++V+S RTS+GT+ +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
ED ++ IE ++A+ T +P + E VL Y GQKY+ HYD F +P GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+ + L YL+ VEEGGET+ P
Sbjct: 141 RVVTXLXYLTTVEEGGETVLP 161
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 85/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
+VLSW PRA + +F + +C +I AK L K + + G++ +S RTSSGTF+
Sbjct: 3 EVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSR--VRTSSGTFL 60
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D I++ IE +IA T +P GE VL+Y +KY+ HYD F+ A Q
Sbjct: 61 VRGQDH--IIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQ 118
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVE+GGET+FP
Sbjct: 119 RIATVLMYLSDVEKGGETVFP 139
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 56/112 (50%), Positives = 74/112 (66%), Gaps = 3/112 (2%)
Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
+ PS LA R GE E+ + RTS GTF+ D + L +E KIA T+LP+T+GE +N
Sbjct: 1 MYPSGLAYRPGEKAEAEQQVRTSKGTFLGG--DSSPALRWLEDKIAAVTLLPRTNGEFWN 58
Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFE 217
VL Y+ Q YDSH D+F+P EYGPQ SQR+A+ ++ LSD GGET+F E
Sbjct: 59 VLNYKHSQHYDSHMDSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVFKRE 110
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 88/152 (57%), Gaps = 7/152 (4%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
D +I+ + +SWRP A + F + E+C + A A L S + G +V S
Sbjct: 48 DRAIEFDRIERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSD-- 105
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F+ ED ++ IE +IA T +P++HGE F VLRYE GQ+Y H+D F
Sbjct: 106 IRTSSGMFLLRGEDD--VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQD 163
Query: 186 --AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ + QR+A+ L+YL+DVEEGGET+FP
Sbjct: 164 EFNQKREKGGQRVATVLMYLTDVEEGGETIFP 195
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 87/159 (54%), Gaps = 11/159 (6%)
Query: 64 ETGDASIQSIP------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
E G + P + +SW PR + F S +C ++ AKK+ + S +A +
Sbjct: 28 ENGGGGVAPAPPFNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNES 87
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G++V+S RTSSG F+ +D ++ IE +IA T LPQ + E VLRYE GQKY
Sbjct: 88 GKSVKSE--VRTSSGMFLDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKY 143
Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ H+D F+ + R A+ L+YLS V EGGET+FP
Sbjct: 144 EPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFP 182
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTFIS
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 98 GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+V +GGET+FP
Sbjct: 156 ATILMYLSNVTKGGETVFP 174
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 86/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTFIS
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 98 GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+V +GGET+FP
Sbjct: 156 ATILMYLSNVTKGGETVFP 174
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 83/139 (59%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW PRA + F + +C +I+ AK LK S +A G++ S RTSSG IS
Sbjct: 40 ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMLISK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE +I+ T LP+ +GE VLRYE GQKYD HYD F Q RL
Sbjct: 98 NKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRL 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL++V +GGET+FP
Sbjct: 156 ATVLMYLTNVTKGGETVFP 174
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 86/157 (54%), Gaps = 12/157 (7%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
S + + + LS RP + F S E+C I A ++K S ++L+ + + + RT
Sbjct: 255 SQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSSEWRT 314
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
S F+SA +D+ +L I+H++A T +P+ H E VLRY G+KYDSH+D F+P+ Y
Sbjct: 315 SQSAFLSARDDE--VLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFDPSAY 372
Query: 189 GPQMS----------QRLASFLLYLSDVEEGGETMFP 215
S R A+ YL+DV +GGET+FP
Sbjct: 373 RSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFP 409
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 83/142 (58%), Gaps = 5/142 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I TA LK + GE +E++ RTS+ +
Sbjct: 84 EVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETS--YRTSTEYVV 141
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I+ IE +IA T +P HGE +V+RY +GQ Y+ H D F Q
Sbjct: 142 ERGKDK--IVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQ 199
Query: 195 RLASFLLYLSDVEEGGETMFPF 216
R+A+ L+YLS+VE GGET+FP
Sbjct: 200 RIATMLMYLSNVEGGGETVFPI 221
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 89/150 (59%), Gaps = 3/150 (2%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
D + + +V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S
Sbjct: 52 DLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD-- 109
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F+++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 110 VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSD 169
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ QR+A+ L+YL+D EGGET FP
Sbjct: 170 TFNLKRGGQRVATMLMYLTDGVEGGETHFP 199
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 88/145 (60%), Gaps = 13/145 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSSGTFI 134
+VLS PRA + NF S E+C+ +I AK ++ S + G T + RTSSGTF+
Sbjct: 86 EVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVV--DGVTGQGILNSVRTSSGTFL 143
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFN---PAEYGP 190
+DK I++ +E +IA T +P +GE ++ YE+GQK++ HYD FN GP
Sbjct: 144 ERGKDK--IVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRITNNGGP 201
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 202 ----RVATVLMYLSDVEEGGETVFP 222
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 93/162 (57%), Gaps = 8/162 (4%)
Query: 55 ENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA- 113
+N+ G D + + +V+SW PR + F NF S+E+C + A+ RL+ S +
Sbjct: 45 DNDQG-----AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVD 99
Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
+ G+ V+S RTSSG F+++ E K +++ IE +I+ + +P +GE VLRYE
Sbjct: 100 VATGKGVKSD--VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPN 157
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
Q Y H+D F+ + QR+A+ L+YL+D EGGET FP
Sbjct: 158 QYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP 199
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 86/141 (60%), Gaps = 3/141 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C + A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSD--VRTSSGMFV 118
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 119 NSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQ 178
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YL+D EGGET FP
Sbjct: 179 RVATMLMYLTDGVEGGETHFP 199
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 88/151 (58%), Gaps = 3/151 (1%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
I ++ +V+SW PR + +F S E+C+ + A A+ RL+ S + ++ G+ V+S RT
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSG F++ E I++ IE +IA + +P +GE VLRYE Q Y H+D F
Sbjct: 130 SSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFN 189
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
+ QR+A+ L+YL+D EGGET FP D
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGD 220
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 85/139 (61%), Gaps = 5/139 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PRA + F + +C +++ AK LK S +A GE+ S RTSSGTFI
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFIPK 97
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KI+ T LP+ +GE VLRYE GQKYD+H+D F+ + R+
Sbjct: 98 GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRI 155
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YLS+V +GGET+FP
Sbjct: 156 ATVLMYLSNVTKGGETVFP 174
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 55/132 (41%), Positives = 77/132 (58%), Gaps = 5/132 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 80 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE +IA + +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195
Query: 195 RLASFLLYLSDV 206
R+A+ L+YL+DV
Sbjct: 196 RIATVLMYLTDV 207
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 84/145 (57%), Gaps = 3/145 (2%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+VLSW PRA + NF + + ++ K ++ S++ + +K RTSSG F+
Sbjct: 1 MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSK-VRTSSGMFL 59
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ ED ++E IE +IA+ T +P+ +GE +L Y+ ++Y H+D F+ Q
Sbjct: 60 NRGEDD--VIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGGQ 117
Query: 195 RLASFLLYLSDVEEGGETMFPFEVD 219
R+A+ L+YLSDVE+GGET+FP D
Sbjct: 118 RIATMLMYLSDVEDGGETVFPESSD 142
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 87/149 (58%), Gaps = 15/149 (10%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+SW PRA F +C+ +I+ AK + S + G++++ST RTS+GTF
Sbjct: 86 VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDST--VRTSTGTFFGR 143
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN------PAEYGP 190
ED+ +++ IE +I+ T LP+ +GE +L YE GQKY++H+D F+ P G
Sbjct: 144 EEDE--VIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGG- 200
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVD 219
QR+A+ L+YL+ EEGGET+FP +
Sbjct: 201 ---QRIATVLMYLTTAEEGGETVFPMAAN 226
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 82/134 (61%), Gaps = 5/134 (3%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA + NF S ++C+ ++A +K +L S + Q S RTS+GTFIS D
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQ-TGGSSLSEVRTSTGTFISRKYDD- 322
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ +E +I + +PQ+H EAF +LRYE GQ+Y +H+D F + R+A+ LL
Sbjct: 323 -IIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYF--FHKSGMRNNRIATVLL 379
Query: 202 YLSDVEEGGETMFP 215
YLSDVEEGGET+FP
Sbjct: 380 YLSDVEEGGETVFP 393
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/127 (44%), Positives = 80/127 (62%), Gaps = 5/127 (3%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIE 148
F S E+C +IA K +L+ S +A + G++V S RTSSG F+ +D+T + IE
Sbjct: 3 FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSE--IRTSSGMFLERRQDET--ITRIE 58
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
+IA T LP+ +GE +L YE GQKYD+HYD F+ R+A+ L+YLSDV++
Sbjct: 59 KRIAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKK 118
Query: 209 GGETMFP 215
GGET+FP
Sbjct: 119 GGETVFP 125
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 88/151 (58%), Gaps = 3/151 (1%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
I ++ +V+SW PR + +F S E+C+ + A A+ RL+ S + ++ G+ V+S RT
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
SSG F++ E I++ IE +IA + +P +GE VLRYE Q Y H+D F
Sbjct: 130 SSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFN 189
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
+ QR+A+ L+YL+D EGGET FP D
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGD 220
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 85/141 (60%), Gaps = 5/141 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFI 134
++LS PRA + NF S E+C+ +I AK + S + GE ES+ +RTSSG F+
Sbjct: 113 EILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESS--SRTSSGMFL 170
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+DK I++ IE +IA T +P +GE +V+ Y +GQK + HYD +
Sbjct: 171 DRGKDK--IVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGP 228
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDVEEGGET+FP
Sbjct: 229 RVATVLMYLSDVEEGGETVFP 249
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 67/127 (52%), Gaps = 20/127 (15%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISASEDKTGILELIE 148
F S E+C+ +I AK + S + G+ ES+ RTSSG F+ +DK I++ IE
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESS--ARTSSGRFLERGKDK--IVQNIE 427
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
+IA T +P+ A + + + G GP R+A+ L+YLSDVEE
Sbjct: 428 QRIADITSIPRM---ARDFMLFTAG--------GVVTKNGGP----RVATVLMYLSDVEE 472
Query: 209 GGETMFP 215
GGET+FP
Sbjct: 473 GGETVFP 479
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 81/140 (57%), Gaps = 8/140 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
+SW PRA + NF + E+C ++ AK + L++ ++ T GT SG F+
Sbjct: 5 ISWEPRAFVYHNFLTPEECAHLVNLAKA----TDGGLKRATVADARTGGTFPGSGAFLLR 60
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQR 195
+ D I+ IE +I+ M+P HGE +LRY G+KYD H+D F+ + + QR
Sbjct: 61 NHDP--IVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFYGQR 118
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLSDVE GGET+FP
Sbjct: 119 VATVLMYLSDVESGGETVFP 138
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 82/142 (57%), Gaps = 3/142 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 195 RLASFLLYLSDVEEGGETMFPF 216
R+A+ L+YLSD EGGET FP
Sbjct: 200 RIATMLMYLSDNIEGGETYFPL 221
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 88/150 (58%), Gaps = 23/150 (15%)
Query: 75 FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
+V++ PRA + NF + E+C +I+ AK + S++ R T +
Sbjct: 88 LEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKV--RNALTGLGEESS 145
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
+RTSSGTFI + DK I++ IE +I+ T +PQ +GE V+ YE+GQK++ H+D F
Sbjct: 146 SRTSSGTFIRSGHDK--IVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDV++GGET+FP
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFP 223
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 87/147 (59%), Gaps = 6/147 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
+S +PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169
Query: 197 ASFLLYLSDVEEGGETMFPF-EVDLQI 222
A+ LLYL+DV EGGET+FP EV+ I
Sbjct: 170 ATVLLYLTDVPEGGETVFPLAEVNFFI 196
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 20/149 (13%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
+SW PRA + NF S E+C I+ A+ R+ R+ ++S G RTS
Sbjct: 1 VSWYPRAFHLHNFMSHEECDRILEIARPRV-------RRSTVIDSVTGQSKVDPIRTSEQ 53
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
TF++ I+ +E ++A T LP HGE +L+Y +GQKYD+H+D + G
Sbjct: 54 TFLNRG--TWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASGK 111
Query: 191 QMS----QRLASFLLYLSDVEEGGETMFP 215
Q++ R+A+ LLYLSDVEEGGET FP
Sbjct: 112 QLAAEGGHRVATVLLYLSDVEEGGETAFP 140
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/140 (42%), Positives = 83/140 (59%), Gaps = 5/140 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+S +PR + +F S ++ +I+ A+ LK S +A G++ S RTSSGTF+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169
Query: 197 ASFLLYLSDVEEGGETMFPF 216
A+ LLYL+DV EGGET+FP
Sbjct: 170 ATVLLYLTDVPEGGETVFPL 189
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 79/141 (56%), Gaps = 3/141 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VLSW PR + NF S E+C + A RLK S + G+ ++S RTSSG F+
Sbjct: 76 EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSD--VRTSSGMFL 133
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S E K ++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YL D EGGET FP
Sbjct: 194 RIATMLMYLGDNVEGGETHFP 214
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 86/157 (54%), Gaps = 8/157 (5%)
Query: 63 GETGDASIQSIP---FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE 118
GET + +P VLSW+PR + + E+C +I A+ RL+ S ++ GE
Sbjct: 35 GETYRSGRTDVPDSRMVVLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGE 94
Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
RTSSG F + E+ +++ IE ++A TMLP +GE VLRYE QKYD
Sbjct: 95 G--GVSDIRTSSGMFYTRGEND--VVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDP 150
Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
H+D F+ R+A+ L+YL+ EEGGET+FP
Sbjct: 151 HHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVFP 187
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 55/129 (42%), Positives = 74/129 (57%), Gaps = 5/129 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
+V+SW PRA + NF + E+C+ +I AK + S + ET +S RTSSGTF+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ DK I+ IE KIA T +P HGE VL YE+GQKY+ HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 195 RLASFLLYL 203
R+A+ L+YL
Sbjct: 195 RIATVLMYL 203
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 89/150 (59%), Gaps = 23/150 (15%)
Query: 75 FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
+V++ PRA + NF + E+C+ +I+ AK + S++ R T +
Sbjct: 55 LEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKV--RNAITGLGEESS 112
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
+RTSSGTF+ DK I++ IE +I+ T +P+ +GEA V+ YE+GQK++ H+D F
Sbjct: 113 SRTSSGTFLRKGHDK--IVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDV++GGET+FP
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFP 190
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 86/140 (61%), Gaps = 6/140 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
+SW PR + F S +C+ +IA AK+ R++ S + + GE+V S TRTSSG F+
Sbjct: 40 VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE +IA TM P +GE+ +LRY G+KY+ H+D + + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLS+V+ GGET+FP
Sbjct: 156 IATVLMYLSNVKMGGETVFP 175
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 15/134 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+VLSW PRA + NF S E+C +I+ AK +K S + V+S G RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ +DK I+ IE +IA T +P GE VL YE+GQKY+ H+D F+
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 190 PQMSQRLASFLLYL 203
QR+A+ L+YL
Sbjct: 208 KNGGQRIATLLMYL 221
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 87/162 (53%), Gaps = 18/162 (11%)
Query: 63 GET---GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
GET G+ Q VLSW PR + + E+C ++ ++ RL+ S ++
Sbjct: 52 GETFNAGEHRAQDSRMVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVS------ 105
Query: 120 VESTKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
++T G RTSSG F E T +++ IE+++A TMLP +GE VLRYE
Sbjct: 106 -DATTGAGAVSDIRTSSGMFYERGE--TELVKRIENRLAMWTMLPVENGEGIQVLRYEKT 162
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
QKYD H+D F+ R+A+ L+YL+ EEGGET+FP
Sbjct: 163 QKYDPHHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFP 204
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 86/140 (61%), Gaps = 6/140 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
+SW PR + F S +C+ +IA AK+ R++ S + + GE+V S TRTSSG F+
Sbjct: 40 VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+D+ ++ IE +IA TM P +GE+ +LRY G+KY+ H+D + + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLS+V+ GGET+FP
Sbjct: 156 IATVLMYLSNVKMGGETVFP 175
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 82/142 (57%), Gaps = 3/142 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ E K +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 195 RLASFLLYLSDVEEGGETMFPF 216
R+A+ L+YLSD E GET FP
Sbjct: 200 RIATMLMYLSDNIERGETYFPL 221
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 80/141 (56%), Gaps = 7/141 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
Q + PRA YF NF + + ++ A +LK S + GE V RTS G FI
Sbjct: 2 QQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGV--VDNIRTSYGMFIR 59
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQ 194
+D ++ IE +I+ T LP H E VLRY GQ Y +HYD+ + + E GP+
Sbjct: 60 RLQDP--VVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW-- 115
Query: 195 RLASFLLYLSDVEEGGETMFP 215
RLA+FL+YLSDVEEGGET FP
Sbjct: 116 RLATFLMYLSDVEEGGETAFP 136
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 83/135 (61%), Gaps = 5/135 (3%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 140
P + NF + +C +I A+ +L+ S +A + G++V S RTSSG F++ ++D+
Sbjct: 28 PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSE--IRTSSGMFLNKAQDE 85
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
I+ +E +IA T LP +GEA VL YE+GQKY+ H+D F+ R+A+ L
Sbjct: 86 --IVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVL 143
Query: 201 LYLSDVEEGGETMFP 215
+YLSDV +GGET+FP
Sbjct: 144 MYLSDVVKGGETVFP 158
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 74/134 (55%), Gaps = 15/134 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
+V+SW PRA + NF S E+C+ +I AK + S + V+ST G RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SG F+ DK ++ +IE +IA T +P HGE VL YE+GQKY+ H+D F
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 190 PQMSQRLASFLLYL 203
QR+A+ L+YL
Sbjct: 209 KNGGQRMATLLMYL 222
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 80/140 (57%), Gaps = 12/140 (8%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+SW+PRA F NF + E+ I+A AK +K S + G +VE RTS GTF+
Sbjct: 35 VSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVEDQ--IRTSYGTFLKRL 92
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D I+ +E ++A T L +H E +LRY IGQKY +HYD+ + S R+
Sbjct: 93 QDP--IVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYDSLD------NDSPRVC 144
Query: 198 SFLLYLSDV--EEGGETMFP 215
+ LLYLSDV + GGET FP
Sbjct: 145 TVLLYLSDVPADGGGETAFP 164
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 86/164 (52%), Gaps = 12/164 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + V PR + F N S E+C +IIA A+ R+ S L + E
Sbjct: 92 DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS+G F E TGI+ +E +IAR P HGE VL Y G +Y H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205
Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
P E G P + QR+ + ++YL++ E GG T+FP EV LQ+
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP-EVPLQV 248
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 86/164 (52%), Gaps = 12/164 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + V PR + F N S E+C +IIA A+ R+ S L + E
Sbjct: 92 DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS+G F E TGI+ +E +IAR P HGE VL Y G +Y H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205
Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
P E G P + QR+ + ++YL++ E GG T+FP EV LQ+
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP-EVPLQV 248
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 83/143 (58%), Gaps = 5/143 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES--TKGTRTSSGTFI 134
LSW PR + F S E+C I AK +L+ S +A GE+VES + S +FI
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFI 118
Query: 135 SA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
+ S + I+ +E K+A T LP+ +GE+ +L YE GQKY+ H+D F+
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 178
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLS+VE+GGET+FP
Sbjct: 179 GHRIATVLMYLSNVEKGGETVFP 201
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 83/144 (57%), Gaps = 5/144 (3%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW+PR + +F S ++ +++ A+ LK S +A G++ S RTS GTFIS
Sbjct: 55 ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSE--VRTSYGTFISK 112
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+D I+ IE KIA T LP+ +GE VLRY+ G+K + +D F + R+
Sbjct: 113 GKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVRGGHRV 170
Query: 197 ASFLLYLSDVEEGGETMFPFEVDL 220
A+ LLYL+DV EGGET+FP D
Sbjct: 171 ATVLLYLTDVAEGGETVFPLAKDF 194
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 81/135 (60%), Gaps = 3/135 (2%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA + NF S ++C +I AK R++ S ++ + RTSSG F++ +++
Sbjct: 83 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 141
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
++ IE +IA T +P +GE ++L YE+GQK++ H+D +P + + QR A+ +
Sbjct: 142 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 200
Query: 201 LYLSDVEEGGETMFP 215
+YLS V+EGG T+FP
Sbjct: 201 MYLSGVKEGGATVFP 215
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 81/135 (60%), Gaps = 3/135 (2%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA + NF S ++C +I AK R++ S ++ + RTSSG F++ +++
Sbjct: 74 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 132
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
++ IE +IA T +P +GE ++L YE+GQK++ H+D +P + + QR A+ +
Sbjct: 133 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 191
Query: 201 LYLSDVEEGGETMFP 215
+YLS V+EGG T+FP
Sbjct: 192 MYLSGVKEGGATVFP 206
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 77/139 (55%), Gaps = 11/139 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
PRA + F + +C IA A +L S + GE V S RTS G F ED
Sbjct: 83 PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSA--IRTSDGMFFDRGEDD 140
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA----FNPAEYGPQMSQRL 196
+++ +E +I+ T LP +GE VLRY GQKYD+H DA FN + QR+
Sbjct: 141 --VVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDA--HGGQRV 196
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L+YL+DV++GGET+FP
Sbjct: 197 ATVLMYLNDVDDGGETVFP 215
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 79/145 (54%), Gaps = 18/145 (12%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTSS 130
+ LSW+PR + F S ++C ++ AKK G V + TRTSS
Sbjct: 49 KALSWQPRIFVYKGFLSDDECDHLVTLAKK-----------GTMVAHNRSSYYRQTRTSS 97
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
G F+ +D ++ IE +IA T+LP+ + E + RY+ GQKYD H+D F+ +
Sbjct: 98 GMFLRKRQDP--VVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHT 155
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
+ R A+ L+YLS V++GGET+FP
Sbjct: 156 RGGPRYATVLMYLSTVDKGGETVFP 180
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 88/154 (57%), Gaps = 6/154 (3%)
Query: 67 DASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
DA I I + ++L+W PR + +F S+E+C + A A+ L+ S + Q G+ ++S
Sbjct: 70 DADILRIGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQS 129
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+S + I+ IE +I+ + +P +GE VLRY+ Q Y H+D
Sbjct: 130 D--VRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDY 187
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
F+ + + QR+A+ L+YLSD EGGET FP
Sbjct: 188 FSDSFNLKRGGQRVATMLIYLSDNVEGGETYFPM 221
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 81/145 (55%), Gaps = 18/145 (12%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTSSGTFIS 135
+PRA+ F N S ++C +IA +K +L LR G T T RTSSGTF
Sbjct: 99 KPRAILFGNVLSHDECDQLIALSKTKL------LRSGVVDHQTGNTKLHEHRTSSGTFFH 152
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGP 190
T + +I+ ++A +P++HGE +L Y++G +Y HYD F P A++
Sbjct: 153 --RGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLA 210
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
+ QR A+ ++YL+DV+ GGET+FP
Sbjct: 211 RGGQRTATLIIYLNDVDGGGETIFP 235
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 78/143 (54%), Gaps = 5/143 (3%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 133
+ +S PRA F F + +C +I A ++ S++ GE R+S G +
Sbjct: 68 IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEA--RPDDARSSIGGW 125
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+S +D+ ++ IE + + MLP GE VLRYE GQKYD+H D F+
Sbjct: 126 VSGDDDE--VIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVKNGG 183
Query: 194 QRLASFLLYLSDVEEGGETMFPF 216
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 184 QRVATILMYLSDVEEGGETVFPL 206
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 84/145 (57%), Gaps = 9/145 (6%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
++I +VL P + F N S E+CQ++I A RL+ S+LA ++ ++ RTSS
Sbjct: 19 RTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEISSI------RTSS 72
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
G F E++ ++ IE +I+ LP H E VL YE GQ++ +H+D F P +
Sbjct: 73 GMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGP-NHPS 129
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
+ R+++ ++YL+DVEEGG T FP
Sbjct: 130 SSNNRISTLVVYLNDVEEGGVTTFP 154
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 81/142 (57%), Gaps = 11/142 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C+++I +K ++K S++ + + T RTSSG F+ SE
Sbjct: 42 PLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR-----KTNDIRTSSGAFLEESE--- 93
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I IE +IA +P HGE +L+Y +GQ+Y +HYD F + R+++ ++
Sbjct: 94 -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151
Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
YL+ VEEGGET FP +++L +S
Sbjct: 152 YLNHVEEGGETFFP-KLNLSVS 172
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 68/111 (61%), Gaps = 9/111 (8%)
Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
V+ST G RTSSG F+ DK ++ +IE +IA T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEV 63
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
GQKY+ H+D F QR+A+ L+YLSDVEEGGET+FP + ++ +S
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFP-DANVNVS 113
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 95/182 (52%), Gaps = 17/182 (9%)
Query: 50 EVVEKENESGLPHGETGDASIQ----SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKK 105
E++ K + +PH + ++ + + ++P +F S E+C ++I+ A +
Sbjct: 43 ELISKSSNQKIPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQ 102
Query: 106 RLKPSQLAL-RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
+LK S++ G VE + T TS+G + I++ IE +IA P HGE
Sbjct: 103 KLKASRVVDPEDGSFVEHSARTSTSTGY----HRGEIDIIKTIEARIADLINWPVDHGEG 158
Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP---F 216
VLRYE G +Y H+D F+PA+ ++ QR+ +FL+YLS+V+ GG T FP F
Sbjct: 159 LQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFPNLNF 218
Query: 217 EV 218
E+
Sbjct: 219 EI 220
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 81/142 (57%), Gaps = 11/142 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C+++I +K ++K S++ + + T RTSSG F+ SE
Sbjct: 42 PLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR-----KTNDIRTSSGAFLEESE--- 93
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I IE +IA +P HGE +L+Y +GQ+Y +HYD F + R+++ ++
Sbjct: 94 -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151
Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
YL+ VEEGGET FP +++L +S
Sbjct: 152 YLNHVEEGGETFFP-KLNLSVS 172
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 80/155 (51%), Gaps = 12/155 (7%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
+++ + LS P L F S ++C I TA+ ++ S++ L + RTS
Sbjct: 3 RAVTLETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQ 62
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
FI A +D IL I+++ A +P+ H E VLRY++ +KYDSH D F+PA Y
Sbjct: 63 SAFIRAHDD--AILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTK 120
Query: 191 Q----------MSQRLASFLLYLSDVEEGGETMFP 215
R+A+ YLSDVE+GGET+FP
Sbjct: 121 DKRTLALIRNGHRNRMATVFWYLSDVEKGGETVFP 155
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 23/188 (12%)
Query: 50 EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
EV + +ESG+P + GD + + +S R PR + F N S E+C +I
Sbjct: 74 EVAVQVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 129
Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
IA A+ R++ S L + E+ RTS+G F E++ ++ L+E +IAR P
Sbjct: 130 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGENE--LISLVEQRIARLLNWPL 186
Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
+GE VL Y G +Y HYD F P E G P + QR+ + ++YL++ GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246
Query: 215 PFEVDLQI 222
P +V LQ+
Sbjct: 247 P-DVGLQV 253
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 78/152 (51%), Gaps = 6/152 (3%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +SW PRA + NF ++ +C ++ +R+ S L + RTS G
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRS-LVVDSQTGQSKLDDIRTSYGAAF 551
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM- 192
ED ++ IE +IA T LP HGE +LRY GQKYD+H+D F +P + +
Sbjct: 552 GRGEDP--VIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLV 609
Query: 193 -SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R A+ LLYLS+VE GGET P + +S
Sbjct: 610 DGNRYATVLLYLSEVEAGGETNLPLADPIDMS 641
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 81/141 (57%), Gaps = 3/141 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + NF S ++C + A RL+ S + + G+ V+S RTSSG F+
Sbjct: 83 EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSD--FRTSSGMFL 140
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S E +++ IE +I+ + +P +GE VLRYE Q Y H+D F+ + Q
Sbjct: 141 SHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 200
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLS+ EGGET FP
Sbjct: 201 RIATMLMYLSENIEGGETYFP 221
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 75/143 (52%), Gaps = 2/143 (1%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I +VLSW PR + + E+C ++ A RL S + ES RTS G
Sbjct: 14 ILLKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGM 73
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F ED+ ++ +E +++ +++P HGE VLRYE G++Y H+D F
Sbjct: 74 FFDRGEDE--VVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNG 131
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
RLA+ L+YL++ E GGET+FP
Sbjct: 132 GNRLATILMYLAEPEFGGETVFP 154
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/162 (35%), Positives = 85/162 (52%), Gaps = 9/162 (5%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
DA + + + PR + F N S E+CQ++I A+ R+ S L ++ E
Sbjct: 82 DAGDRRVEVLMAMANPRVVLFGNLLSPEECQAVIEAARTRMARS-LTVQAASGGEEVNKD 140
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTS G F E++ + +E +IAR P +GE VL Y G +Y HYD F+PA
Sbjct: 141 RTSDGMFFQRGENEA--VARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPA 198
Query: 187 EYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
E G P++ QR+A+ ++YL+D GG T FP +V L+I
Sbjct: 199 EPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFP-DVPLEIG 239
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 84/149 (56%), Gaps = 11/149 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
+PR + F N S E+C +II A+ R+ S +A R G E RTS+G F E+
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 178
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
++ +E +IAR P +GE VL Y G +Y HYD F+PAE G P + Q
Sbjct: 179 P--VVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQ 236
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+A+ ++YL+D E+GG T FP +V L+++
Sbjct: 237 RVATIVIYLNDPEKGGGTTFP-DVHLEVA 264
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 82/146 (56%), Gaps = 11/146 (7%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY----DSHYDAFNPAEYGP 190
++ E K +++ IE +I+ + +P +GE VLRYE Q Y D +D FN G
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGG- 198
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
Q +A+ L+YLSD EGGET FP
Sbjct: 199 ---QGIATMLMYLSDNIEGGETYFPL 221
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 78/141 (55%), Gaps = 10/141 (7%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
F+ +SW PRA NFAS E+ +I A+ +L+ S + +GE+V RTS G FI
Sbjct: 32 FEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSRGESV--VDNYRTSYGMFI 89
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ ++ +E ++A T TH E VLRY Q+Y +H+D+ + S
Sbjct: 90 RRHHDE--VVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLD------DDSP 141
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R A+ L+YLSDVE GGET FP
Sbjct: 142 RTATVLIYLSDVESGGETTFP 162
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 82/145 (56%), Gaps = 9/145 (6%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
++I +VL P + F N S E+CQ++I A RL+ S+LA ++ ++ RTSS
Sbjct: 19 RTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEISSI------RTSS 72
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
G F E++ ++ IE +I+ LP H E VL YE GQ++ H+D F P +
Sbjct: 73 GMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGP-NHPS 129
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
+ R+ + ++YL+DVEEGG T FP
Sbjct: 130 SSNNRICTLVVYLNDVEEGGVTTFP 154
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 86/156 (55%), Gaps = 21/156 (13%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
+SWRPRA + F S +C +I+ AK+ K + + GE+ ES T RTSSG F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103
Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
+D+ ++ IE +IA TMLP +GE+ +LRY G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+D + + + R+A+ L+YLS+V+ GGET+FP
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFP 197
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 77/135 (57%), Gaps = 7/135 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PRA F NF + + ++ A +LK S + +GE V RTS G FI D
Sbjct: 55 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSKGEGV--VDNIRTSFGMFIRRLSDP- 111
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQRLASFL 200
I+ IE +I+ T LP H E VLRY GQ Y +HYD+ +++ GP+ RLA+FL
Sbjct: 112 -IIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVGPKW--RLATFL 168
Query: 201 LYLSDVEEGGETMFP 215
+YLSDVEEGGET FP
Sbjct: 169 MYLSDVEEGGETAFP 183
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 95/188 (50%), Gaps = 23/188 (12%)
Query: 50 EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
EV +ESG+P + GD + + +S R PR + F N S E+C +I
Sbjct: 81 EVAVLVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 136
Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
IA A+ R++ S L + E+ RTS+G F E+ ++ L+E +IAR P
Sbjct: 137 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISLVEQRIARLLNWPL 193
Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
+GE VL Y G +Y HYD F P E G P + QR+ + ++YL++ GG T F
Sbjct: 194 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 253
Query: 215 PFEVDLQI 222
P +V LQI
Sbjct: 254 P-DVGLQI 260
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 62/103 (60%), Gaps = 8/103 (7%)
Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
V+ST G RTSSG F+ DK ++ IE +IA T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEV 63
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
GQKY+ H+D F QR+A+ L+YLSDVEEGGET+FP
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFP 106
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 74/147 (50%), Gaps = 6/147 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I + +SW PRA + NF S +C + KR+ S L + RTS G
Sbjct: 6 IRIETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRS-LVVDSKTGQSKLDDIRTSYGA 64
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQ 191
ED ++ +E +IA T LP +GE +LRY GQKYD+H+D F +P +
Sbjct: 65 AFGRGEDP--VIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAY 122
Query: 192 MSQ--RLASFLLYLSDVEEGGETMFPF 216
+ + R A+ LLYLS VE GGET P
Sbjct: 123 LHEGNRYATVLLYLSGVEGGGETNLPL 149
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 55/139 (39%), Positives = 73/139 (52%), Gaps = 3/139 (2%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL R + F + E+C I A+KRL+ S + + G RTS G F
Sbjct: 38 VLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGV-VDTGSGGSVVSDIRTSDGMFFER 96
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
ED I+E +E ++A TM P GE+ VLRY QKYDSH+D F + R
Sbjct: 97 GED--AIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRW 154
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYL++ EEGGET+FP
Sbjct: 155 ATVLLYLTETEEGGETVFP 173
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 77/146 (52%), Gaps = 15/146 (10%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
QVL R + NF + E+C II A+ + S + VE+ G RTS
Sbjct: 64 QVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGV-------VETDSGKSKIDNVRTS 116
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
GTF++ D ++ IE +IA+ T++P +GE VL+YE GQ+Y+ HYD F
Sbjct: 117 KGTFLNRGHDS--VIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGT 174
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
R + L+YL+DVEEGGET FP
Sbjct: 175 ANGGNRYLTVLMYLNDVEEGGETCFP 200
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 76/136 (55%), Gaps = 7/136 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
PRA F NF + + ++ A +LK S + GE V RTS G FI D
Sbjct: 60 HPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGNDGEGV--VDEIRTSYGMFIRRLADP 117
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQRLASF 199
++ IE +I+ T LP H E VLRY GQ Y +HYD+ + + E GP+ RLA+F
Sbjct: 118 --VITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW--RLATF 173
Query: 200 LLYLSDVEEGGETMFP 215
L+YLSDVEEGGET FP
Sbjct: 174 LMYLSDVEEGGETAFP 189
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 86/141 (60%), Gaps = 3/141 (2%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +S PRA + NF + E+ ++ IA A++ ++ S++ + + + T RTSSG ++
Sbjct: 78 IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEV-VNEADGTSKTSDERTSSGGWV 136
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
S + + ++ IE ++A TMLP+ GE V+RYE GQ+Y +H D F+ Q
Sbjct: 137 SGEDSE--VMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQ 194
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R A+ L+YLSDVEEGGET+FP
Sbjct: 195 RAATVLMYLSDVEEGGETVFP 215
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 85/165 (51%), Gaps = 14/165 (8%)
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
P TG++ +++ + LS +P L F + E+C I+ A +K S ++L+ +
Sbjct: 255 PSTITGNS--KNVTIETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKG 312
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
RTS TF++A D IL IE + A T +P TH E VLRY + +KYD+H+
Sbjct: 313 RPASDWRTSQSTFVAAMGDP--ILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHH 370
Query: 181 DAFNPAEY--GPQMSQ--------RLASFLLYLSDVEEGGETMFP 215
D F+P+ Y P Q R A+ YL+DV GGET FP
Sbjct: 371 DFFDPSSYRSDPGTLQLIENGKKNRYATVFWYLTDVARGGETCFP 415
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 84/153 (54%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C+ +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECEELIELSKNKMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 82/144 (56%), Gaps = 10/144 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
+SW+PRA + S E+C+ I+ AK +K S + GE T RTS TF++
Sbjct: 83 ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEI--KTDPIRTSKQTFLAR 140
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQMS-- 193
K ++ +E +++R TMLP +GE +L Y +G+KY +H+D + G Q+S
Sbjct: 141 G--KYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSAD 198
Query: 194 --QRLASFLLYLSDVEEGGETMFP 215
QR+A+ LLYL D EEGGET FP
Sbjct: 199 GGQRVATVLLYLQDTEEGGETAFP 222
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 81/145 (55%), Gaps = 5/145 (3%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSS 130
S+ + +SW PRA + S ++C II A+ + K + L + + V + R +
Sbjct: 50 SLMVERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNK--LRNNK 107
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
+I S D +++ IE +IAR T LP HGE F++++Y GQ Y H D + +
Sbjct: 108 EAYIDGSADD--VIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPR 165
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
++R+A+ ++YLSDV EGGET+FP
Sbjct: 166 LGNERIATMIIYLSDVVEGGETVFP 190
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 80/135 (59%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
P + N S E+C+S+I +K +K S++ A R+ + + RTSSGTF+ +E
Sbjct: 43 PLIVVLENVLSDEECESLIELSKDSMKRSKIGASREVDNI------RTSSGTFLEENE-- 94
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
+ +IE +++ +P HGE ++L+Y GQ+Y +HYD F + + R+++ +
Sbjct: 95 --TVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEHSRAAE-NNRISTLV 151
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 152 MYLNDVEEGGETFFP 166
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 71/118 (60%), Gaps = 5/118 (4%)
Query: 99 IIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATML 157
+I AK + S + Q G++V S RTSSG F+ +DK +++ IE +IA +
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSR--VRTSSGMFLKRGKDK--VIQTIEKRIADFAFI 56
Query: 158 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
P +GE VL YE+GQKY+ HYD F QR+A+ L+YLSDVEEGGET+FP
Sbjct: 57 PVENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFP 114
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKNKMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S + + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 90/182 (49%), Gaps = 14/182 (7%)
Query: 43 RPKLRTLEVVEKENESGLPHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSI 99
RPK V +G P E+ +A S + IP PR + F +F S E+C +
Sbjct: 56 RPKPEPAATVPSPAPAG-PEPESSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDQL 114
Query: 100 IATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP 158
IA + RLK S + + GE E+ RTS G E ++ IE +IA+AT +P
Sbjct: 115 IALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP--LVARIEARIAQATGVP 170
Query: 159 QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETM 213
HGE F VL Y+ G +Y H+D FNP G QR+A+ ++YL+ V GG T
Sbjct: 171 VEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATG 230
Query: 214 FP 215
FP
Sbjct: 231 FP 232
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 10/153 (6%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
G QR+A+ ++YL+ V+ GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 10/153 (6%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
G QR+A+ ++YL+ V+ GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 10/153 (6%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
G QR+A+ ++YL+ V+ GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S + + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 77/135 (57%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ ++ R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 77/135 (57%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ ++ R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 71/134 (52%), Gaps = 5/134 (3%)
Query: 83 RALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKT 141
R + F + E+C I A+KRL+ S + G +V S RTS G F ED
Sbjct: 44 RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSD--IRTSDGMFFERGED-- 99
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
ILE +E ++A TM P GEA VLRY QKYDSH + F E R A+ L
Sbjct: 100 AILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLT 159
Query: 202 YLSDVEEGGETMFP 215
YL+D EEGGET+FP
Sbjct: 160 YLTDTEEGGETVFP 173
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 83/149 (55%), Gaps = 11/149 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
+PR + F N S E+C +II A+ R+ S +A R G E RTS+G F E+
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 167
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
++ +E +IAR P +GE VL Y G +Y HYD F+P E G P + Q
Sbjct: 168 P--MVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQ 225
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+A+ ++YL+D E+GG T FP +V L+++
Sbjct: 226 RVATIVIYLNDPEKGGGTTFP-DVHLEVA 253
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 81/151 (53%), Gaps = 6/151 (3%)
Query: 75 FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
QV+S PRA F S +C ++ A+ + S + + S RTS+G+F
Sbjct: 158 IQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGV-VDASNGGSSFSNIRTSTGSF 216
Query: 134 ISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
+ ++ IE +IA T +P HGE VLRY+IGQ+Y SH+D F G
Sbjct: 217 VPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYF--FHEGGM 274
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
+ R+A+ L+YLSDV++GGET+FP LQ+
Sbjct: 275 KNNRIATVLMYLSDVKDGGETVFPSAESLQV 305
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 81/158 (51%), Gaps = 8/158 (5%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G T A + + ++ RP+ + F + S E+C +I A+ RLK S + + E
Sbjct: 74 GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 132
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G + ED +E ++H+I+ P HGE +L Y G +Y H+D
Sbjct: 133 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 190
Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
F P + G + QR+A+ ++YLSDVE GGET+FP
Sbjct: 191 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP 228
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 62/103 (60%), Gaps = 8/103 (7%)
Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
V+ST G RTSSG F+ DK ++ +IE +I T +P HGE VL YE+
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRITDYTFIPVDHGEGLQVLHYEV 63
Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
GQKY+ H+D F QR+A+ L++LSDVEEGGET+FP
Sbjct: 64 GQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFP 106
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 79/134 (58%), Gaps = 10/134 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P N S E+C +I+ +K R+ S++A Q + RTS+ F+ ED +
Sbjct: 33 PFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQENDI------RTSTSVFLP--EDAS 84
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+++ +E +I++ +P HGE +L Y+IGQ+Y +H+D F+P + + R+++ +L
Sbjct: 85 EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLIE--NPRISTLVL 142
Query: 202 YLSDVEEGGETMFP 215
YL+DVEEGG+T FP
Sbjct: 143 YLNDVEEGGDTYFP 156
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S + + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 81/158 (51%), Gaps = 8/158 (5%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G T A + + ++ RP+ + F + S E+C +I A+ RLK S + + E
Sbjct: 77 GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 135
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G + ED +E ++H+I+ P HGE +L Y G +Y H+D
Sbjct: 136 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 193
Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
F P + G + QR+A+ ++YLSDVE GGET+FP
Sbjct: 194 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP 231
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 73 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188
Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 95/188 (50%), Gaps = 23/188 (12%)
Query: 50 EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
EV + +ESG+P + GD + + +S R PR + F N S E+C +I
Sbjct: 74 EVAVQIHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSNEECDAI 129
Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
IA A+ R++ S L + E+ RTS+G F E+ ++ +E +IAR P
Sbjct: 130 IAAARPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISRVEQRIARLLNWPL 186
Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
+GE VL Y G +Y HYD F P E G P + QR+ + ++YL++ GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246
Query: 215 PFEVDLQI 222
P +V LQ+
Sbjct: 247 P-DVGLQV 253
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDKLIELSKNNMKRSKVG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ T +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 89/166 (53%), Gaps = 14/166 (8%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
+ GD +Q + Q + PR + F N SAE+C+ +IA A+ RL S + R G V +
Sbjct: 78 DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G F E++ I+ +E +IA P GE +LRY G +Y HYD
Sbjct: 135 VD--RTSDGMFFERGENE--IVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDY 190
Query: 183 FNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
F+P+E G + QR+A+ ++YL + E GG T FP +V L+++
Sbjct: 191 FDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP-DVGLEVA 235
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S + + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 64 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 123
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 124 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 179
Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 180 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 223
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 90/166 (54%), Gaps = 14/166 (8%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
+ GD +Q + Q + PR + F N SAE+C+ +IA A+ RL S + R G V +
Sbjct: 78 DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G F E++ I+ +E ++A P +GE +LRY G +Y HYD
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190
Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
F+P E G P + QR+A+ ++YL + E GG T FP +V L+++
Sbjct: 191 FDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP-DVGLEVA 235
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ SE
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/154 (36%), Positives = 84/154 (54%), Gaps = 19/154 (12%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 57 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDV 107
Query: 124 KGTRTSSGTFISASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ SE L L IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 108 NDIRTSSGAFLEDSE-----LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 162
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
F AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 163 F--AEHSRSAANNRISTLVMYLNDVEEGGETYFP 194
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 81/164 (49%), Gaps = 20/164 (12%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
G DA+ + IP + RP+ + F N ++C +I + +L+ Q TV +
Sbjct: 77 GNVIDAADRRIPVLIRCERPQIVVFGNVLDQDECDEMIQRSMHKLE-------QSTTVNA 129
Query: 123 TKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GT RTS GT+ ED ++ IE ++A P +GE VLRY G +Y
Sbjct: 130 ETGTQEVIRHRTSHGTWFQNGED--ALIRRIETRLAALMNCPVENGEGLQVLRYTPGGEY 187
Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
SHYD F P G QR+A+ ++YL+DV GGET+FP
Sbjct: 188 RSHYDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFP 231
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 73 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188
Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 76 PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 135
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 136 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 191
Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 192 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 235
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 77/136 (56%), Gaps = 14/136 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ SE
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 122
Query: 142 GILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASF 199
L L IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++
Sbjct: 123 --LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTL 178
Query: 200 LLYLSDVEEGGETMFP 215
++YL+DVEEGGET FP
Sbjct: 179 VMYLNDVEEGGETYFP 194
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 84/153 (54%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P +HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 81/147 (55%), Gaps = 11/147 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 140
P F N SA +C+++IA A+ RL S + +R G E RTS G F + E++
Sbjct: 90 PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGG--EELNHDRTSHGMFYTRGENE 147
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQR 195
++ IE +IAR P +GE VLRY G +Y HYD F+P E G + QR
Sbjct: 148 --VIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAAILRRGGQR 205
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
+AS ++YL + EGG T+FP ++ L++
Sbjct: 206 VASLIMYLREPGEGGATVFP-DIGLKV 231
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 79/153 (51%), Gaps = 10/153 (6%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +IA + RLK S + + GE E+ R
Sbjct: 83 SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISAR 140
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y G +Y H+D FNP
Sbjct: 141 TSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGR 198
Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
G QR+A+ ++YL+ V+ GG T FP
Sbjct: 199 SGEARQLDVGGQRVATLVIYLNSVQAGGATGFP 231
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 81/147 (55%), Gaps = 9/147 (6%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P + F NF + +C +++ ++ L PS++ Q E K +RTS GT + E
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFE-LKPSRTSGGTHFARGE-- 158
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
T ++ IE +IA +P+ HGE +L Y + +Y HYD F+P + G Q QR
Sbjct: 159 TPLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQR 218
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
+ + ++YLSDVE GG T+FP V L++
Sbjct: 219 VGTLIMYLSDVESGGATVFP-RVGLEV 244
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 79/147 (53%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F N S E+CQ+II A+ R+ S L ++ E RTS G F E T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+++ +E +IAR P +GE VL Y G +Y HYD F+P + G QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ ++YL++ +GG T FP +V L+++
Sbjct: 219 ATLVIYLNNPRKGGGTTFP-DVPLEVA 244
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)
Query: 61 PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
P E +A S + IP PR + F +F S E+C +IA + RLK S + +
Sbjct: 67 PGAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 126
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE E+ RTS G E ++ IE +IA+AT +P HGE F VL Y G +Y
Sbjct: 127 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 182
Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 183 QPHFDYFNPGRGGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 226
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S + + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVISDEECGELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P THGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 78/134 (58%), Gaps = 9/134 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K R+ S++A + RTSS TFI E++
Sbjct: 39 PLIVVLGNVLSDEECDELIRLSKDRINRSKIA------NANVDNMRTSSSTFIE--ENEN 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ IE +I++ +P +GE +L Y++GQ+Y SH+D F+ + + + R+++ ++
Sbjct: 91 IIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFS-SPHNAINNPRISTLVM 149
Query: 202 YLSDVEEGGETMFP 215
YLSDVE+GGET FP
Sbjct: 150 YLSDVEQGGETYFP 163
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K ++K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIKRSKIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 90/166 (54%), Gaps = 14/166 (8%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
+ GD +Q + Q + PR + F N S E+C+ +IA A+ RL S + R G V +
Sbjct: 78 DAGDRQVQVL--QTMR-HPRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLN 134
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G F E++ I+ +E ++A P +GE +LRY G +Y HYD
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190
Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
F+P E G P + QR+A+ ++YL + E+GG T FP +V L+++
Sbjct: 191 FDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFP-DVGLEVA 235
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 83/153 (54%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 89.4 bits (220), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 84/156 (53%), Gaps = 10/156 (6%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
++GD I + +SW PRA + F S +C +I A +L+ S + + + V+
Sbjct: 29 DSGDVGAPWI--ETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDPI 86
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS I +E T ++ IE +IAR T LP++H E VLRY GQKYD+H+D F
Sbjct: 87 ---RTSYSASIGYNE--TDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWF 141
Query: 184 N-PAEYGPQMSQRLASFLLYLSDVE--EGGETMFPF 216
+ G R+A+ L+YLSD+E GGET P
Sbjct: 142 DETETGGTGGGNRMATALMYLSDMEPAAGGETALPL 177
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 88/163 (53%), Gaps = 11/163 (6%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKG 125
DA +++ + +PR + F N SAE+C ++IA A R+ S +A + G E
Sbjct: 97 DAGDRAVNILLAIAKPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGG--EEVND 154
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTS G F E+ +++ IE +IAR P +GE VL Y G +Y HYD F+P
Sbjct: 155 DRTSDGMFFQRGENP--VVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDP 212
Query: 186 AEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
E G P + QR+ + ++YL+ E+GG T FP +V ++++
Sbjct: 213 GEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFP-DVHVEVA 254
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +K S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ T +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 77/134 (57%), Gaps = 12/134 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C ++I +K +LK S++ + E RTSS TF+ E +
Sbjct: 37 PLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNEN-----DMRTSSSTFMEEGESE- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ +E +I++ +P +GE +L Y+IGQ+Y +H+D F A + R+++ ++
Sbjct: 91 -VVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNAS-----NPRISTLVM 144
Query: 202 YLSDVEEGGETMFP 215
YL+DVEEGGET FP
Sbjct: 145 YLNDVEEGGETYFP 158
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 55 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 77/141 (54%), Gaps = 7/141 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
Q + PRA YF NF + + ++ A +LK S + + RTS G FI
Sbjct: 20 QQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGK--GEGVVDDIRTSYGMFIR 77
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQ 194
D ++ IE +I+ T LP H E +LRY GQ Y +HYD+ +++ GP+
Sbjct: 78 RLSDP--VVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVGPKW-- 133
Query: 195 RLASFLLYLSDVEEGGETMFP 215
RLA+FL+YLSDVEEGGET FP
Sbjct: 134 RLATFLMYLSDVEEGGETAFP 154
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 55 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ E IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 9/140 (6%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + F S ++C+ +I TA RLK S+L + RTS G F
Sbjct: 25 VLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNK------VVSDIRTSRGMFFE- 77
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
E+++ + IE +IA+ +P H E VL Y GQ+Y +H+D F P + + R+
Sbjct: 78 -EEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPGSPAAR-NNRI 135
Query: 197 ASFLLYLSDVEEGGETMFPF 216
++ ++YL+DVEEGGET+FP
Sbjct: 136 STLIVYLNDVEEGGETVFPL 155
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 72/141 (51%), Gaps = 10/141 (7%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +SW PR + NF S +C+ I TA +K S + G +V T RTS GTFI
Sbjct: 2 IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTNGSSVLDT--IRTSYGTFI 59
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D ++E + ++A T P + E VLRY GQKY +H D+ S
Sbjct: 60 RRRHDP--VVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLI------DDSP 111
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ LLYL D E GGET FP
Sbjct: 112 RMATVLLYLHDTEYGGETAFP 132
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 78/153 (50%), Gaps = 10/153 (6%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
S + IP PR + F +F S E+C +I + RLK S + + GE E+ R
Sbjct: 84 SDREIPILFAIETPRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGE--ENLISAR 141
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G E ++ IE +IA+AT +P HGE F VL Y+ G +Y H+D FNP
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199
Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
G QR+A+ ++YL+ V GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVPAGGATGFP 232
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S + + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K ++K S + + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 77/147 (52%), Gaps = 19/147 (12%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTS 129
QVL R F NF + E+C I+A AK L+ R G +T G+ RTS
Sbjct: 34 MQVLDAEAR--IFINFLTEEECDHIVALAKPHLE------RSGVVDTATGGSEISDIRTS 85
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH-YDAFNPAEY 188
G F+ D T + IE +IAR T+LP +GE VL Y G+KYD + +D N
Sbjct: 86 KGMFLERGHDDT--VAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYDDYFFDKVNGESN 143
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFP 215
G R A+ L+YL+ VEEGGET+FP
Sbjct: 144 G---GNRYATVLMYLNTVEEGGETVFP 167
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 79/147 (53%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F N S E+CQ+II A+ R+ S L ++ E RTS G F E T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+++ +E +IAR P +GE VL Y G +Y HYD F+P + G QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ ++YL++ +GG T FP +V L+++
Sbjct: 219 ATLVIYLNNPLKGGGTTFP-DVPLEVA 244
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 82/171 (47%), Gaps = 16/171 (9%)
Query: 50 EVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKP 109
E VE N H GD IP PR + F +F S ++C +IA + RLK
Sbjct: 72 EAVENSNAV---HTADGD-----IPILFAIETPRIVLFQHFLSDQECDELIAIGRNRLKR 123
Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
S + + E+ RTS G E ++ IE +IA+A +P HGE F VL
Sbjct: 124 SPV-VNPDTGEENLISARTSQGGMFQVGEHP--LIAKIEARIAQAVGVPVEHGEGFQVLN 180
Query: 170 YEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
Y+ G +Y H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 181 YQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP 231
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 80/147 (54%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + N SAE+C +II +AK +L S L ++ E RTSSG F + +T
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARS-LTVQTATGGEELNADRTSSGMFFT--RGQT 165
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
+ +E +IAR P +GE VL Y G +Y HYD F+P E G P + QR+
Sbjct: 166 PEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRV 225
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ ++YL++ GG T FP +V L+++
Sbjct: 226 ATLVMYLNEPARGGGTTFP-DVGLEVA 251
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVISDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 8/139 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR F + S +C ++IA ++ RL+ S++ +G E TRTS G + + E+
Sbjct: 126 PRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSG-EFVDDTRTSYGAYFNKGENS- 183
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMS--QRL 196
++ I+ +IA T P TH E +L Y +G +Y H+D F P + G P S QR+
Sbjct: 184 -LVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESGGQRI 242
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ ++YL+DVE GG T+FP
Sbjct: 243 ATVVMYLNDVEAGGGTIFP 261
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 IYLNDVEEGGETFFP 162
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 88/168 (52%), Gaps = 9/168 (5%)
Query: 48 TLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL 107
T EV E + + H D + S + +P L+ S+E+C +I+ ++ RL
Sbjct: 67 TYEVPEIGRKGNILHTSDKDVKVLS-----RNEKPFVLHLDQVLSSEECDELISLSRSRL 121
Query: 108 KPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
+PS L + +G E RTS E++ ++E IE +IA T P +GE +
Sbjct: 122 QPS-LVVDRGSGEERAGSGRTSKSMAFRLKENE--LVERIETRIAELTGYPAENGEGLQI 178
Query: 168 LRYEIGQKYDSHYDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
L Y +G++Y H+D F P + QR+ +FL+YL+DVE+GGET+F
Sbjct: 179 LNYGLGEEYKPHFDFFPPHMADASKGGQRVGTFLIYLNDVEDGGETVF 226
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 79/138 (57%), Gaps = 8/138 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
++W+PR + NF + + + +I A ++K S + G++VE RTS GTF+
Sbjct: 4 VAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDN--YRTSYGTFLKRY 61
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D+ I+E IE+++A T +P H E +LRY +GQ+Y H D E G R+A
Sbjct: 62 QDE--IVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDEEAG----VRVA 115
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YL++ + GGET FP
Sbjct: 116 TVLIYLNEPDGGGETAFP 133
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 6/153 (3%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
S +++ + LS PR NF SAE+C+ II TA L PS + L+QG+ + +
Sbjct: 14 SSRTVEVKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTV-LKQGDQSNGEEKVKD 72
Query: 129 SSGTFISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
T +A + K I+ I ++ +P ++ E VL+Y Q Y HYD F+P
Sbjct: 73 EVRTSETAWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPK 132
Query: 187 EYGPQMS---QRLASFLLYLSDVEEGGETMFPF 216
Y + S RL + YL+ VE+GGET+FPF
Sbjct: 133 MYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPF 165
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/154 (34%), Positives = 83/154 (53%), Gaps = 17/154 (11%)
Query: 64 ETGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
+T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 40 KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRD 90
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 91 VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
F AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP 178
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEISKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 79/137 (57%), Gaps = 6/137 (4%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSGTFISASED 139
+P L+ F S E+C +I +++RLKPS + + GE +T RTS G E+
Sbjct: 108 KPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAAT--GRTSKGMSFYLQEN 165
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLAS 198
+ ++ +E +IA P +GE VL Y IG++Y SH+D F ++ P+ QR+ +
Sbjct: 166 E--FIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGT 223
Query: 199 FLLYLSDVEEGGETMFP 215
FL+YL+DV GGET+FP
Sbjct: 224 FLIYLNDVPAGGETVFP 240
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 82/147 (55%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + F N S +C++++ A+ RL S L + E RTS G F + E+
Sbjct: 90 PDLVVFGNLLSDSECEALMEVAQPRLARS-LTVNIKTGGEERNRDRTSQGMFFARGENP- 147
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
+++ +E +IAR P GE VLRY G +Y HYD F+PAE G P + QR+
Sbjct: 148 -LVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRV 206
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ ++YL++ E+GG T+FP ++ LQ++
Sbjct: 207 ATLIMYLNEPEQGGATVFP-DIGLQVT 232
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 75/133 (56%), Gaps = 4/133 (3%)
Query: 83 RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
R +F + E+ I+ +++RL+ S + G + ES RTS G F+ ED
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQ--IRTSFGVFLERGEDP-- 56
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
+++ +E +I+ T++P +GE VLRY+ QKYD+H+D F + R A+ L+Y
Sbjct: 57 VVKGVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMY 116
Query: 203 LSDVEEGGETMFP 215
L D EEGGET+FP
Sbjct: 117 LVDTEEGGETVFP 129
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETYFP 178
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 IYLNDVEEGGETFFP 178
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETYFP 178
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 82/153 (53%), Gaps = 17/153 (11%)
Query: 65 TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
T D IQ I F+ P + N S E+C +I +K +++ S++ +
Sbjct: 25 TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTSSG F+ +E + IE +I+ +P HGE ++L YE+ Q+Y +HYD F
Sbjct: 76 NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131
Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 81/148 (54%), Gaps = 9/148 (6%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F N S E+C ++IA A+ RL S L + E RTS G F +
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARS-LTVATKTGGEEINDDRTSDGMFFQRGQ-- 170
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 171 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQR 230
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ + ++YL+ E+GG T FP +V L+++
Sbjct: 231 VGTLVMYLNTPEKGGGTTFP-DVHLEVA 257
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/154 (34%), Positives = 83/154 (53%), Gaps = 17/154 (11%)
Query: 64 ETGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
+T D IQ I F+ P + N S E+C +I +K +L S++ +
Sbjct: 40 KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 90
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTSSG F+ +E + IE +I+ +P +HGE ++L YE+ Q+Y +HYD
Sbjct: 91 VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146
Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
F AE+ + R+++ ++YL+DVEEGGET FP
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP 178
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 55 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDDE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 80/149 (53%), Gaps = 9/149 (6%)
Query: 80 WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
+ PR + F + S ++C+ +I AK RL S L + E RTSSG F E+
Sbjct: 97 YNPRVVVFGSLLSDQECEQLIGLAKPRLARS-LTVATKTGGEEVNEDRTSSGMFFQRGEN 155
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQ 194
+ ++ IE +IAR P +GE VL Y G +Y HYD F+PAE G + Q
Sbjct: 156 E--LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQ 213
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+ + ++YL + E+GG T FP +V L+++
Sbjct: 214 RVGTLVMYLGEPEKGGGTTFP-DVHLEVA 241
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ +F SA +C+ +I+ A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
T ++ +E +IA T LP +GE +L YE+G + H D A NPA E + Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLII 227
R+ + L+YL+DVE GGETMFP Q W ++
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP-----QTGWSVV 244
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/139 (38%), Positives = 76/139 (54%), Gaps = 16/139 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+S +PR + +F S ++ +I+ A+ LK S +A ++ G T S
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKST-------LS 99
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED I+E IE KIA T LP+ +GE VLRY+ G+KY+ HYD F + R A
Sbjct: 100 EDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYA 157
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ LLYL+DV EGGET+FP
Sbjct: 158 TVLLYLTDVPEGGETVFPL 176
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + RTSSG F+ +
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFL----EDN 105
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 106 KLTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETYFP 178
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 75/153 (49%), Gaps = 22/153 (14%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +SW PR + NF + +C+ I TA +K S + + G +V T RTS GTFI
Sbjct: 2 IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQNGSSV--TDNIRTSYGTFI 59
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFN------------VLRYEIGQKYDSHYDA 182
D ++E I ++A T P + E VLRY IGQKY +H D+
Sbjct: 60 RRRHDP--VIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDS 117
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
S R+A+ LLYL D EEGGET FP
Sbjct: 118 LI------DDSPRMATVLLYLHDTEEGGETAFP 144
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECGELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 77/146 (52%), Gaps = 9/146 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F N S E+C +IIA A R++ S L + E+ RTS+G F E+
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND- 169
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
++ +E +IAR P +GE VL Y G +Y HYD F P E G P + QR+
Sbjct: 170 -LICRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 228
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQI 222
+ ++YL++ GG T FP +V LQ+
Sbjct: 229 GTLVMYLNEPARGGATTFP-DVGLQV 253
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/166 (34%), Positives = 86/166 (51%), Gaps = 14/166 (8%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
+ GD +Q + Q + PR + F N S E+C+ +IA A+ RL S + R G V +
Sbjct: 77 DAGDRRVQVL--QTMR-HPRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLN 133
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS G F E+ I+ +E +IA P GE +LRY G +Y HYD
Sbjct: 134 VD--RTSEGMFFERGEND--IVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDY 189
Query: 183 FNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
F+P E G + QR+A+ ++YL + +GG T FP +V L+++
Sbjct: 190 FDPGEPGTPTILKRGGQRVATLVMYLQEPGQGGATTFP-DVGLEVA 234
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 79/134 (58%), Gaps = 8/134 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K R++ S++A ++E + RTSS TF E++
Sbjct: 38 PLIVILGNVLSDEECDQLIQQSKDRMQRSKVA----NSLEVDE-LRTSSSTFFHEGENE- 91
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ IE +I++ +P HGE +L Y+IGQ+Y +H+D F+ + R+++ ++
Sbjct: 92 -IVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAAS-NPRISTLVM 149
Query: 202 YLSDVEEGGETMFP 215
YL+DVE+GGET FP
Sbjct: 150 YLNDVEQGGETYFP 163
>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 169
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 71/110 (64%), Gaps = 3/110 (2%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
++ E K+ +++ IE +I+ + +P+ +GE VLRYE Q Y H+D F+
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFS 165
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 80/153 (52%), Gaps = 15/153 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ +F SA +C+ +IA A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
T ++ +E +IA T LP +GE +L YE G + H D A NPA E + Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLII 227
R+ + L+YL+DVE GGETMFP Q W ++
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP-----QTGWSVV 244
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +++ S++ + + RTSSG F+ +E
Sbjct: 77 PLIVVLANVLSDEECDELIELSKNKMERSKIG-----SSRNVNDIRTSSGAFLEENE--- 128
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
IE +I+ T +P HGE ++L Y + Q+Y +HYD F AE+ + R+++ +
Sbjct: 129 -FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYF--AEHSRSAANNRISTLV 185
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 186 MYLNDVEEGGETFFP 200
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 81/148 (54%), Gaps = 9/148 (6%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F N S E+C ++IA A+ R+ S L + E RTS G F +
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARS-LTVATKTGGEEINADRTSDGMFFQRGQ-- 171
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + QR
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQR 231
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ + ++YL+ ++GG T FP +V L+++
Sbjct: 232 VGTLVMYLNTPDKGGGTTFP-DVHLEVA 258
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTSSG F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
YL+DVEEGGET FP
Sbjct: 148 XYLNDVEEGGETFFP 162
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 89/177 (50%), Gaps = 14/177 (7%)
Query: 52 VEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ 111
V ++N++ L + GD ++ + +L PR + +F S +C ++IA A+ RL S+
Sbjct: 71 VLQDNDASL--LDLGDRQVRVLVSLLL---PRVVVLGDFLSDAECDALIALAQPRLARSR 125
Query: 112 LALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
+ + RTS + +D + + IE +IAR P HGE VLRY
Sbjct: 126 -TVDNDNGAQIVHAARTSDSMCLQLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYA 182
Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
G +Y HYD F+P G + QRLAS ++YL+ E GG T FP +V L ++
Sbjct: 183 TGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERGGATRFP-DVHLDVA 238
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 86.7 bits (213), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 82/143 (57%), Gaps = 8/143 (5%)
Query: 75 FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGT 132
Q++S PRA + F + E+C +I +K R+ S+ + ET + K RTS+G+
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRM--SKSGVVDAETGGTAKSDIRTSTGS 234
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F+ + +++ +E ++A +MLP H EA VLRYE+ Q+Y +HYD F G
Sbjct: 235 FVGIGAND--LMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYF--FHKGGMA 290
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
+ R+ + L+YL + E GGET+FP
Sbjct: 291 NNRIVTILMYLHEPEFGGETVFP 313
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 55 PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 106
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 107 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 75/134 (55%), Gaps = 9/134 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I+ +K R++ S+++ + S RTSS F +E+
Sbjct: 44 PLIVLLGNVLSEEECDQLISLSKDRIERSKISNK------SVHDLRTSSSMFFDDAEND- 96
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ +E ++++ +P HGE +L Y IGQ+Y +HYD F+ + R+++ ++
Sbjct: 97 -VVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKVN-NPRISTLVM 154
Query: 202 YLSDVEEGGETMFP 215
YL+DVE GGET FP
Sbjct: 155 YLNDVEAGGETYFP 168
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 57/166 (34%), Positives = 81/166 (48%), Gaps = 14/166 (8%)
Query: 55 ENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
EN + + H GD IP PR + F +F S +C +IA + RLK S + +
Sbjct: 75 ENSNAV-HTADGD-----IPILFAIETPRIVLFQHFLSDAECDELIAIGRNRLKRSPV-V 127
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
E+ RTS G E ++ IE +IA+A +P HGE F VL Y+ G
Sbjct: 128 NPDTGEENLISARTSQGGMFQVGEHP--LIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGG 185
Query: 175 KYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
+Y H+D FNP G QR+A+ ++YL+ V+ GG T FP
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP 231
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 86.3 bits (212), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 52 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 103
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 104 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 160
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 161 MYLNDVEEGGETFFP 175
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
+ IE +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 91 -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K +L S++ + RTS G F+ +E T
Sbjct: 39 PLIVVLGNVLSDEECGELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNELTT 93
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
I E +I+ +P +HGE ++L YE+ Q+Y +HYD F AE+ + R+++ +
Sbjct: 94 KI----EKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 83/164 (50%), Gaps = 14/164 (8%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD S++ + + PR + F N S E+C+++IA A R+ S L + E
Sbjct: 116 DCGDRSVEVL---LTMAHPRVVVFGNLLSPEECEALIAAAAPRMARS-LTVATQTGGEEV 171
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F E +++ IE +IA P +GE VL Y G +Y HYD F
Sbjct: 172 NDDRTSHGMFFQRGESP--LVQRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYF 229
Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
+PAE G P + QR+ + ++YL+ E+GG T FP D QI
Sbjct: 230 DPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFP---DAQI 270
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 80/153 (52%), Gaps = 5/153 (3%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
G ++ SI + LS P NF E+C+ I A +KPS ++L + +
Sbjct: 184 GLETLGSIDMKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPDTN 243
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTS+ F+ ++ D +L+ I+ ++ T +P++H E VL+Y+ GQ+Y +H+D +
Sbjct: 244 WRTSTTYFMPSTRDP--LLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDE 301
Query: 186 ---AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
R+ + YLSDVEEGGET+FP
Sbjct: 302 RTMRNMDGGRKNRMITVFWYLSDVEEGGETIFP 334
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 72/145 (49%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS G +
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDGMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 82/149 (55%), Gaps = 11/149 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
+PR + F N S E+C ++IA A R+ S +A + G E RTS G F +
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGG--EEINDDRTSDGMFFQRGQ- 171
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
+ +++ IE +IAR P +GE VL Y G +Y HYD F+PAE G P + Q
Sbjct: 172 -SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQ 230
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R+ + ++YL+ E+GG T FP +V ++++
Sbjct: 231 RVGTLVMYLNTPEKGGGTTFP-DVHVEVA 258
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 72/145 (49%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 216
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFP 241
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 73/135 (54%), Gaps = 10/135 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S +C +I +++RL+ S++ GE S RTSSG F +E T
Sbjct: 36 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-RSVNSIRTSSGVFCEQTETIT 90
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
IE +I++ +P HG+ VLRY GQ+Y HYD F + R+++ ++
Sbjct: 91 ----RIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 145
Query: 202 YLSDVEEGGETMFPF 216
YL+DVE+GGET+FP
Sbjct: 146 YLNDVEQGGETVFPL 160
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 73/139 (52%), Gaps = 10/139 (7%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
V+SW+PRA NF + +C I A+ ++ S + G +V RTS GTFI+
Sbjct: 3 VISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
+ T ++ +E ++A T P + E VLRY +GQ Y H D+ S R+
Sbjct: 61 YQ--TPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLE------NDSPRM 112
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ LLYLS+ E GGET FP
Sbjct: 113 ATVLLYLSEPELGGETAFP 131
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 92/185 (49%), Gaps = 20/185 (10%)
Query: 53 EKENESGLPH---------GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
E N++ LP+ G D +S+ + RP + F S E+C+ +I +
Sbjct: 58 EPSNKAKLPYQYETSLVAAGNNIDLFDRSVKVSLRVSRPDIVVVDEFMSGEECEQLIEQS 117
Query: 104 KKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE 163
+++L PS + Q + R+S GT+ E + ++ ++ +I+ P+ HGE
Sbjct: 118 RRKLTPSAIVDPQTGKFQVI-ADRSSEGTYFQRGE--SPLISRLDRRISELMNWPEDHGE 174
Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEV 218
+L Y +G +Y H+D F E G Q QR+A+ ++YL++V EGGET+FP
Sbjct: 175 GIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATLVMYLNEVTEGGETVFP--- 231
Query: 219 DLQIS 223
D+ IS
Sbjct: 232 DVGIS 236
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 72/123 (58%), Gaps = 4/123 (3%)
Query: 93 AEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
AE+ I+ +++RL+ + + G+ T RTS G F+ ED+ I++ +E +IA
Sbjct: 7 AEEADHIVKVSERRLE--RSGVVGGDGGSETSNIRTSYGVFLDRGEDE--IVKRVEERIA 62
Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGET 212
T++P +GE VLRY+ QKYD+H+D F + R A+ L+YL D EEGGET
Sbjct: 63 AWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGITNGGNRYATVLMYLVDTEEGGET 122
Query: 213 MFP 215
+FP
Sbjct: 123 VFP 125
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 72/131 (54%), Gaps = 3/131 (2%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
L F S ++C +I A RL+ S + + + ++ RTS G F+ D I+
Sbjct: 1 LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSR-NRTSWGAFLRRDHDN--IV 57
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
IE +I+ T +P+ +GE+ V+RY+ GQK++ H D + E R+ + LLYL+
Sbjct: 58 SGIEDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLT 117
Query: 205 DVEEGGETMFP 215
+VE GGET+FP
Sbjct: 118 NVENGGETVFP 128
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 73/135 (54%), Gaps = 10/135 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S +C +I +++RL+ S++ GE S RTSSG F +E T
Sbjct: 41 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-GSVNSIRTSSGVFCEQTETIT 95
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
IE +I++ +P HG+ VLRY GQ+Y HYD F + R+++ ++
Sbjct: 96 ----RIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 150
Query: 202 YLSDVEEGGETMFPF 216
YL+DVE+GGET+FP
Sbjct: 151 YLNDVEQGGETVFPL 165
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 87/162 (53%), Gaps = 18/162 (11%)
Query: 68 ASIQSIPFQV-LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
+S +IP + +S +PR P F + E+C+ +I T+K +LKP E + G
Sbjct: 49 SSTDNIPKLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCN---------EISSGV 99
Query: 127 -RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
R+ G F+ E+ + + I +++ L ++ E V+RY G++ +H+D FNP
Sbjct: 100 HRSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLTES-SEVMQVIRYNPGEETSAHFDYFNP 158
Query: 186 AEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
M QR+ + L+YL+DVEEGGET FP EV++++
Sbjct: 159 LTTNGAMKIGLYGQRICTILMYLADVEEGGETSFP-EVNVKV 199
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 76/140 (54%), Gaps = 8/140 (5%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ LSW PR NF S E+C+ +I +K+L+ S + ST RTS GTF++
Sbjct: 37 ETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNSDESGAVST--ARTSFGTFVT 94
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+T L+ +E ++A+ + +P H E +LRY GQ+Y +H+D G +R
Sbjct: 95 RRLTET--LQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISENGG----KR 148
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L++L + GGET FP
Sbjct: 149 IATVLMFLREPTSGGETSFP 168
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 21/149 (14%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
LSW PRA + + QC++++ + R+ R+ V+S G RTS
Sbjct: 6 LSWYPRAFALRDALTEAQCEAVLRATRARV-------RRSTVVDSVTGESKVDPIRTSKQ 58
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH-----YDAFNPA 186
TF++ E+ ++ I ++ TMLP TH E VL Y +G+KYD+H D+ +
Sbjct: 59 TFLNRDEE---VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGR 115
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
E +R+A+ LLYL + E GGET FP
Sbjct: 116 ELSKDGGKRVATVLLYLEEPEAGGETAFP 144
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 11/135 (8%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N S E+C+ +I +K ++ S++ + RTSS TF+ + + IE
Sbjct: 41 NVVSEEECEELIFLSKNKMNRSKIGSQH-----EVSDIRTSSSTFLPEDD----LTNRIE 91
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
++A+ +P HGE ++L Y+ GQ+Y +HYD F + R+++ +LYL+DVEE
Sbjct: 92 KRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLNDVEE 150
Query: 209 GGETMFPFEVDLQIS 223
GGET FP ++L IS
Sbjct: 151 GGETYFP-HMNLSIS 164
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 77/147 (52%), Gaps = 12/147 (8%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + F + ++C+ +I A RL+ S+L + RTS G F
Sbjct: 25 VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNK------VVSEIRTSRGMFFE- 77
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QR 195
E++ + IE +I+ +P H E VL Y GQ+Y +HYD F P P S R
Sbjct: 78 -EEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN--SPSASNNR 134
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
+++ ++YL+DVE GGET+FP +DL++
Sbjct: 135 ISTLIIYLNDVEAGGETVFPL-LDLEV 160
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 26 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDSMCLR 78
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 79 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 136
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 137 AGGQRVASLVMYLNTPERGGATRFP 161
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 78/151 (51%), Gaps = 17/151 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
LS +PR P F + E+C +I T+K +L+P E + G R+ G F+
Sbjct: 28 LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
E++ + + I +K+ + + E ++RY G++ +HYD FNP M
Sbjct: 79 GEEEHPVTKNIFNKMKNFVNISDS-CEVMQIIRYNPGEETSAHYDYFNPLTTNGSMKIGL 137
Query: 194 --QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
QR+ + L+YL DVEEGGET FP EV +++
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFP-EVGIKV 167
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 81/156 (51%), Gaps = 27/156 (17%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
+SW PRA + N + +C ++ A+ R +R+ V+ST G RTS
Sbjct: 4 ISWHPRAFHLHNIMTDAECDEVLELARTR-------VRRSTVVDSTTGESKVDPIRTSEQ 56
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFN-----VLRYEIGQKYDSHYDAFN-P 185
F++ I+ +IE ++ R TMLP +GE VL+Y GQKYD+H+D
Sbjct: 57 CFLNRGH--FPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELD 114
Query: 186 AEYGPQMS----QRLASFLLYLSDVEE--GGETMFP 215
G Q++ R+A+ LLYLSDV++ GGET FP
Sbjct: 115 TASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFP 150
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++IA A+ L S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDEECDALIALARPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 75/152 (49%), Gaps = 21/152 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++A A+ RL S ETV+++ G RTS G F
Sbjct: 92 PRVVVFGGLLSDEECDELVALARPRLARS-------ETVDNSTGGSEVNAARTSDGMFFE 144
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----- 190
E ++E IE +IA P GE VLRY G +Y H+D F+PA G
Sbjct: 145 RGEKP--LIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILR 202
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
+ QR+ + ++YL+ GG T FP EV L++
Sbjct: 203 RGGQRVGTVVMYLNTPAGGGATTFP-EVGLEV 233
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 15/153 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A++ NF SA++C+ +IA A+ RL S + G V +T R+S G F E
Sbjct: 101 RPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATH--RSSHGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
T ++ IE +IA T P +GE +L YE G + H D N A E + Q
Sbjct: 158 -TPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLII 227
R+ + L+YL DVE GGET+FP Q+ W I+
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP-----QVGWSIV 244
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 76/142 (53%), Gaps = 10/142 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 140
PR +FP F S E+C + TA+ L+PS L G + RTS G I + +
Sbjct: 140 PRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHP--IRTSDGGAIGPTNEN 197
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
++ I +IA AT GE+ VLRY GQ+Y H D AE +QR+A+F+
Sbjct: 198 L-VVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAE-----NQRIATFI 251
Query: 201 LYLSDVEEGGETMFPFEVDLQI 222
+YL+D EGGET FP +++Q+
Sbjct: 252 VYLNDGFEGGETHFPL-LNIQV 272
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 84/156 (53%), Gaps = 22/156 (14%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
+SWRPRA + F S +C +I+ AK+ K + + GE+ ES T RTSSG F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103
Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
+D+ ++ IE +IA TMLP +GE+ +LRY G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161
Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+D + + + R+A+ L+YLS+V+ G+++ P
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKM-GDSLLP 196
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 11/157 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + + PR + F N +AE+C ++IA A++++K S + +
Sbjct: 81 QAGDRQVQVLASLL---HPRVIVFGNLLAAEECDALIALARRQIKRSPV-FDPDTGQDQQ 136
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS G F + + +E +IA P +GE VLRY G +Y+ HYD F
Sbjct: 137 HQARTSEGMFFGRGANP--LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYF 194
Query: 184 NPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFP 215
+PA G +++ QR+AS ++YL+ +GG T FP
Sbjct: 195 DPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFP 231
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 79/142 (55%), Gaps = 11/142 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ N S E+C+++I +K ++ S++ + RTSS F+ E
Sbjct: 34 PKIAILGNVVSEEECEALIRLSKDKVNRSKIG-----SDHDVSDIRTSSSAFLPDDE--- 85
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+ IE ++A+ +P HGE ++L Y+ GQ+Y +H+D F + + R+++ +L
Sbjct: 86 -LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAK-NPRISTLVL 143
Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
YL+DVEEGGET FP E++L +S
Sbjct: 144 YLNDVEEGGETYFP-EMNLTVS 164
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S E+C ++IA A+ L S+ TV++ G RTS +
Sbjct: 96 PRVVVLGGFLSDEECDALIALAQPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 75/134 (55%), Gaps = 9/134 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I A ++K S++ G T E + RTSS FI +D+
Sbjct: 33 PLIVVLGNVLSDEECDELIQLAGDKVKRSKI----GTTREENE-LRTSSSMFIE--DDEN 85
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ ++ +I+ +P HGE +LRY GQ+Y +H+D F+ + R+++ ++
Sbjct: 86 LIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSD--SKITNNRISTLVM 143
Query: 202 YLSDVEEGGETMFP 215
YL+DVE+GGET FP
Sbjct: 144 YLNDVEQGGETFFP 157
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 71/140 (50%), Gaps = 10/140 (7%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
V+SW PRA NF + ++ I A+ ++ S + G +V RTS GTFI+
Sbjct: 3 VISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
T ++ +E ++A T +P + E VLRY GQ Y H D+ S RL
Sbjct: 61 Y--ATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLE------NDSPRL 112
Query: 197 ASFLLYLSDVEEGGETMFPF 216
A+ LLYLSD E GGET FP
Sbjct: 113 ATVLLYLSDPELGGETAFPL 132
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 168
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 169 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 226
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 227 AGGQRVASLVMYLNTPERGGATRFP 251
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%), Gaps = 4/99 (4%)
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
GE+ +S RTSSG F++ +D I+ +E K+A T LP+ +GEA +L YE GQKY
Sbjct: 9 GESEDSE--VRTSSGMFLTKRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKY 64
Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
D H+D F + R+A+ L+YLS+V +GGET+FP
Sbjct: 65 DPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFP 103
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 77/153 (50%), Gaps = 8/153 (5%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
A + + +V RP+ + F + S ++C +I A+ RLK S + E R
Sbjct: 102 AHDRDVAVRVRFERPQVIVFDDVLSRDECDELIERARHRLKRS-TTVNPESGREDVIQLR 160
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
TS G + ED +E ++ +I+ P HGE +L Y G +Y H+D F P++
Sbjct: 161 TSEGFWFQRCED--AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQ 218
Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
G + QR+A+ ++YLSDV GGET+FP
Sbjct: 219 SGSVLHTSRGGQRVATLIVYLSDVAGGGETVFP 251
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 71/134 (52%), Gaps = 10/134 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P L N S +C +I A R++ +++ + RTSS F SE++
Sbjct: 32 PLILILDNVLSWAECDLLIDLASARMQRAKIG-----SSHDVSEVRTSSSMFFEESENEC 86
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+ +E ++A +P +H E VLRY+ G++Y H+D F G M+ R+++ ++
Sbjct: 87 --IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQ---GSSMNNRISTLVM 141
Query: 202 YLSDVEEGGETMFP 215
YL+DVEEGGET FP
Sbjct: 142 YLNDVEEGGETYFP 155
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 17/151 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
LS PR P F + E+C+ +I T+K +L+P E + G R+ G F+
Sbjct: 28 LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
E+ I + I +K+ + ++ E V+RY G++ SH+D FNP M
Sbjct: 79 GEEDHQITKNIFNKMKSFVNISES-CEVMQVIRYNQGEETSSHFDYFNPLTTNGSMKIGL 137
Query: 194 --QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
QR+ + L+YL DVEEGGET FP EV +++
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFP-EVGIKV 167
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 13/163 (7%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+ LP G+ D +P PR F ++C+++IA ++ RL S + +
Sbjct: 108 NALPAGDGRD-----VPVLFAIESPRIALFQRLLMPDECEALIALSRGRLARSPV-VNPD 161
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E+ RTS G E ++E +E +IA T +P HGE +L Y+ G +Y
Sbjct: 162 TGDENLIDARTSMGAMFQVGEHP--LIERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQ 219
Query: 178 SHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
HYD FNP G QR+A+ ++YL+DV GG T FP
Sbjct: 220 PHYDFFNPQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP 262
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 21/153 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
PR + F F S ++C +++A A+ RL S ETV++ G RTS G F
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARS-------ETVDNDTGGSEVNEARTSQGMFFM 152
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 192
E + ++ IE +IA P +GE VL Y G +Y HYD F+PA+ G P +
Sbjct: 153 RGEGE--LISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILK 210
Query: 193 --SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
QR+ + ++YL+ E GG T FP +V+L+++
Sbjct: 211 RGGQRVGTLVMYLNTPERGGGTTFP-DVNLEVA 242
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 72/135 (53%), Gaps = 10/135 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C +I +K+RL+ S++ GE S RTSSG F +E
Sbjct: 36 PLIVILGNVLSNEECDELIEHSKERLQRSKI----GEE-RSVNQIRTSSGVFCEENE--- 87
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+ IE +I++ +P HG+ VL Y GQ+Y H+D F + R+++ ++
Sbjct: 88 -TVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRA-SANNRISTLVM 145
Query: 202 YLSDVEEGGETMFPF 216
YL+DVEEGGET FP
Sbjct: 146 YLNDVEEGGETTFPM 160
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 76/148 (51%), Gaps = 9/148 (6%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEDVIRNRTSEGIWYQRGEDA 175
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 176 --FIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQR 233
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
+A+ ++YL+DV +GGET+FP E L ++
Sbjct: 234 VATLVIYLNDVPDGGETIFP-EAGLSVA 260
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 74/146 (50%), Gaps = 20/146 (13%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFI 134
RP+ + F + SA +C +I ++ RLK S TV G RTS G +
Sbjct: 96 RPQLVVFADVLSAAECAELIERSRHRLKRST-------TVNPLTGREDVIRNRTSEGVWY 148
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP---- 190
ED+ ++ +E +IA T P +GE VL Y +Y H+D F P + G
Sbjct: 149 RRGEDQ--LIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHT 206
Query: 191 -QMSQRLASFLLYLSDVEEGGETMFP 215
Q QR+A+ ++YL+DV +GGET+FP
Sbjct: 207 TQGGQRVATLIIYLNDVADGGETVFP 232
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 72/127 (56%), Gaps = 17/127 (13%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
+ E+C+ +I AK + S + G++V+++ RTSSGTFI+ DK IL IE
Sbjct: 12 ITTKEECEHLINIAKPSMHKSTVDDETGKSVDNS--ARTSSGTFINRGHDK--ILRNIEQ 67
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
+IA T +P +GE+ N+L YE+GQKY+ H D F +++ + E+G
Sbjct: 68 RIADFTFIPVENGESVNILHYEVGQKYEPHPDFFT-----DEINTKNGG--------EQG 114
Query: 210 GETMFPF 216
GET+FPF
Sbjct: 115 GETVFPF 121
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 57/90 (63%), Gaps = 2/90 (2%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F+ +D+ ++ IE +IA T LP +GE+ +L Y+ G+KY+ HYD F+
Sbjct: 15 VRTSSGMFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHD 72
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
R+A+ L+YLSDV +GGET+FP
Sbjct: 73 KNNQALGGHRIATVLMYLSDVGKGGETIFP 102
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 80/162 (49%), Gaps = 12/162 (7%)
Query: 67 DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
D IP V + P S +C +I A+ RL+ + G+ +
Sbjct: 80 DGRTIGIPLSVDA--PALRVLDGLLSERECADLIELARPRLQRALTVDSDGK--QQIDQR 135
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTS G F A E T ++ IE ++A+ +P +HGE +L Y GQ+Y+ HYD F+PA
Sbjct: 136 RTSEGMFFRAGE--TPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPA 193
Query: 187 EYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
G + QR+AS ++YL+ E GG T FP E+ L ++
Sbjct: 194 LPGYDKLTARAGQRIASVVMYLNTPERGGGTAFP-EIGLTVT 234
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGEDA 168
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 169 --FIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ ++YL+DV +GGET+FP
Sbjct: 227 VATLVVYLNDVADGGETIFP 246
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 75/134 (55%), Gaps = 8/134 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C+ +I ++ +LK S++ + RTSS F E++
Sbjct: 39 PLIVILGNVLSDEECEGLIRMSEDKLKRSKIG-----NTRTVDDIRTSSSMFFEEGENE- 92
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ IE ++++ +P HGE +L Y IGQ+Y +H+D F+ + + R+++ ++
Sbjct: 93 -LVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSR-AASNPRISTLVM 150
Query: 202 YLSDVEEGGETMFP 215
YL+DVEEGGET FP
Sbjct: 151 YLNDVEEGGETYFP 164
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S E+C +I ++ RLK S + + E RTS G + ED
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGEDA 168
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
+E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 169 --FIERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ ++YL+DV +GGET+FP
Sbjct: 227 VATLVVYLNDVADGGETIFP 246
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 81/165 (49%), Gaps = 12/165 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + S ++C ++IA A+ +L S+ + + E
Sbjct: 81 DLGDRQVQVLVSLML---PRVVVLGGLLSDDECDALIALARPQLARSR-TVDNRDGSEIV 136
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IAR P HGE VLRY G +Y HYD F
Sbjct: 137 HAARTSHSMALQPGQD--ALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 194
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
P G + QR+AS ++YL+ E GG T FP +V L ++
Sbjct: 195 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP-DVHLDVA 238
>gi|90022913|ref|YP_528740.1| hypothetical protein Sde_3273 [Saccharophagus degradans 2-40]
gi|89952513|gb|ABD82528.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 478
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 92/196 (46%), Gaps = 26/196 (13%)
Query: 39 VPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPN--------- 89
V + RP + +++ + P + ++Q+ PR ++ PN
Sbjct: 43 VMNFRPTVPVDQLINPFKRAASPQQTVNNQNLQTAANHGAPTDPRKIFIPNALKLNSDKL 102
Query: 90 -------FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
F + E+C+ IIA + +L+PS+L+ + ES K RTS + +D
Sbjct: 103 EMYALGEFLTTEECERIIANIRSKLRPSELS-----SQESDKTYRTSRTCDLGTIDDP-- 155
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQRLASF 199
+ ++ +I + + ++ E YE+GQ++ +H D F E +G M QR +
Sbjct: 156 FIHYVDSRICKLVGIDPSYSEVIQGQLYEVGQEFKAHTDYFEIKEMPEHGAVMGQRTYTV 215
Query: 200 LLYLSDVEEGGETMFP 215
++YL+DVEEGGET FP
Sbjct: 216 MIYLNDVEEGGETDFP 231
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 71/140 (50%), Gaps = 9/140 (6%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ LSW PRA + NF S E+ + ++ + R+ S + Q V RTS GTFI
Sbjct: 68 IEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSDI---RTSFGTFI 124
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
D+ +LE IE + A + +P H E +LRY GQKY H D G +
Sbjct: 125 PKKYDE--VLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISENGG----K 178
Query: 195 RLASFLLYLSDVEEGGETMF 214
R+A+ L++L + EGGET F
Sbjct: 179 RIATILMFLHEPTEGGETSF 198
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 73/147 (49%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR L N +C +++A A+ RL+ S + + E+ RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+L+ IE +IA T P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ ++YL+ GG T FP + L+++
Sbjct: 218 ATMVIYLNSPASGGATAFP-RIGLEVA 243
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 75/155 (48%), Gaps = 14/155 (9%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
+ + LS P F ++ I+A + + LKPS + L G + RTS+
Sbjct: 105 DVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTT 164
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
F+S+S K L+ I+ ++A T +P H E VLRYE QKYD H D F P E+
Sbjct: 165 YFLSSS--KHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYF-PVEHHKN 221
Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFP 215
P + R+ + Y+SDV +GG T+FP
Sbjct: 222 SPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFP 256
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 84/156 (53%), Gaps = 13/156 (8%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTK 124
GDA + I + LSW PRA N + ++I+A A+ R+ + + + G++V
Sbjct: 2 GDARV--IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSV--VN 57
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS TF+S ++ ++ + +++ T LP H E VL Y G+KYD+H D
Sbjct: 58 PIRTSKQTFLSRNDP---VVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGE 114
Query: 185 PA-EYGPQMS----QRLASFLLYLSDVEEGGETMFP 215
+ G Q+S +R+A+ LLYL + EEGGET FP
Sbjct: 115 EGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFP 150
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 73/147 (49%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR L N +C +++A A+ RL+ S + + E+ RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+L+ IE +IA T P HGE F VL Y+ G +Y H+D FNP G QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ ++YL+ GG T FP + L+++
Sbjct: 218 ATMVIYLNSPASGGATAFP-RIGLEVA 243
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 84/170 (49%), Gaps = 13/170 (7%)
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
PH G+ S++ + + + P F S ++C ++I AK RL+ ++ G
Sbjct: 77 PHAVIGERSVRVM---LAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDG--A 131
Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
+ RTS G F E ++E IE +IA +P HGE VL Y GQ+Y+ H
Sbjct: 132 QQIDEHRTSDGMFFGLGEQP--LIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQ 189
Query: 181 DAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQISWL 225
D F+P + G QR+AS ++YL+ + GG T FP E+ L ++ L
Sbjct: 190 DWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFP-EIGLTVTAL 238
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
P + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFP 241
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 72/147 (48%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F S +C ++IA A+ RL S+ + RTS + +D
Sbjct: 96 PRVMVLGGFLSDAECDAMIALAQPRLARSR-TVDNANGAHVVHAARTSDSMCLQLGQD-- 152
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+ + IE +IAR P +GE VLRY G +Y HYD F+P G + QR+
Sbjct: 153 ALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRV 212
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
AS ++YL+ + GG T FP +V L I+
Sbjct: 213 ASLVMYLNTPDRGGATRFP-DVHLDIA 238
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 20/145 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
P + F S +C ++IA A+ RL S+ TV++ G RTS +
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
+D + + IE +IAR P HGE VLRY G +Y HYD F+P G +
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216
Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
QR+AS ++YL+ E GG T FP
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFP 241
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 71/139 (51%), Gaps = 8/139 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + N S ++C +I A ++ R S + + +RTS I E T
Sbjct: 92 PRIVVLGNVLSDDECDAIAAMSRTRFARST-TIDNASGINRFDDSRTSESAHIQRGE--T 148
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQRL 196
++ I+ ++A + P HGE + +Y+ G +Y H+D F+PA G + QRL
Sbjct: 149 ELIARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRL 208
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ +LYL+DVEEGG T FP
Sbjct: 209 ATIILYLTDVEEGGGTSFP 227
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 54/83 (65%)
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
+S + K I+ IE ++A T LP+ + E+ VLRYE GQKYD+H+D F+
Sbjct: 10 MLSPPQPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLG 69
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YL+DV +GGET+FP
Sbjct: 70 GQRVATVLMYLTDVNKGGETVFP 92
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 71/141 (50%), Gaps = 10/141 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ F SA +C+ +I A+ RL S + G + G R+S G F E
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNI--VAGHRSSDGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQ 194
T ++ IE +IA T P +GE +L YE G + H D P AE + Q
Sbjct: 158 -TPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+ + L+YL+DVE GGET+FP
Sbjct: 217 RVGTLLMYLNDVESGGETLFP 237
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 79.7 bits (195), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 76/141 (53%), Gaps = 16/141 (11%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ LSW PRA + NF + ++C+ +IA +K+L+ S + +G+ + RTS GTFI
Sbjct: 91 WTTLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGD-VHSARTSFGTFI 149
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+ T L +E ++A + +P H E +LRYE GQ+Y + +
Sbjct: 150 --TRRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGNG-------------EK 194
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A+ L++L + E GGET FP
Sbjct: 195 RIATVLMFLREPEFGGETHFP 215
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 79.7 bits (195), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 81/168 (48%), Gaps = 24/168 (14%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
GD + + L PR + F S ++C ++A A+ RL LR ETV+++ G
Sbjct: 24 GDREVHVLATMAL---PRVVVFGGLLSEQECDELVALAQPRL------LRS-ETVDNSTG 73
Query: 126 ------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
RTS G F E T ++E IE +IA P GE VL Y G +Y H
Sbjct: 74 GSEVNAARTSDGMFFERGE--TPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPH 131
Query: 180 YDAFNPAEYGP-----QMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
+D F+PA G + QR+ + ++YL+ GG T FP EV L++
Sbjct: 132 HDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFP-EVGLEV 178
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 78/152 (51%), Gaps = 15/152 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A++ +F SA++C+ +IA A+ RL S + G V G R+S G F E
Sbjct: 101 RPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNV--VAGHRSSHGMFFRLGE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
T ++ IE +IA T P +GE +L YE G + H D N A E + Q
Sbjct: 158 -TPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLI 226
R+ + L+YL DVE GGET+FP QI W +
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP-----QIGWSV 243
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 84/171 (49%), Gaps = 9/171 (5%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+G P D + +P P+ F S ++C +++A ++ RL S + +
Sbjct: 80 AGAPTVYAADGGDRRVPILFRLASPQVQLFQQLLSDDECDALVALSRGRLARSPV-VNPD 138
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E+ RTS G +E ++ IE +IA T +P HGE +L Y+ G +Y
Sbjct: 139 TGDENLIDARTSMGAMFQVAEHA--LIARIEARIAAVTGVPADHGEGLQILNYKPGGEYQ 196
Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
H+D FNP G Q+S QR+A+ ++YL+ E GG T FP V L+++
Sbjct: 197 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-RVGLEVA 246
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 73/140 (52%), Gaps = 8/140 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F N S ++C +I ++ RLK S + + E RTS G + ED
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTI-VDPATGREDVIRNRTSEGIWYQRGEDA 181
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
++E ++ +IA P +GE +L Y +Y H+D F P + G + QR
Sbjct: 182 --LIERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQR 239
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ ++YL+DV +GGET+FP
Sbjct: 240 VATLVVYLNDVPDGGETIFP 259
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 78/151 (51%), Gaps = 17/151 (11%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
+S PR P+F S +C+ +I +K +L+P E + G R+ G F+
Sbjct: 29 MSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCN---------EISSGVHRSGWGLFMKE 79
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
E+ +++ I ++ L + + E V+RY G++ +HYD FNP M
Sbjct: 80 GEEDHDVVKKIFQRMKMLVNLTE-NCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKIGL 138
Query: 194 --QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
QR+ + L+YLS+VEEGGET FP EV +++
Sbjct: 139 YGQRVCTILMYLSEVEEGGETSFP-EVGVKV 168
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 76/138 (55%), Gaps = 8/138 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
++W+PR + NF + + + +I A ++K S + G++VE + T ++G +
Sbjct: 4 VAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDSYRTLYTAG--VRRY 61
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+D ++E IE+++A T + H E +LRY IGQ+Y H D E G R+A
Sbjct: 62 QDD--VVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDDEAG----VRVA 115
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YL++ E GGET FP
Sbjct: 116 TVLIYLNEPEAGGETAFP 133
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 77/153 (50%), Gaps = 10/153 (6%)
Query: 77 VLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+LS R PR + F S E+C +++ A+ RL S+ + G RTS G F
Sbjct: 102 LLSMRNPRVMVFGGLLSDEECDAMVDLARPRLARSE-TVHNGSGGSEVNAARTSDGMFFD 160
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----P 190
E + IE +IA P +GE VLRY G +Y +H+D F+PA+ G
Sbjct: 161 RGEFP--LCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILK 218
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ QR+ + ++YL+ GG T FP +V L+++
Sbjct: 219 RGGQRVGTVVMYLNHPIRGGGTAFP-DVGLEVA 250
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 87/184 (47%), Gaps = 12/184 (6%)
Query: 38 DVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQ 97
+VP T E ++ S G T D + + LS PRA F F + E+C
Sbjct: 46 EVPRADATTTTREAPRADDASASSLGPTRDIGVGDARVEKLSDSPRAYLFREFLTKEECA 105
Query: 98 SIIATAKKRLKPSQLALRQ---GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
+I + LK S + GE RTS+G F+ D ++ +E ++
Sbjct: 106 HLIEISTPHLKRSTVVGDDALLGEADGRRSDYRTSTGAFLPKLYDD--VVTRVERRVEAF 163
Query: 155 TMLP---QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGE 211
+ LP Q +A ++LRYE+GQ+Y H D F G +R+A+ L++L++ EEGGE
Sbjct: 164 SRLPFENQEQLQARSLLRYELGQEYRDHVDGFATENGG----KRVATVLMFLAEPEEGGE 219
Query: 212 TMFP 215
T FP
Sbjct: 220 TAFP 223
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+E ++ +I+ P +GE +LRY +Y H+D F P + G Q QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQR 218
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ ++YL+DV +GGET+FP
Sbjct: 219 VATLVIYLNDVPDGGETIFP 238
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 84/171 (49%), Gaps = 9/171 (5%)
Query: 58 SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
+G P D + +P P+ F + ++C +++A ++ RL S + +
Sbjct: 81 AGTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPD 139
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E+ RTS G +E ++ IE +IA T +P HGE +L Y+ G +Y
Sbjct: 140 TGDENLIDARTSMGAMFQVAEHA--LIARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQ 197
Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
H+D FNP G Q+S QR+A+ ++YL+ E GG T FP V L+++
Sbjct: 198 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-RVGLEVA 247
>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
Length = 212
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 8/134 (5%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
++F S E+C +IA KPS++ + T G R+ T S S DK I+
Sbjct: 15 VHFSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRS---TVASPSADKYPII 71
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFLL 201
+ + +I+ + + + E VL Y G +YD HYD+F E PQ+ R+ + LL
Sbjct: 72 KAVRRRISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSF--LEGSPQLENGGNRMLTVLL 129
Query: 202 YLSDVEEGGETMFP 215
YL+DVE+GG T FP
Sbjct: 130 YLNDVEQGGWTQFP 143
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 53/83 (63%), Gaps = 2/83 (2%)
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F+ +DK I++ IE +IA T +P +GE VL Y +G+KY+ HYD F
Sbjct: 2 FLKRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNG 59
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSDVEEGGET+FP
Sbjct: 60 GQRVATVLMYLSDVEEGGETVFP 82
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 83/170 (48%), Gaps = 9/170 (5%)
Query: 59 GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGE 118
G P D + +P P+ F + ++C +++A ++ RL S + +
Sbjct: 82 GTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDT 140
Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
E+ RTS G +E ++ IE +IA T +P HGE +L Y+ G +Y
Sbjct: 141 GDENLIDARTSMGAMFQVAEHP--LITRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQP 198
Query: 179 HYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
H+D FNP G Q+S QR+A+ ++YL+ E GG T FP V L+++
Sbjct: 199 HFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-RVGLEVA 247
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 10/140 (7%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N +AE+C+ +IA A+ RLK + G + RTS G F + +E ++ IE
Sbjct: 102 NLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR--RTSEGMFFTLNE--LPLVGRIE 157
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
++A +P +HGE +L Y GQ+Y+ H+D F+P + G QR+AS ++YL
Sbjct: 158 QRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRVASVVMYL 217
Query: 204 SDVEEGGETMFPFEVDLQIS 223
+ +GG T FP E+ L ++
Sbjct: 218 NTPAQGGGTAFP-ELGLTVT 236
>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 156
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 80/143 (55%), Gaps = 20/143 (13%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFIS 135
+ SW+ + + S E+C+ +I K L+ S+++ R G+ +E+ + G F+
Sbjct: 17 LCSWKIKL-----YESKEECEHLIKLGKPYLERSRISDKRTGKGIENR--FAYACGGFV- 68
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY---GPQM 192
K I++ IE +I +P +GE V+ Y +GQK+ HYD+ + + GP
Sbjct: 69 ----KDKIIKNIEQRIPDIISIPVENGEGLQVIHYGVGQKFVPHYDSRSNESFWNGGP-- 122
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A+FL+YLSDVEEGGET+FP
Sbjct: 123 --RVATFLMYLSDVEEGGETVFP 143
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 12/165 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 83 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
P G + QR+AS ++YL+ E GG T FP +V L ++
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP-DVHLDVA 240
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 49/76 (64%)
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
K I+ IE KIA T LP+ +GE VLRYE G+KYD H+D F + R+A+
Sbjct: 7 KDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGHRVATV 66
Query: 200 LLYLSDVEEGGETMFP 215
L+YL+DV +GGET+FP
Sbjct: 67 LMYLTDVSKGGETVFP 82
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 12/165 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
P G + QR+AS ++YL+ E GG T FP +V L ++
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP-DVHLDVA 260
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 77.0 bits (188), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 71/140 (50%), Gaps = 8/140 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+E ++ +I+ P +GE +L Y +Y H+D F P + G Q QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 218
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ ++YL+DV +GGET+FP
Sbjct: 219 VATLVIYLNDVPDGGETIFP 238
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 76/135 (56%), Gaps = 10/135 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
P L N S E+C +I +K +++ S++ A R+ ++ RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAAREVNSI------RTSSGMFFEESENE 92
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
++ IE ++++ + E VL+Y Q+Y +H+D F A + + R+++ +
Sbjct: 93 --LVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 150 MYLNDVEEGGETYFP 164
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 71/140 (50%), Gaps = 8/140 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P+ + N S E+C +IIA R S + + +G RTS FI E +
Sbjct: 82 QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEG-RTSEMAFIQRGEAE 140
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+ E IE ++A P E F + +Y+ Q+Y HYD +P G + QR
Sbjct: 141 --VAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQR 198
Query: 196 LASFLLYLSDVEEGGETMFP 215
LA+F+LYLSDVE+GG T+FP
Sbjct: 199 LATFILYLSDVEQGGGTVFP 218
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 76.6 bits (187), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 69/132 (52%), Gaps = 9/132 (6%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N SA +C +IA A+ RL+ + +G + RTS G F + D+ ++ IE
Sbjct: 102 NILSARECDELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTL--DEVPLVGRIE 157
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
++A +P +HGE +L Y GQ Y+ H+D F+P + G + QR+AS ++YL
Sbjct: 158 RRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASVVMYL 217
Query: 204 SDVEEGGETMFP 215
+ GG T FP
Sbjct: 218 NTPARGGGTAFP 229
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 76.6 bits (187), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 81/161 (50%), Gaps = 8/161 (4%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP AS +S P S +P + +C +I ++ ++ S + +
Sbjct: 72 LPQARVLTASDRSAPKVFSSEQPVIALVADVLDDTECDRLIEIGREHVQRSSV-VDPDSG 130
Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
E T R S G F++AS D ++E I+ +IA P +GE ++LRY +G +Y H
Sbjct: 131 KEITIEERRSEGAFVNASTD--ALVETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPH 188
Query: 180 YDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
YD F + G + QR+A+ +LYL++VE+GG+T FP
Sbjct: 189 YDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQGGDTTFP 229
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 76.6 bits (187), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 72/144 (50%), Gaps = 19/144 (13%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ +SW PRA + F + +C +I A +L+ S + + ++ RTS I
Sbjct: 60 ETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDDI---RTSFSASIM 116
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-MSQ 194
E T I+ IE +IAR T VLRY GQKYD+H+D F+ E S
Sbjct: 117 YGE--TSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGSN 163
Query: 195 RLASFLLYLSDVE--EGGETMFPF 216
R+A+ L+YLSDV+ GGET P
Sbjct: 164 RMATVLMYLSDVDPAAGGETALPL 187
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 76.3 bits (186), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 76/135 (56%), Gaps = 10/135 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
P L N S E+C +I +K +++ S++ A R+ ++ RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIRLSKDKMQRSKIGAAREVNSI------RTSSGMFFDESENE 92
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
++ IE ++++ + E +L+Y Q+Y +H+D F A + + R+++ +
Sbjct: 93 --LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149
Query: 201 LYLSDVEEGGETMFP 215
+YL+DVEEGGET FP
Sbjct: 150 MYLNDVEEGGETYFP 164
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 72/134 (53%), Gaps = 8/134 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P L N S E+C +I +K +++ S++ + RTSSG F SE++
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAER-----EVNSIRTSSGMFFEESENE- 92
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
++ IE ++++ + E +L+Y Q+Y +H+D F A + + R+++ ++
Sbjct: 93 -LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLVM 150
Query: 202 YLSDVEEGGETMFP 215
YL+DVEEGGET FP
Sbjct: 151 YLNDVEEGGETYFP 164
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 71/140 (50%), Gaps = 8/140 (5%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP+ + F + S ++C +I ++ RLK S + E RTS G + ED
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRS-TTVNPETGKEDVIRNRTSEGIWYQRGED- 162
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
+E ++ +I+ P +GE +L Y +Y H+D F P + G Q QR
Sbjct: 163 -AFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 221
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ ++YL+DV +GGET+FP
Sbjct: 222 VATLVIYLNDVPDGGETIFP 241
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 10/140 (7%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N S ++C+ +IA A+ RL+ + +G + RTS G F + +E ++ IE
Sbjct: 102 NILSTQECEELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTLNE--VPLVGRIE 157
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS---QRLASFLLYL 203
++A +P +HGE +L Y GQ+Y+ H+D F+P + YG + QR+AS ++YL
Sbjct: 158 QRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMYL 217
Query: 204 SDVEEGGETMFPFEVDLQIS 223
+ GG T FP E+ L ++
Sbjct: 218 NTPARGGGTAFP-ELGLTVT 236
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 84/169 (49%), Gaps = 28/169 (16%)
Query: 71 QSIPFQVLSWRPR------------ALYFPNFA------SAEQCQSIIATAKKRLKPSQL 112
Q+ P +++S PR + PN A S E+C +I ++ ++K SQ+
Sbjct: 92 QASPIRLISQLPRFTVADREVELAAVMSNPNIAVIRGLLSDEECDEVIRLSRGKMKTSQV 151
Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
R+ G + ES+ R S G+ E++ ++ IE +++ LP GE +L Y
Sbjct: 152 VDRESGGSYESS--VRKSEGSHFERGENE--LVRRIEARLSALVDLPVNRGEPLQILHYG 207
Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
G +Y +H D F P + G + QR+ + ++YL+DV EGGET FP
Sbjct: 208 PGGEYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFP 256
>gi|410637601|ref|ZP_11348175.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
gi|410142794|dbj|GAC15380.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
Length = 280
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 74/139 (53%), Gaps = 10/139 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
R + + NF +A++C++++A K +L+PS++ R+G+ KG RTSS + ++D
Sbjct: 84 RVQMIKIDNFLTAQECEALVALTKSKLRPSEIPEREGDQY---KGFRTSSTCDLPFTKDP 140
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQR 195
+ I+ KI A L E Y IGQ++ +H D F P Y QR
Sbjct: 141 --LAHEIDQKIVDALGLGVGEKEVIQAQHYAIGQEFKAHCDYFVPGSKDFKTYSKDGGQR 198
Query: 196 LASFLLYLSDVEEGGETMF 214
+F++YL+++ EGGET F
Sbjct: 199 TWTFMIYLNELCEGGETEF 217
>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 490
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 74/148 (50%), Gaps = 15/148 (10%)
Query: 78 LSWRPRALYFPNFASAEQCQSII-----ATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+S P + F NF + E+C +I A ++ ++ + +KG RTS
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKG-RTSENA 342
Query: 133 FISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--- 187
+ S + T +LI +I+ T +P H E F +L+YE GQ Y SH+D E
Sbjct: 343 WCSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRR 402
Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFP 215
GP R+ +F LYLSDVEEGG+T FP
Sbjct: 403 CGP----RVLTFFLYLSDVEEGGDTNFP 426
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 12/165 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 83 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
P G + QR+AS ++YL+ E GG T P +V L ++
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVP-DVHLDVA 240
>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
Length = 235
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 78/155 (50%), Gaps = 11/155 (7%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P +L P + + +C +I A+ L+PS++ G + + T RTS
Sbjct: 39 PSNLLHDNPFIGCYEKVVTQTECHQLIDLARHGLQPSKVI---GNSEQKTSAVRTSDT-- 93
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----Y 188
I T + I +IA LP + E + RY++G K+++H+D FNP+ Y
Sbjct: 94 IGFQHHLTELTLQICKRIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMY 153
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ QR+ + LLYL++V GGET FP +++Q++
Sbjct: 154 LSENGQRIITALLYLNNVSAGGETSFPL-LNIQVA 187
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 12/165 (7%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + + ++C ++IA A+ +L S+ + + E
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
RTS + +D + + IE +IA+ P HGE VLRY G +Y HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216
Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
P G + QR+AS ++YL+ E GG T P +V L ++
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVP-DVHLDVA 260
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 76/163 (46%), Gaps = 23/163 (14%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+ GD +Q + +L PR + S E+C +++ ++ RL R+ TV++
Sbjct: 87 DAGDRQVQLLASLML---PRVVVLGGLLSDEECDALVELSRPRL-------RRSTTVDAQ 136
Query: 124 KG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
G RTS GTF + IE +IAR P +GE VL Y G ++
Sbjct: 137 TGGSQVHADRTSRGTFFERGAHP--VCATIEARIARLLEWPVENGEGLQVLHYPPGAEFR 194
Query: 178 SHYDAFNPAEYGP-----QMSQRLASFLLYLSDVEEGGETMFP 215
HYD F+P E G Q QR+A+ ++YL+ GG T FP
Sbjct: 195 PHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPARGGATTFP 237
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 77/146 (52%), Gaps = 13/146 (8%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
P + NF +AE+C +IA A+ +++ + + GE V+ RTS + +E
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQD--RTSMNAAFARAEHP 148
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QR 195
++ +E +IA A P +GE VLRY G +Y +H+D F+ G + + QR
Sbjct: 149 --LIARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQR 206
Query: 196 LASFLLYLSDVEEGGETMFP---FEV 218
+ +FL+YL DV+ GG T FP FE+
Sbjct: 207 VGTFLVYLCDVDAGGATRFPALNFEI 232
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 10/141 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
RP A+ F + +C +IA A+ RL S + G V + G R+S GTF +E
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAA--GHRSSDGTFFRLAE- 157
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
T ++ +E +IA T L +GE +LRY+ G + H D A N E + Q
Sbjct: 158 -TPLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQ 216
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+ + L+YL+DVE GGET+FP
Sbjct: 217 RVGTLLMYLNDVEGGGETVFP 237
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 73.2 bits (178), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 70/135 (51%), Gaps = 9/135 (6%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
+ E CQ++IA + L+P+ + Q E G R S + D IL+ + I
Sbjct: 73 TPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDYPILQSLAEGI 129
Query: 152 ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYLSDVEE 208
A+ T +P E +L Y G +Y HYDAF A P + Q R A+ +LYL+ VEE
Sbjct: 130 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQATLILYLNAVEE 187
Query: 209 GGETMFPFEVDLQIS 223
GGET FP E+ LQ+S
Sbjct: 188 GGETAFP-ELGLQVS 201
>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
[Cucumis sativus]
Length = 311
Score = 72.8 bits (177), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 7/139 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATA-KKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
+SWRPR + F S E+C +I+ A PS+ + G TV + SSG ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE+++A T+LP+ H F +++Y G++ Y N + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ +LYLSD GGE +FP
Sbjct: 173 ATVVLYLSDSASGGEILFP 191
>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
Length = 311
Score = 72.8 bits (177), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 7/139 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATA-KKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
+SWRPR + F S E+C +I+ A PS+ + G TV + SSG ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++D I+ IE+++A T+LP+ H F +++Y G++ Y N + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ +LYLSD GGE +FP
Sbjct: 173 ATVVLYLSDSASGGEILFP 191
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 68/129 (52%), Gaps = 14/129 (10%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
Q +SW+PRA+ + NF S ++ + II A +++K S + + E V RTS GTF+
Sbjct: 41 IQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGV--VDDIRTSYGTFL 98
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++D ++ IE ++A + +P +H E VLRY KY H D +
Sbjct: 99 RRAQDP--VIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHIDGL----------E 146
Query: 195 RLASFLLYL 203
R+A+ L+YL
Sbjct: 147 RVATVLMYL 155
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 86/201 (42%), Gaps = 12/201 (5%)
Query: 14 LPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI 73
+P A+ + + G T LS D P I +L ++ K++ LP + +
Sbjct: 36 VPMAVRIVDAVLRSEGEGGT-LSYDPPHITRRLPAIDTSAKQSVLHLPGADA------VV 88
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
F+ L+ PR + NF S+E+C + A+ P+ + + V + S
Sbjct: 89 TFEQLA--PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQL 146
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+A + ++ +E +I R T P E + RY GQ Y HYD F Q
Sbjct: 147 PAAGSE---LVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGG 203
Query: 194 QRLASFLLYLSDVEEGGETMF 214
QRLA+ +LYL E GG T F
Sbjct: 204 QRLATLILYLRAPEAGGATYF 224
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 72/155 (46%), Gaps = 14/155 (9%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
+ + LS P F ++ I+ + + LKPS + L G + RTS+
Sbjct: 198 DVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTT 257
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
F+ + D ++ I+ +++ T +P H E VLRYE QKYD H D F P E+
Sbjct: 258 YFLPS--DAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYF-PVEHHKN 314
Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFP 215
P + R+ + Y+SDV +GG T+FP
Sbjct: 315 APHILESIDYGYKNRMITVFWYMSDVAKGGHTIFP 349
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 54/89 (60%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
RTSSG F+S E +++ IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 7 RTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDT 66
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ QR+A+ L+YLS+ EGGET FP
Sbjct: 67 FNLKRGGQRIATMLMYLSENIEGGETYFP 95
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 70/140 (50%), Gaps = 9/140 (6%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ + E CQ++IA + L+P+ + Q E G R S + D IL+
Sbjct: 16 WAGLLTPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDHPILQS 72
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYL 203
+ IA+ T +P E +L Y G +Y HYDAF A P + Q R + +LYL
Sbjct: 73 LAEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQGTLILYL 130
Query: 204 SDVEEGGETMFPFEVDLQIS 223
+ VEEGGET FP E+ LQ+S
Sbjct: 131 NAVEEGGETAFP-ELGLQVS 149
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 48/73 (65%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+A+ L+Y
Sbjct: 12 VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 71
Query: 203 LSDVEEGGETMFP 215
LS+VE+GGET+FP
Sbjct: 72 LSNVEKGGETIFP 84
>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
Length = 210
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 68/146 (46%), Gaps = 13/146 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+LS P Y N + ++C II +LKP AL G + RT + ++S
Sbjct: 4 HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKP---ALVSGNSRGFLSTGRTGTNCWLS 60
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
D+ I I KI P + E F VL Y QKY+ HYDAF P + +
Sbjct: 61 HKNDE--ITFNIALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAF-PIDNSEKAKRC 117
Query: 194 -----QRLASFLLYLSDVEEGGETMF 214
QRL + L+YL++V +GGET F
Sbjct: 118 LKKGGQRLLTALIYLNNVTKGGETEF 143
>gi|397644755|gb|EJK76534.1| hypothetical protein THAOC_01697 [Thalassiosira oceanica]
Length = 475
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 73/140 (52%), Gaps = 7/140 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRL--KPSQLALRQGE-TVESTKGT-RTSSGTFISAS 137
P + F NF + ++C +I +K + + Q + + +S + T RTS + S
Sbjct: 273 PWVITFENFLTEDECTHMIEQGRKAEYERSEDVGEVQADGSYDSVRSTGRTSENAWCSFR 332
Query: 138 ED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+ I+EL+ +IA+ T + H E F +L+YE GQ Y H+D + + + R
Sbjct: 333 DGCRNDTIVELVHDRIAKVTGIGANHSEDFQILKYEPGQFYRQHHD-YIEHQRDRRCGPR 391
Query: 196 LASFLLYLSDVEEGGETMFP 215
+ +F LYLSDVEEGG T FP
Sbjct: 392 VLTFFLYLSDVEEGGATNFP 411
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 48/73 (65%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
++ IE +I+ T LP +GEA +L Y+ G+KY+ HYD F+ R+A+ L+Y
Sbjct: 6 VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 65
Query: 203 LSDVEEGGETMFP 215
LS+VE+GGET+FP
Sbjct: 66 LSNVEKGGETIFP 78
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 54/80 (67%), Gaps = 2/80 (2%)
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
A++D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLS+VE+GGET+FP
Sbjct: 249 IATVLMYLSNVEKGGETIFP 268
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 81/167 (48%), Gaps = 20/167 (11%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
LP G A+ + + + +P S E+C +I A +L+ S +
Sbjct: 73 LPPGNRIIAADREVQVALRVEQPVLAVLDGVLSHEECDELIRRAAAKLQRSTI------- 125
Query: 120 VESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
V+ T G R+S GTF + D + ++ +I+ LP HGE +L Y G
Sbjct: 126 VDPTTGKHETIADRSSEGTFFEINADD--FIARLDRRISALMNLPVDHGEGLQILHYGPG 183
Query: 174 QKYDSHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFP 215
+Y H+D F P + G QM+ QR+++ ++YL++VE+GG T+FP
Sbjct: 184 GEYKPHFDFFPPGDPGSAVQMATGGQRVSTLVMYLNEVEDGGATIFP 230
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 69/139 (49%), Gaps = 8/139 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + S +C +I ++R++ S + + E R S G F++ S D
Sbjct: 91 PVVALLADVLSPRECDRLIEIGRERVRRSSV-VDPDSGGEVLIDARKSEGAFVNGSTDP- 148
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
++ I+ +IA P +GE ++LRY G +Y H+D F + G + QR+
Sbjct: 149 -LVATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRI 207
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ +LYL+ VEEGG+T FP
Sbjct: 208 ATLILYLNQVEEGGDTTFP 226
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 54/80 (67%), Gaps = 2/80 (2%)
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
A++D+ ++ IE +I+ T LP +GE+ +L Y+ G+KY+ HYD F+ + R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L+YLS+VE+GGET+FP
Sbjct: 249 IATVLMYLSNVEKGGETIFP 268
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 46/73 (63%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
I+ IE +IA T +P +GE VL Y +GQK++ H+D + R A+FL+Y
Sbjct: 10 IVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGPRKATFLMY 69
Query: 203 LSDVEEGGETMFP 215
LSDVEEGGET+FP
Sbjct: 70 LSDVEEGGETVFP 82
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F S +C I+A A RL S + RTS G F + E
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLARSH-TVDTATGASEVNAARTSDGMFFTRGEHP- 159
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
+ E +IA P +GE VL Y G +Y HYD F+P + G P + QR+
Sbjct: 160 -VCARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRV 218
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ + YL+ GG T FP ++ L+++
Sbjct: 219 ATLVTYLNTPTRGGGTTFP-DIGLEVT 244
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 55/91 (60%), Gaps = 2/91 (2%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTSSG F+S+ E K+ + IE +I+ + +P +GE VLRYE Q Y H+D F+
Sbjct: 11 VRTSSGMFLSSEERKSPMA--IEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
+ QR+A+ L+YLSD EGGET FP
Sbjct: 69 TFNLKRGGQRVATMLMYLSDNVEGGETYFPM 99
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 70.5 bits (171), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 77/155 (49%), Gaps = 10/155 (6%)
Query: 66 GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVEST 123
G + +S + +S PR N + E+C +++ A ++ + L G + VEST
Sbjct: 66 GGMAKKSTWIETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVEST 125
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
TRT+ ++ +D +++ +E KIA+ T GE VL Y Q++ H+D F
Sbjct: 126 --TRTNKQAWLDFQQDD--VVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYF 181
Query: 184 NPAEYGPQM----SQRLASFLLYLSDVEEGGETMF 214
+PA P+ RL + ++YL EEGGET F
Sbjct: 182 DPATDPPENYEKGGNRLITVIVYLQAAEEGGETHF 216
>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
Length = 239
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 72/141 (51%), Gaps = 7/141 (4%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LS P + +FA + C+ +I A+ L +++ R+G + R +S +++A
Sbjct: 31 LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEVQTRRGSAART--AIRRASSCWLAAR 88
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYE--IGQKYDSHYDAFNPAEYG-PQMSQ 194
D+ LE +E I P+ E F+V+RY G++Y +H DAF + Q
Sbjct: 89 GDEA--LEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQ 146
Query: 195 RLASFLLYLSDVEEGGETMFP 215
RL + LLYLSDV GG T+FP
Sbjct: 147 RLTTALLYLSDVGAGGATVFP 167
>gi|224012759|ref|XP_002295032.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
CCMP1335]
gi|220969471|gb|EED87812.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
CCMP1335]
Length = 194
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 71/137 (51%), Gaps = 9/137 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI--SASED 139
P + NF + E+ +I K+ + QL+ ++ +++ GTRTS + +D
Sbjct: 2 PWLVSLENFLTDEEADYLIEVGKR--QQYQLSEQRKDSL----GTRTSYSAWCRRDCWKD 55
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
+ +++ +IA+ T + +LRYE GQK+ H D G QRL +F
Sbjct: 56 DATVSSVVD-RIAKVTKVETKQLSNLQLLRYEEGQKFKQHTDFAAMLSRGRAQGQRLMTF 114
Query: 200 LLYLSDVEEGGETMFPF 216
L+YLSDVEEGGET FP+
Sbjct: 115 LIYLSDVEEGGETSFPY 131
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 70.1 bits (170), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 70/135 (51%), Gaps = 20/135 (14%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILE 145
SAE+C+++IA A+ RL PS +V+ G R+S G F E+ +
Sbjct: 109 SAEECEALIALARPRLAPST-------SVDPLTGRNRLGAQRSSLGMFFRLREN--AFVA 159
Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFL 200
++ +++ LP +GE VL Y G + H+D P+ Q S QR+++ +
Sbjct: 160 RLDERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLV 219
Query: 201 LYLSDVEEGGETMFP 215
YL++VEEGGET+FP
Sbjct: 220 AYLNEVEEGGETVFP 234
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 69.7 bits (169), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 69/138 (50%), Gaps = 18/138 (13%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LSW PR + F S +C +I+ A + +Q V S G I
Sbjct: 62 LSWHPRVFLYEGFLSDMECDHLISMAHGK--------KQSSLVVGGSAGNNSQGASI--- 110
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED I+ IE +I+ + LP+ GE+ +L+YE+ + ++Y++ + + + RL
Sbjct: 111 EDT--IVSTIEDRISVWSFLPKDFGESMQILKYEVNKSDYNNYESQSSSGH-----DRLV 163
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YLSDV+ GGET FP
Sbjct: 164 TVLMYLSDVKRGGETAFP 181
>gi|397620797|gb|EJK65916.1| hypothetical protein THAOC_13183 [Thalassiosira oceanica]
Length = 520
Score = 69.7 bits (169), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 70/150 (46%), Gaps = 9/150 (6%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+++ P +F SA++C I+A A LKP + + VE RTS+ +
Sbjct: 314 LKMIHQDPDIYSVDDFLSADECDRIVAKACPHLKPCLVNNERNGRVEQDPA-RTSTNANV 372
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGPQM 192
E T + KI T E VLRY+ GQ++ H D F+ + G
Sbjct: 373 PQVEVPT-----VMRKITDLTNCSPDKVEILQVLRYKKGQEFVPHTDGFSGPYSSCGFVD 427
Query: 193 SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
S RL + YL+DVEEGG T FP EVDL I
Sbjct: 428 STRLCTVFCYLNDVEEGGTTYFP-EVDLDI 456
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 69.7 bits (169), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 71/138 (51%), Gaps = 8/138 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
L+W PR + F S +C ++ A+ ++ S L + T+ + + F +
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNI--TQNSTDARFKF-QLA 119
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
+ K ++ IE +I+ + +P+ HGE+ +L+Y Q D + D + G RL
Sbjct: 120 DSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 174
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YLSDV++GGET+FP
Sbjct: 175 TILMYLSDVKQGGETVFP 192
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 68/138 (49%), Gaps = 7/138 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LS P+A + F SAE+C +I LK S + + +T RTS GTF+
Sbjct: 4 LSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDT-GVLDDVRTSFGTFLPKK 62
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
D +L IE ++ + + + E +L+Y GQ+Y H D P +R+A
Sbjct: 63 YDD--VLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGLT----SPNGGRRIA 116
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L++L + E+GGET FP
Sbjct: 117 TVLMFLHEPEKGGETSFP 134
>gi|228999322|ref|ZP_04158902.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
gi|229006877|ref|ZP_04164509.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228754370|gb|EEM03783.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228760519|gb|EEM09485.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
Length = 195
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 67/147 (45%), Gaps = 21/147 (14%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + + +CQ +I +KK ++P+Q GE R S T++
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
G++ + IA A LP H E RYE+G K+D+H D + N E
Sbjct: 59 YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFP 215
G QRL + +LYL+ V GGET FP
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP 140
>gi|307106128|gb|EFN54375.1| hypothetical protein CHLNCDRAFT_135671 [Chlorella variabilis]
Length = 486
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 30/165 (18%)
Query: 77 VLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
V+SW PR L +F + ++ + +I A S++ + + E+ RTS G++++
Sbjct: 181 VVSWSSPRVLLIRDFLTPDETEHLIRQATGGFARSEVVAEESKQHEA----RTSYGSWLN 236
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--AFNPAEYGPQMS 193
++ +LE I+++I R +P+ GE+ VL+Y GQKYD H D A +PA Q
Sbjct: 237 GAKRDDKVLE-IQNRIHRLVGIPEAFGESIYVLQYSDGQKYDPHTDHCASSPAAASTQSC 295
Query: 194 Q----------------------RLASFLLYLSDVEEGGETMFPF 216
Q RLA+F+LYL GG T+FP
Sbjct: 296 QDFLRRGGGPKCGPGGGGVTCGDRLATFILYLRSPTRGGATVFPL 340
>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
Length = 195
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 67/147 (45%), Gaps = 21/147 (14%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL P + + +CQ +I +KK ++P+Q GE R S T++
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
G++ + IA A LP H E RYE+G K+D+H D + N E
Sbjct: 59 YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFP 215
G QRL + +LYL+ V GGET FP
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP 140
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 108/225 (48%), Gaps = 27/225 (12%)
Query: 15 PTALLLCSFFFLAGFYGS--TFLSRDVPSIRPKLRTLEVVEKENESGL---------PHG 63
P A +LC F L G TF++ V S R ++ LEV ++ SG+ P
Sbjct: 15 PPAAVLCGIFVLGIGSGVILTFIAAQVVSRRSEI--LEVKPQKLRSGMLNAPGTSLSPAD 72
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVE 121
+ + I++I PR N + E+C+S+ + L + G E VE
Sbjct: 73 KKNEFWIETISDLP---GPRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVE 129
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
S+ TRT++ ++ + ++ +E+ +A+ T +GE +L Y+ Q++ H+D
Sbjct: 130 SS--TRTNTAAWLEYHQGP--VVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHD 185
Query: 182 AFNPAEYGPQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
F+PA P+ RLA+ ++YL + EEGGET F ++D ++
Sbjct: 186 YFDPATDPPENFEPGGNRLATAIIYLQNAEEGGETDF-MKIDTKV 229
>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'Black Sea 11']
gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Black Sea 11']
Length = 354
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 77/155 (49%), Gaps = 10/155 (6%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSG 131
P +VL + + S +C +I L+PS + L V++ RTS
Sbjct: 146 PTEVLDQTLPVELYVDVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNV---RTSYV 202
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
I+ S I ++ I++ T P+ +GEA N+LRY GQ+Y HYDA N G
Sbjct: 203 AIIAPSYCD-WITRKLDKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGSM 261
Query: 192 M---SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
QR+ + L+YL+ V +GGET FP ++D+ +S
Sbjct: 262 YKDGKQRIKTALVYLNTVRQGGETRFP-KLDISVS 295
>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
Length = 201
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 74/146 (50%), Gaps = 9/146 (6%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P Q+L+ +P +P+ S+E CQS+I A+ +L P+ + + G V R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
+ ++ +++ I +IA P + E V Y G K+++H D ++ E +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFL 118
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QRL + +LYL+DV GGET FP
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFP 144
>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
Length = 201
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 74/146 (50%), Gaps = 9/146 (6%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P Q+L+ +P +P+ S+E CQS+I A+ +L P+ + + G V R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
+ ++ +++ I +IA P + E V Y G K+++H D ++ E +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFL 118
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
QRL + +LYL+DV GGET FP
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFP 144
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 66/133 (49%), Gaps = 20/133 (15%)
Query: 94 EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
E+C +I + +L+ S TV+ G R+S GTF + D + +
Sbjct: 109 EECDELIRRSADKLQRST-------TVDPVNGGYEVIAARSSEGTFFPVNADD--FIARL 159
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLY 202
+ +IA P +GE VL Y G +Y H+D F+P + G + QR+++ L+Y
Sbjct: 160 DRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIY 219
Query: 203 LSDVEEGGETMFP 215
L+DV +GG T+FP
Sbjct: 220 LNDVAQGGATVFP 232
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 19/138 (13%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
L+W PR + F S +C ++ A+ ++ S L G R I+ +
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLV---------NAGARN-----ITQN 108
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
++ IE +I+ + +P+ HGE+ +L+Y Q D + D + G RL
Sbjct: 109 STDDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 163
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YLSDV++GGET+FP
Sbjct: 164 TILMYLSDVKQGGETVFP 181
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 13/131 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
NF + E+C ++I +AK + P+ + G +RTSS ++ A ED L +
Sbjct: 8 NFLTEEECDALIDSAKDHMTPAPVV---GPGNGEVSVSRTSSTCYL-ARED----LPSVC 59
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
K+ T P H E V RY G+ Y HYDAF+ + + QR+A+ L+YL
Sbjct: 60 TKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATVLVYL 119
Query: 204 SDVEEGGETMF 214
+DVE GGET F
Sbjct: 120 NDVERGGETSF 130
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 74/144 (51%), Gaps = 6/144 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++ P + F + S ++ + A+ LK + + + G+ V ++ RTS G
Sbjct: 308 LKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTVHV-NGKYV--SRRVRTSKGA 364
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQ 191
++ D + IE ++ T L EA+N++ Y +G Y +HYD FN + +
Sbjct: 365 WLE--RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSE 422
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A+ L YLSDVE+GG T+FP
Sbjct: 423 TGDRIATVLFYLSDVEQGGATVFP 446
>gi|397644356|gb|EJK76358.1| hypothetical protein THAOC_01879, partial [Thalassiosira oceanica]
Length = 539
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 77/157 (49%), Gaps = 14/157 (8%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQG 117
HG T + + + P+ V+ F NF + E+ ++ + +R A G
Sbjct: 324 HGATWEGAEEGSPWVVV--------FDNFLTDEEVADLVKGGELEGYERSTDQGAANAYG 375
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
E + TRTSS + ++ + KI T +PQ + E+F +L+Y+ GQ Y
Sbjct: 376 EQEKVVSRTRTSSNAWCMHKCERLPGVRSASKKIEAVTGIPQVNYESFQLLKYDGGQFYR 435
Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
SH+D+ + + P R+ +F LYLSDVEEGGET F
Sbjct: 436 SHHDS-SSVDDSP-AGHRILTFFLYLSDVEEGGETYF 470
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 72/146 (49%), Gaps = 5/146 (3%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I +V +P+ F + + + + + A +L + + GE + +T R S
Sbjct: 317 IKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATVHGENGELLHAT--YRISKSG 374
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YG 189
++S S+D G ++ I+ +I T L + E V+ Y IG +Y+ HYD E
Sbjct: 375 WLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDTFTS 434
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
R+++ L+Y+SDVE+GG T+FP
Sbjct: 435 LGSGNRISTLLIYMSDVEKGGATVFP 460
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 5/148 (3%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ I ++ S +PR + + N + E+ ++ A+ RL+ S + E TK R +
Sbjct: 330 IQPIKMELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTK-YRIA 388
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
F+ SE I+++ +I T L T E V Y IG Y+ HYD E
Sbjct: 389 KAAFLQNSEHD-HIVKMTR-RIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQ 446
Query: 190 PQMS--QRLASFLLYLSDVEEGGETMFP 215
R+A+++ Y+SDVE GG T+FP
Sbjct: 447 KDFGWGNRIATWMFYMSDVEAGGATVFP 474
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 74/161 (45%), Gaps = 14/161 (8%)
Query: 63 GETGDASI----QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGE 118
GE G A Q +LS P + F NF S E+ +I+ AK + S + +
Sbjct: 46 GEWGYADADWLRQHYNITMLSEDPPVIQFNNFISQERIDAILHFAKPKFARSTSGIER-- 103
Query: 119 TVESTKGTRTSSGTF----ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
RTSS + + ++ L+ +E +IAR LP + E F VL+Y+ Q
Sbjct: 104 ---EVSNYRTSSTAWMLPDVLGNDPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQ 160
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
Y H D P R+A+F LYL+DVEEGG T FP
Sbjct: 161 YYKVHSDYIEEQRQQP-CGIRVATFFLYLNDVEEGGGTRFP 200
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 67.4 bits (163), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 49/98 (50%), Gaps = 7/98 (7%)
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
RTS + +D + + IE +IAR P HGE VLRY G +Y HYD
Sbjct: 2 VHAARTSDSMCLRVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDY 59
Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
F+P G + QR+AS ++YL+ E GG T FP
Sbjct: 60 FDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFP 97
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 70/138 (50%), Gaps = 19/138 (13%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LSW PR + F S +C +++ + ++ S LA T G R SS I
Sbjct: 63 LSWHPRIFLYEGFLSDMECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI--- 110
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED ++ IE +I+ + LP+ +GE+ VL+Y + + + + RLA
Sbjct: 111 EDI--VVSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLA 163
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YLSDV++GGET+FP
Sbjct: 164 TILMYLSDVKQGGETVFP 181
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 67/133 (50%), Gaps = 10/133 (7%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
N A +C+++I AK RL PS L G V S K R S G F E+ ++ +
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDK--RASWGMFFRLCEND--LVARL 162
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
+ +++ LP +GE ++L Y G + H+D P + S QR+++ + Y
Sbjct: 163 DRRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTY 222
Query: 203 LSDVEEGGETMFP 215
L+D EGG+T+FP
Sbjct: 223 LNDAPEGGQTVFP 235
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 67.0 bits (162), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + S +C +++ A+ RL S + + E+ RTS G E
Sbjct: 90 PSIRLYQHLLSDAECDALVELARGRLARSPV-INPDTGDENLIDARTSMGAMFQVGEHT- 147
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
+++ IE +IA +P HGE +L Y+ G +Y H+D FNP G QR
Sbjct: 148 -LIQRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRT 206
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
A+ ++YL+ + GG T FP + L+++
Sbjct: 207 ATLVIYLNTPQAGGATAFP-RIGLEVA 232
>gi|323456313|gb|EGB12180.1| hypothetical protein AURANDRAFT_61447 [Aureococcus anophagefferens]
Length = 317
Score = 66.6 bits (161), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 17/151 (11%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSSGT 132
P +VLS P A +FA+ +C IIA A RL AL G+ E +R++
Sbjct: 101 PCEVLSTAPLAFCVRDFATGAECDRIIAEATPRL---SAALVAGDGAGEQAGSSRSAQVA 157
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------- 183
++ S D + ++A +P +H E+ V++Y G +Y H+DAF
Sbjct: 158 WVPRSPDDP----WLARRVAELIDVPLSHAESLQVVKYGAGGEYKPHFDAFPLDAARGRR 213
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
QR + +LYL+DVE+GG T F
Sbjct: 214 AAVRGRTYAGQRRVTAILYLNDVEKGGGTAF 244
>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
ATCC 15444]
Length = 298
Score = 66.6 bits (161), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 14/140 (10%)
Query: 82 PRA-LY-FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
P+A LY +PNF + E C ++IA +RL+ S + RTS + I +
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRASTTT-----DAFADPKIRTSRSSDI-GTMG 153
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQ 194
+++L E IA A + ++ +A RY++ Q+Y +HYD F P Q+ Q
Sbjct: 154 HNLVMQLDE-LIAEALGIHWSYSDATQTQRYDVNQEYKAHYDYFTPGTRDYQVHCQFTGQ 212
Query: 195 RLASFLLYLSDVEEGGETMF 214
R +F++YL+DVEEGG T F
Sbjct: 213 RTWTFMIYLNDVEEGGGTRF 232
>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 215
Score = 66.2 bits (160), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 9/135 (6%)
Query: 85 LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
++F S ++C +IA KPS + + T G R S T ++ S D I
Sbjct: 17 VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 73
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
+ I +I + + Q + E +L Y G KYD HYDAF ++ PQ+ RL + L
Sbjct: 74 ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 131
Query: 201 LYLSDVEEGGETMFP 215
LYL+DVE GG T FP
Sbjct: 132 LYLNDVEYGGWTQFP 146
>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 213
Score = 66.2 bits (160), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 9/135 (6%)
Query: 85 LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
++F S ++C +IA KPS + + T G R S T ++ S D I
Sbjct: 15 VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 71
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
+ I +I + + Q + E +L Y G KYD HYDAF ++ PQ+ RL + L
Sbjct: 72 ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 129
Query: 201 LYLSDVEEGGETMFP 215
LYL+DVE GG T FP
Sbjct: 130 LYLNDVEYGGWTQFP 144
>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
Length = 226
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 65/137 (47%), Gaps = 15/137 (10%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK-TGI 143
Y P+F A C ++A + S + ES + RTS S D+ +
Sbjct: 43 FYHPDFLDAATCDRLVALIDANRRRSTVLAE-----ESVQDFRTSD----SCDMDRWSPD 93
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLAS 198
+ + IA + HGE RY +GQ + +H+D FN A+ Y P+M QR +
Sbjct: 94 VRPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWT 153
Query: 199 FLLYLSDVEEGGETMFP 215
++YL+DVEEGG T FP
Sbjct: 154 AMIYLNDVEEGGATWFP 170
>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 79/155 (50%), Gaps = 13/155 (8%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSII-ATAKKRLKPSQLALRQGETVE---STKGTRT 128
+ +VLS PRA NF S + I+ T +L S A T + ST+ TRT
Sbjct: 1 MTLKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRT 60
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTH-GEAFNVLRYEIGQKYDSHYDAFNP-- 185
S T++ +K+ I++ I + A ++ + EA ++ Y++GQ+Y +H+D +P
Sbjct: 61 SLNTWVY--REKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDI 118
Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
EY P R + LLYL++ EGG T FP V+
Sbjct: 119 DNEYQP---ARYCTLLLYLNEGMEGGATQFPRWVN 150
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 73/146 (50%), Gaps = 10/146 (6%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ LS P +Y F +C+++I A+ R+K + ++L V ++G RT S ++
Sbjct: 50 METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGSSGV--SQG-RTGSNCWL 106
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-----PAEYG 189
E+ + I ++A+ P + E V+ Y Q+Y HYDA++
Sbjct: 107 RYQEEP--LARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCT 164
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
Q QR+ + LLYL++VEEGG T FP
Sbjct: 165 RQGGQRMVTALLYLNEVEEGGATAFP 190
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 91/181 (50%), Gaps = 23/181 (12%)
Query: 41 SIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQS 98
SIR +LR EV ++E+ + G + + S+ QV LSWRPR + F + E+C
Sbjct: 26 SIRKELRDKEV---KHETIIQLGSSVQTNRISL-LQVVQLSWRPRVFLYKGFLTDEECDR 81
Query: 99 IIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI----LELIEHKIARA 154
+I+ A + + KG + + +++SE ++ I L IE +I+
Sbjct: 82 LISLA-----------HGAKEISKGKGDGSRNNIQLASSESRSHIYDDLLARIEERISAW 130
Query: 155 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
T +P+ + + V+ Y I + + H+D F+ +S +A+ +LYLS+V GGE +F
Sbjct: 131 TFIPKENSKPLQVMHYGIEEARE-HFDYFDNKTLISNVSL-MATLVLYLSNVTRGGEILF 188
Query: 215 P 215
P
Sbjct: 189 P 189
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 68/150 (45%), Gaps = 13/150 (8%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
P +V LS P + F + + +++ AK ++ + + G RTS
Sbjct: 324 PLKVEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVT---GHNSSVVSNARTSQF 380
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
TF+ + K +L I+ ++A T L + E + Y IG Y H D F P +
Sbjct: 381 TFLPKTRHK--VLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETK 438
Query: 189 ---GPQMSQRLASFLLYLSDVEEGGETMFP 215
P+M R+ + L YLSDVE+GG T FP
Sbjct: 439 QVSNPEMGNRIGTVLFYLSDVEQGGATAFP 468
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 77/159 (48%), Gaps = 10/159 (6%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
GETG +Q I + + +P + + +IA AK RL+ S+ +
Sbjct: 307 GETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363
Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
+RTSS T+++ + + + ++ + T+ + E + + Y IG Y H+
Sbjct: 364 GPPSRTSSNTWLNDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHH 423
Query: 181 DAF----NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
D F P++ G + R+A+ ++Y+SDVEEGG T+FP
Sbjct: 424 DYFEEFQTPSK-GNRFGNRVATLMIYMSDVEEGGATVFP 461
>gi|85373596|ref|YP_457658.1| hypothetical protein ELI_03845 [Erythrobacter litoralis HTCC2594]
gi|84786679|gb|ABC62861.1| hypothetical protein ELI_03845 [Erythrobacter litoralis HTCC2594]
Length = 217
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 73/166 (43%), Gaps = 23/166 (13%)
Query: 60 LPHGE------TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
LP GE T +Q +P+ L R +F S + C +IA + +PS +A
Sbjct: 2 LPPGESSADRLTHYGGVQRLPYAKLELFQRR----DFLSPDHCAELIALIETNRRPSTIA 57
Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
G+ RTS + D G LE + + + + HGE RYE+G
Sbjct: 58 DHNGDEY-----FRTSETCDLDPDLDAVGALEAM---LEELSGIDSAHGEPVQGQRYEVG 109
Query: 174 QKYDSHYDAFNP-----AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
Q++ H D F P +Y QR +F++YL++VE GG T F
Sbjct: 110 QEFKPHNDWFEPNGADYEKYCAVAGQRTWTFMIYLNEVEAGGATRF 155
>gi|359400227|ref|ZP_09193216.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
gi|357598467|gb|EHJ60196.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
Length = 193
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 63/131 (48%), Gaps = 13/131 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F QC ++IA + +PS +A G+ V RTSS +S G + +
Sbjct: 16 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPD---VGAVAALA 67
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
K+ + + H E RYE+GQ++ +H D F P +Y QR +F++YL
Sbjct: 68 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 127
Query: 204 SDVEEGGETMF 214
+DV+ GG T F
Sbjct: 128 NDVDAGGATRF 138
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 67/133 (50%), Gaps = 20/133 (15%)
Query: 94 EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
++C+ +IA A+ RL PS TV+ G R+S G F E+ + +
Sbjct: 111 QECEELIALARPRLAPST-------TVDPLSGRDLVGEQRSSLGMFFRLREN--AFIARL 161
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
+ +++ LP +GE VL Y G + H+D P+ + S QR+++ + Y
Sbjct: 162 DQRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSY 221
Query: 203 LSDVEEGGETMFP 215
L++VEEGGET+FP
Sbjct: 222 LNEVEEGGETIFP 234
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 48/83 (57%)
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
F++ E K ++ IE +I+ + +P +GE VLRYE Q Y H+D F +
Sbjct: 2 FLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRG 61
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
QR+A+ L+YLSD EGGET FP
Sbjct: 62 GQRIATMLMYLSDNVEGGETYFP 84
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 63/139 (45%), Gaps = 8/139 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR F + +C +++A A+ RL S + + E+ RTS G E
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPV-INPDTGDENLIEARTSLGAMFQVGEHP- 189
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
++E IE IA T + GE +L Y+ G +Y HYD FNP G QR+
Sbjct: 190 -LIERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRV 248
Query: 197 ASFLLYLSDVEEGGETMFP 215
+ ++YL+ GG T FP
Sbjct: 249 GTLVIYLNSPLAGGATAFP 267
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 17/135 (12%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT------RTSSGTFISASEDK 140
F NFASA++C + +K+L V T G R S+ ++ D
Sbjct: 309 FRNFASAQECAHLREEGRKKL---------SRAVAWTDGAFRPVEFRISTAAWLQPDHDD 359
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
++ + +IA AT L EA V Y IG Y++HYD E R+A+F+
Sbjct: 360 --VVTNLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRIATFM 417
Query: 201 LYLSDVEEGGETMFP 215
+YL+ VE+GG T FP
Sbjct: 418 IYLNQVEQGGYTAFP 432
>gi|399057802|ref|ZP_10744231.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
gi|398041550|gb|EJL34606.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
Length = 210
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 17/152 (11%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
A +Q P S R NF +AEQC ++A + +PS +A G+ R
Sbjct: 16 AGVQRFP----SSRLDLFILKNFVAAEQCAELMALIEDSHRPSTIADYNGD-----DAFR 66
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
TSS +S ++ + ++R + + H E RYE+GQ++ +H D F P
Sbjct: 67 TSSTCDLSTD---VPVVANLAAALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPGN 123
Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMF 214
+Y QR +F++YL++VE GG T F
Sbjct: 124 ADYDKYCAVPGQRTWTFMIYLNEVEAGGATRF 155
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 68/138 (49%), Gaps = 19/138 (13%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LSW PR + F S +C +++T + + S LA T G R SS I
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMD-SSLAF--------TDGDRNSSYNNI--- 356
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
ED ++ IE +I+ + LP+ +GE VL+Y + ++ + LA
Sbjct: 357 EDI--VVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLA 409
Query: 198 SFLLYLSDVEEGGETMFP 215
+ L+YLSDV++GGET+FP
Sbjct: 410 TILIYLSDVKQGGETVFP 427
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 75/148 (50%), Gaps = 9/148 (6%)
Query: 73 IPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTS 129
+P+ + +S P + + N S + +S+ A A K+L+P+ + + +G TR +
Sbjct: 93 MPYAIEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIA 152
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
FI ++++ + I ++ T L E V+ Y I +Y HYD F PA+ G
Sbjct: 153 KMAFIL--DEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTF-PAKSG 209
Query: 190 PQMS---QRLASFLLYLSDVEEGGETMF 214
+ RLA+ +LYLSDVE GG T+F
Sbjct: 210 DRSHPSHDRLATAILYLSDVERGGATVF 237
>gi|393774561|ref|ZP_10362923.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
gi|392720044|gb|EIZ77547.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
Length = 210
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F SA +C+ ++A + +PS +A G+ RTSS +SA + + +
Sbjct: 33 DFLSAPECEELVALIEAEHRPSTIADFTGD-----DAFRTSSTCDLSA---QVPAVADLA 84
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
K+AR + + H E RYE+GQ++ +H D F P Y QR +F++YL
Sbjct: 85 AKLARLSGIDPAHAEPLQGQRYEVGQQFKAHTDTFEPGTADYQRYCSASGQRTWTFMVYL 144
Query: 204 SDVEEGGETMF 214
++V+ GG T F
Sbjct: 145 NEVDAGGATRF 155
>gi|224011205|ref|XP_002295377.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209583408|gb|ACI64094.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 207
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 67/137 (48%), Gaps = 5/137 (3%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQ---LALRQGETVESTKGT-RTSSGTFISAS 137
P + F S E+C I R + S + T +S + + RTS+ T+
Sbjct: 9 PWVVAIEGFLSDEECNRFIELGGDRYERSTEYASTMNLDGTFDSKESSGRTSTNTWCGEG 68
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
I++ + ++ T +P + E ++RYEIGQ+Y+ H+D ++ + G Q R+
Sbjct: 69 CRDDPIIKKVIERMESLTGIPYANFEDLQLVRYEIGQRYEEHHD-YSSSHEGTQYGPRIL 127
Query: 198 SFLLYLSDVEEGGETMF 214
+ YL+DVEEGG T F
Sbjct: 128 TVFFYLNDVEEGGGTQF 144
>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
Length = 508
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 86/160 (53%), Gaps = 17/160 (10%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ +S P + + + + Q +IA A+ RL+P+++ + + E+ R++ G
Sbjct: 295 PFKMEEISLEPYIVVYHDILPDKDMQQLIALAEPRLRPTEVF--EEDKSEARTSDRSALG 352
Query: 132 TFISASE-DKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE- 187
TF+ + + +G +L+ + ++ T + H FN+++Y G +Y +++D FN
Sbjct: 353 TFLPFKDMNPSGGPLLDRLTQRMRDITGIQIRHENTFNIIKYGFGSQYATNFDFFNGTNS 412
Query: 188 ----YGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
YG R+A+ L YL+D GG T+FP +D++++
Sbjct: 413 EMEGYG----DRMATVLFYLNDAPNGGATVFP-RIDVKVT 447
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 90/179 (50%), Gaps = 16/179 (8%)
Query: 42 IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
IR +LR +VV +E L H + S Q LSW+PRA + F S E+C +I+
Sbjct: 25 IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83
Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
A K +LA G++ V + ++S G E + IE +I+ T LP+
Sbjct: 84 LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137
Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ E V++Y+ QKY+ ++ + +++G + +A+ LL+LS+V GGE FP
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFP 192
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 90/179 (50%), Gaps = 16/179 (8%)
Query: 42 IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
IR +LR +VV +E L H + S Q LSW+PRA + F S E+C +I+
Sbjct: 25 IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83
Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
A K +LA G++ V + ++S G E + IE +I+ T LP+
Sbjct: 84 LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137
Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ E V++Y+ QKY+ ++ + +++G + +A+ LL+LS+V GGE FP
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFP 192
>gi|224003081|ref|XP_002291212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972988|gb|EED91319.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 467
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 7/146 (4%)
Query: 81 RPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
P + F NF S + +I + +R A GE + TRTSS +
Sbjct: 263 HPWVVVFNNFLSDNEVDDLIRGGEMEGFERSTDQGAANALGEQEKIVSQTRTSSNAWCMH 322
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
++ G + KI T +P+ + E+F +L Y Q Y SH+D+ + P R+
Sbjct: 323 KCERLGGVRSATTKIEDVTGIPRVNYESFQLLNYGQNQFYRSHHDSSSRDHTPP--GPRI 380
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQI 222
+F LYLSDVEEGGET F ++DL +
Sbjct: 381 LTFFLYLSDVEEGGETYFN-KLDLAV 405
>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 294
Score = 63.9 bits (154), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 15/133 (11%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
+F S +C ++IA A + S + GE ES RTSS F+ A ED L +
Sbjct: 108 DFFSGPECDALIALAGNYMIVSPVVGAGAGEVSES----RTSSSCFL-ARED----LPTV 158
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLY 202
HK+ T P H E V RY QKY +H+DAF + + QR+ + L+Y
Sbjct: 159 CHKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAFDLNTEDGRRFAQNGGQRVCTVLVY 218
Query: 203 LSDVEEGGETMFP 215
L+DV GG T FP
Sbjct: 219 LNDVPSGGCTAFP 231
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 63.9 bits (154), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 6/111 (5%)
Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
S +A E RTSS ++S + D ++ I ++A LP E VL
Sbjct: 4 STIAEAGNEAKNGVGSARTSSTAWLSKTADP--LVAKIRTRVAELVKLPMELAEDMQVLH 61
Query: 170 YEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGETMFPF 216
Y Q Y +H+D F+P Y ++ R + YLSDVEEGGET+FPF
Sbjct: 62 YSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPF 112
>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
Length = 570
Score = 63.5 bits (153), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 70/135 (51%), Gaps = 16/135 (11%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
N S +C +I AK +K + + L +G + ++G RT S ++ ED +++ +
Sbjct: 28 NVISPVECAYLIELAKPHIKRAGVVLDEG--YKESEG-RTGSNHWLKYDEDD--VVQSVG 82
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN--------PAEYGPQMSQRLASFL 200
+I+ LP + E+ ++ Y Q+Y H+DAFN A++G QRL + L
Sbjct: 83 QRISDIVGLPLEYAESMQIIHYGPEQEYRPHFDAFNLSLPKGQRAAKWG---GQRLVTAL 139
Query: 201 LYLSDVEEGGETMFP 215
+YL+ VE GG T FP
Sbjct: 140 VYLNKVEAGGATQFP 154
>gi|219128111|ref|XP_002184265.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404496|gb|EEC44443.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 584
Score = 63.5 bits (153), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 70/148 (47%), Gaps = 10/148 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKK----RLKPSQLALRQG--ETVESTKGTRTSSGTFIS 135
P + NF + E+C +I K R K G E V+ST+ T ++
Sbjct: 382 PWVITLDNFLTLEECTELINIGHKHGYNRSKDVGKVKVDGTHEAVQSTRRTSENAWCSNQ 441
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+ + +L+ ++A +P + E F +L+YE GQ Y +H+D F + Q R
Sbjct: 442 SGCRDEALPQLLHERMATVMRIPAQNSEDFQLLKYEKGQFYRTHHD-FIQHQTKRQCGPR 500
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ +F LYLSDV GG T FP DL I+
Sbjct: 501 ILTFFLYLSDVTAGGGTNFP---DLDIT 525
>gi|311032645|ref|ZP_07710735.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
m3-13]
Length = 137
Score = 63.5 bits (153), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 59/102 (57%), Gaps = 7/102 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + N S E+C ++I +K ++K S+++ G V++ RTSS TF E++
Sbjct: 39 PLVVVLANVLSDEECDALIRLSKDKMKRSKVS--NGLEVDAI---RTSSSTFFHEGENE- 92
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
++ IE ++++ +P HGE +L Y++GQ+Y +H+D F
Sbjct: 93 -LVARIEKRVSQIMNVPVEHGEGLQILNYQVGQEYKAHFDFF 133
>gi|219126272|ref|XP_002183385.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405141|gb|EEC45085.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 474
Score = 63.5 bits (153), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 80/166 (48%), Gaps = 10/166 (6%)
Query: 61 PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQ 116
P E D ++++ +Q+ W + +F + + ++IA +R L
Sbjct: 258 PDYEPND-TVETASYQIGPW---VVIIDDFLNETETSTLIALGADQGYERSTDVGEILED 313
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G + TRTS+ + D + ++I ++ T +P + E+ +LRYE GQ Y
Sbjct: 314 GSYEDDESETRTSTNAWCYNECDDHEVTQIIWERMTFLTQIPPENSESLQMLRYEPGQFY 373
Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
H+D + ++ + R+ + LYL+DVEEGG T FP E++L +
Sbjct: 374 AVHHD-YIENDWNRAVGSRILTVFLYLNDVEEGGATNFP-ELELAV 417
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 11/160 (6%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTS 129
+ + Q +S PRA + S ++C ++I A+ RL S + G+ V + S
Sbjct: 121 RQVSVQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCS 180
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----N 184
+F E+ + + + I + AR P H E RY G+++ H D F N
Sbjct: 181 CASF--PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLN 238
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP---FEVDLQ 221
+ R+A+ LLYL++VE GG T FP FEV Q
Sbjct: 239 NDKIMGSSGHRIATVLLYLNEVEAGGATFFPNPGFEVRPQ 278
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 70/148 (47%), Gaps = 7/148 (4%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + A+ +S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLLVQLHQVIGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S ++ +L+ H + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YLSDVE GG T FPF
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF 461
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 63.2 bits (152), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 9/109 (8%)
Query: 115 RQGETVESTKGT---RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
R G + ST RTS FI+A+ K +L I+ ++A T L + E + Y
Sbjct: 362 RAGVVINSTSTVSKKRTSQHIFIAATRHK--VLRTIDQRVADMTNLNMQYAEDHQLADYG 419
Query: 172 IGQKYDSHYDAFNPAEYG----PQMSQRLASFLLYLSDVEEGGETMFPF 216
IG Y H+D F ++ +M R+A+ L YLSDV +GG T FP
Sbjct: 420 IGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPI 468
>gi|223999645|ref|XP_002289495.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974703|gb|EED93032.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1578
Score = 63.2 bits (152), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 65/147 (44%), Gaps = 9/147 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
L P NF S E C I+A A +P + E VE RTSS +
Sbjct: 1375 LHSNPDIFTVDNFLSDEDCDRIVAKADPHFRPCLVKNESTEAVEQDP-FRTSSDANLPQR 1433
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA--EYGPQMSQR 195
E + + +L A L E +LRY+ GQ++ H D F+ A G + R
Sbjct: 1434 EAPSLVSKLTSLASCDANQL-----EIMQILRYDKGQQFQPHTDGFDGATTACGFEQGNR 1488
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
L + YL+ VE+GG T FP +DL+I
Sbjct: 1489 LVTIFCYLNSVEKGGSTYFPG-IDLEI 1514
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 63.2 bits (152), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 69/132 (52%), Gaps = 13/132 (9%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
F F + ++C +I+ + L+P+ + R G + RTS G F A ED ++
Sbjct: 126 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--VRTSDGGIFGPAREDL--VI 181
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
+ I +IA A+ + GE +LRY +GQ+Y H+D P + +QR + L+YL
Sbjct: 182 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 235
Query: 204 SDVEEGGETMFP 215
++ GGET+FP
Sbjct: 236 NEGYAGGETIFP 247
>gi|223997846|ref|XP_002288596.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
CCMP1335]
gi|220975704|gb|EED94032.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
CCMP1335]
Length = 373
Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 66/129 (51%), Gaps = 4/129 (3%)
Query: 88 PNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI-SASEDKTGILE 145
P + CQS A L S + GE + RTSS + E+ TG+ +
Sbjct: 178 PGWMIVNCCQSCDAKEGFGVLLDSNVIGGSGEKEKVISTHRTSSNAWCRKECENLTGV-K 236
Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSD 205
+ +I T +PQ + E+F +L+Y+ G+ Y SH+D+ + A R+ +F LYL+D
Sbjct: 237 GVSKRIEEMTGIPQNNYESFQILQYKPGEYYKSHHDSSD-ANKDKVTGHRVLTFFLYLND 295
Query: 206 VEEGGETMF 214
VEEGGET F
Sbjct: 296 VEEGGETHF 304
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 79/162 (48%), Gaps = 27/162 (16%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
+S+P +V + + S +C+ +IA LKPS + V+ G
Sbjct: 174 KSLPIEV---------YESILSEYECRYLIAKFSALLKPSMV-------VDPVTGRGKID 217
Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF- 183
RTS I + I ++ I++ T + +GEA N+LRY GQ+Y HYD
Sbjct: 218 SVRTSYVAVIEPTHCD-WITRKLDKIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276
Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
N A QR+ + L+YL+ + EGGET+FP ++D++I+
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTINEGGETLFP-KLDIRIA 317
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 69/148 (46%), Gaps = 7/148 (4%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ +S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +L+ H + + L + E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YLSDVE GG T FPF
Sbjct: 434 GDLHGNRIATAIYYLSDVEAGGGTAFPF 461
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 62.8 bits (151), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 88/187 (47%), Gaps = 28/187 (14%)
Query: 44 PKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
P LR LE ++ S LP+ E +AS PR N + ++C+ ++ A
Sbjct: 42 PSLRGLESLK----SNLPYLEQINAS------------PRVFRIRNLLTKQECEHLMLLA 85
Query: 104 KKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
++ + + G + VEST TRT+ G ++ +D ++ +E + + T
Sbjct: 86 FRKGLSKTMIMPYGTHKLVEST--TRTNDGAWLDFLQDD--VVRRLEETLGKLTKTTPQQ 141
Query: 162 GEAFNVLRYEIG-QKYDSHYDAFNPAEYGP----QMSQRLASFLLYLSDVEEGGETMFPF 216
GE VL Y G Q + HYD F+PA P Q R + ++YL EGGET FP
Sbjct: 142 GENLQVLHYSNGAQFFQEHYDYFDPARDPPESFEQGGNRYITVIVYLEAALEGGETHFP- 200
Query: 217 EVDLQIS 223
E+ L+++
Sbjct: 201 ELGLKLT 207
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 69/130 (53%), Gaps = 10/130 (7%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F ++CQ++I ++ +PS + + + RTSS + +D ++ I+
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTIT-----SENPDQQFRTSSTCHLGNMQDP--VIRKID 155
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSD 205
+I + + ++ E Y++GQ++ H D F P A YG QR +F++YL++
Sbjct: 156 LQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNE 215
Query: 206 VEEGGETMFP 215
VE+GG+T+FP
Sbjct: 216 VEQGGDTVFP 225
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 69/132 (52%), Gaps = 13/132 (9%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
F F + ++C +I+ + L+P+ + R G + RTS G F A ED ++
Sbjct: 143 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--IRTSDGGIFGPAREDL--VI 198
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
+ I +IA A+ + GE +LRY +GQ+Y H+D P + +QR + L+YL
Sbjct: 199 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 252
Query: 204 SDVEEGGETMFP 215
++ GGET+FP
Sbjct: 253 NEGYAGGETIFP 264
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 71/155 (45%), Gaps = 20/155 (12%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TR 127
+ + LS PR NF E+ SII A L +Q A R + TKG TR
Sbjct: 205 VVLETLSHSPRVFSLYNFMDMEEADSIIEDA---LGMTQEAYRLKRSSTGTKGKAISKTR 261
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTH---GEAFNVLRYEIGQKYDSHYDAFN 184
TS F++ T + ++ +I + + + H + VLRY Q Y +H+D
Sbjct: 262 TSDNAFVT----HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLE 317
Query: 185 PAEYGPQMSQ-----RLASFLLYLSDVEEGGETMF 214
AE S+ R A+ +LY +DV EGGET+F
Sbjct: 318 SAEGHDFKSEGLGTNRFATVVLYFNDVREGGETVF 352
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 49/105 (46%), Gaps = 8/105 (7%)
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G RTS TFI + K +L I+ ++A T L E + Y IG Y
Sbjct: 362 GNNASVVSNARTSQFTFIPKTRHK--VLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHY 419
Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
H D F+P + +M R+A+ L YL+DVE+GG T FP
Sbjct: 420 AQHMDWFSPNAFETKQVANSEMGNRIATVLFYLTDVEQGGGTAFP 464
>gi|224012377|ref|XP_002294841.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969280|gb|EED87621.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1164
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 67/136 (49%), Gaps = 6/136 (4%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + NF + E+ +I K++ Q + ++ T +S+ +D
Sbjct: 937 PWLVSLENFLTEEEAAYLIEVGKRQ----QYQRSDYKEIDPEHRTSSSAWCRRDCWKDDA 992
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFL 200
+ +++ +IA+ T +LRYE GQK++ H D P ++ QRL +FL
Sbjct: 993 TVSSVVD-RIAKVTKSETKQLSNLQILRYEEGQKFNQHSDFSRPIRRLKRVQGQRLMTFL 1051
Query: 201 LYLSDVEEGGETMFPF 216
+YLSDVEEGGET FP+
Sbjct: 1052 IYLSDVEEGGETSFPY 1067
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 68/149 (45%), Gaps = 9/149 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + + +S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
+ S + +L+ H + + L + E V Y IG Y+ H+D+F P +
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YLSDVE GG T FPF
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF 461
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 48/105 (45%), Gaps = 8/105 (7%)
Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
G RTS TFI AS K +L I+ ++A T L + E Y IG Y
Sbjct: 362 GHNESLVSNVRTSQFTFIPASAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHY 419
Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
H D F + P+M R+A+ L YLSDV +GG T FP
Sbjct: 420 GQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFP 464
>gi|402824057|ref|ZP_10873445.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
gi|402262407|gb|EJU12382.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
Length = 210
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 64/130 (49%), Gaps = 13/130 (10%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F SAE+C ++ + +PS +A G+ RTSS +S +++ +
Sbjct: 34 FLSAERCAQLVDLIETNNRPSTIADYNGD-----DAFRTSSTCDLSRD---YPVVDELAQ 85
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYLS 204
++R + + H E RYE+GQ++ +H D F P +Y QR +F++YL+
Sbjct: 86 ALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPDSADFEKYCKVPGQRTWTFMIYLN 145
Query: 205 DVEEGGETMF 214
DVE GG T F
Sbjct: 146 DVEAGGATRF 155
>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 188
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 69/151 (45%), Gaps = 14/151 (9%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
VL+ P NF + +C+ +I A+ P+ + G+ +RTSS ++S
Sbjct: 1 VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVV---GKGAGEVSPSRTSSTCYLSR 57
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQ 191
+ L + K++ T P H E V RY Q+Y HYDAF+ +
Sbjct: 58 ED-----LPDLMRKVSSLTGKPIEHCELPQVGRYFPSQQYLQHYDAFDLGTEDGLRFAAN 112
Query: 192 MSQRLASFLLYLSDVEEGGETMFP-FEVDLQ 221
QR + LLYL+DV GG T FP +D+Q
Sbjct: 113 GGQRTITVLLYLNDVARGGATRFPALNLDVQ 143
>gi|323453493|gb|EGB09364.1| hypothetical protein AURANDRAFT_15704, partial [Aureococcus
anophagefferens]
Length = 148
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 7/94 (7%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---A 182
+RTS + + + + + +I T +P+ + E+F VLRY GQ+Y +H+D
Sbjct: 3 SRTSENAWCTGACESNRATRAVMARIEEVTGVPKENYESFQVLRYTHGQQYRAHHDMSRG 62
Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
N GP R+ +F +Y SDVE+GGET FP
Sbjct: 63 DNALACGP----RIYTFFMYFSDVEKGGETEFPM 92
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 78/162 (48%), Gaps = 27/162 (16%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
+S+P +V + + S +C+ +I LKPS + V+ G
Sbjct: 174 KSLPIEV---------YESILSEYECRYLITKFNALLKPSMV-------VDPVTGRGKID 217
Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF- 183
RTS I + I ++ I++ T + +GEA N+LRY GQ+Y HYD
Sbjct: 218 SVRTSYVAVIEPAHCD-WITRKLDKTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276
Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
N A QR+ + L+YL+ + EGGET+FP ++D++I+
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTISEGGETLFP-KLDIRIA 317
>gi|357542083|gb|AET84843.1| hypothetical protein MPXG_00045 [Micromonas pusilla virus SP1]
Length = 196
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 65/129 (50%), Gaps = 12/129 (9%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
F NF ++ + +I AKK L PS ++ + + ES + + T+ +F I+
Sbjct: 24 FRNFITSGERAHVIEEAKKNLTPSTVST-EHKLDESVRKSETAWLSF------DDPIIRG 76
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDV 206
I K R T P + E VLRYE G Y H D A+ +QR+ +F+L L+D
Sbjct: 77 IAEKCIRYTDRPLINCEKLQVLRYEEGGHYIPHQDILRNAK-----NQRMYTFILALNDD 131
Query: 207 EEGGETMFP 215
EGGET+FP
Sbjct: 132 YEGGETVFP 140
>gi|334140935|ref|YP_004534141.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
gi|333938965|emb|CCA92323.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
Length = 209
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 13/131 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F QC ++IA + +PS +A G+ V RTSS +S L
Sbjct: 32 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPDVPAVAALA--- 83
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
K+ + + H E RYE+GQ++ +H D F P +Y QR +F++YL
Sbjct: 84 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 143
Query: 204 SDVEEGGETMF 214
+DV+ GG T F
Sbjct: 144 NDVDAGGATRF 154
>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
Length = 263
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 65/135 (48%), Gaps = 16/135 (11%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-------SQRLASF 199
++++I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDNRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195
Query: 200 LLYLSDVEEGGETMF 214
++YL+D +GG T F
Sbjct: 196 MIYLNDECDGGHTRF 210
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 61.2 bits (147), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 16/137 (11%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F ++ I+ + L PS + L+ G RTS+ ++ +S +++ I+
Sbjct: 11 FLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHP--VVQTIDK 68
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------------EYGPQMSQRLA 197
+ A +P +H E+ VLRYE Q YD H D F+ EYG R+
Sbjct: 69 RTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYG--YKNRMI 126
Query: 198 SFLLYLSDVEEGGETMF 214
+ Y+SDV +GG T F
Sbjct: 127 TVFWYMSDVAKGGHTNF 143
>gi|302849869|ref|XP_002956463.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
nagariensis]
gi|300258161|gb|EFJ42400.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
nagariensis]
Length = 965
Score = 61.2 bits (147), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 68/142 (47%), Gaps = 22/142 (15%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGT 132
+VL+ P + F SA +C I+ +A +K S + + G V+ T+ RTSS
Sbjct: 744 LRVLNIDPPVITVEGFLSAPECDGIVRSAADSGLMKQSGVGV-SGYQVKDTENVRTSSTL 802
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
+A +T A LPQ V RY+ GQ + +H DAF A G +
Sbjct: 803 AATAEPGQT------------AFELPQ-------VARYQPGQHFLTHEDAFPAAVVGSKG 843
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
QR A+ L+YL+D E+GG T F
Sbjct: 844 YQRRATLLVYLNDCEQGGATKF 865
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 61.2 bits (147), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 58/124 (46%), Gaps = 14/124 (11%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
+ + NF S +C+ II A ++K S + + V RTS GTF+ D ++
Sbjct: 1 MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGV--VDDIRTSYGTFLRRVPDP--VI 56
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
IEH++A + LP +H E VLRY KY H D +R+A+ L+YL
Sbjct: 57 AAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHIDGL----------ERVATVLIYLG 106
Query: 205 DVEE 208
E
Sbjct: 107 QAER 110
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 61.2 bits (147), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 69/137 (50%), Gaps = 6/137 (4%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
P + + + SA++ + + A R++ S + L G+ +S R S +++
Sbjct: 331 PYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKS--AFRVSKNAWLAYESHP 388
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASF 199
T +E + + AT L T+ E V Y +G Y+ H+D F +P Y + R+A+
Sbjct: 389 T--MEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATA 446
Query: 200 LLYLSDVEEGGETMFPF 216
+ YLSDVE+GG T FPF
Sbjct: 447 IFYLSDVEQGGATAFPF 463
>gi|90023340|ref|YP_529167.1| response regulator receiver domain-containing protein
[Saccharophagus degradans 2-40]
gi|89952940|gb|ABD82955.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 269
Score = 60.8 bits (146), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 77/146 (52%), Gaps = 11/146 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P +F + + II A +++ ++++ + E +ES RT S +++ +K
Sbjct: 63 PSVTICEDFLTQAEVFQIIKAAGDKMQRARVSSGK-EGIESA--GRTGSNCWVAHDHNK- 118
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRL 196
+ + +I++ + + E+F V+ Y + Q+Y SH+DA+ E G + QRL
Sbjct: 119 -VTHALAKRISKLVGISLQNAESFQVIHYGVSQEYSSHFDAWEFNTERGERCMARGGQRL 177
Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQI 222
+ L+YL+DV GG T FP E+DL++
Sbjct: 178 VTCLIYLNDVPAGGGTGFP-ELDLEV 202
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 60.8 bits (146), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 65/145 (44%), Gaps = 11/145 (7%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LS P + + + + I R+ + + L TV + RTS TFI+ +
Sbjct: 324 LSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVTLTNQSTVSNV---RTSQITFIAKT 380
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQ 191
E + +L+ I+ ++A T L + E Y IG Y H D F + +
Sbjct: 381 EHE--VLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTE 438
Query: 192 MSQRLASFLLYLSDVEEGGETMFPF 216
M R+A+ L YLSDV +GG T FP+
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPY 463
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 60.8 bits (146), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 68/152 (44%), Gaps = 11/152 (7%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
I + + LS P + + + + +I +LK + + E+V S RTS
Sbjct: 290 IAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATIT-STNESVVSN--VRTS 346
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY- 188
TF+ +EDK +L I+ ++A T + E Y IG Y H D F +
Sbjct: 347 QFTFLPVTEDK--VLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFD 404
Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFP 215
P+M R+A+ L YLSDV +GG T FP
Sbjct: 405 AGLVSSPEMGNRIATVLFYLSDVTQGGGTAFP 436
>gi|307103831|gb|EFN52088.1| hypothetical protein CHLNCDRAFT_139357 [Chlorella variabilis]
Length = 1038
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 52/90 (57%), Gaps = 4/90 (4%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLP-QTHGEAFNVLRYEIGQKYDSHYDAFNP 185
RTS GTF++ ++D+ ++ IEH++A T LP + G R+ G +D NP
Sbjct: 11 RTSWGTFLTRAQDE--VVYAIEHRVANWTHLPVENAGGVLQGKRFHYGAHWDDLDLDENP 68
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
G S R+A+ L+YLSD EEGGET FP
Sbjct: 69 DGLG-GGSVRVATVLIYLSDAEEGGETAFP 97
>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 193
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 70/145 (48%), Gaps = 13/145 (8%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ LS PRA NF + + I+ +K+ +++ T TRTSS T+++
Sbjct: 2 KALSCAPRAFQVENFLTDVEADHIVGLVQKKND-----MQRSSTNGHISETRTSSTTWLA 56
Query: 136 ASEDKTGILELIEHKIARA-----TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
D +++ I ++A ML + E ++ Y +GQ+Y +H+D F + P
Sbjct: 57 RHSDP--VIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHD-FGYPKGDP 113
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
R +F +YL+DV GG+T FP
Sbjct: 114 GSPSRSINFCMYLNDVPAGGQTSFP 138
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 74/149 (49%), Gaps = 7/149 (4%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +VLS PR + FP+F + +C+ + ++++L +++ L G RT+
Sbjct: 46 LQPFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYL-GGPPEGGFSLRRTN 104
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH--YDAFNPAE 187
++ S+D +L + +IA AT L T E + V Y +G Y H Y F A+
Sbjct: 105 KVAWM--SDDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQ 162
Query: 188 YGPQMS--QRLASFLLYLSDVEEGGETMF 214
S RLA+ L+YL+DV GG T F
Sbjct: 163 GDIYKSSGNRLATMLIYLADVAGGGATAF 191
>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
Length = 263
Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 64/135 (47%), Gaps = 16/135 (11%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
++ +I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195
Query: 200 LLYLSDVEEGGETMF 214
++YL+D +GG T F
Sbjct: 196 MIYLNDECDGGHTRF 210
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 66/133 (49%), Gaps = 16/133 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ + + + S + +++ A+ L SQ G V S RTS F+ D+
Sbjct: 53 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVISE--IRTSQSVFL----DEV 102
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
G + I +IA T L E +V Y IG +Y H+DA G +++R A+FL+
Sbjct: 103 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDA------GGDVNERTATFLI 156
Query: 202 YLSDVEEGGETMF 214
Y+SDVE GG T+F
Sbjct: 157 YMSDVEVGGATVF 169
>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
Length = 263
Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 64/135 (47%), Gaps = 16/135 (11%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+ +F S+++C I+A K +L PS+LA S RTSS ++ +K +++
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
++ +I L GE Y +G+ Y HYD F P PQ QR +
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKTHCLSRGQRTWTC 195
Query: 200 LLYLSDVEEGGETMF 214
++YL+D +GG T F
Sbjct: 196 MIYLNDECDGGHTRF 210
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 60.1 bits (144), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLS RP + + +F S + + I A+ L+ S +A G+ ++T R S ++
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVA--TGDK-QATAEYRISKSAWLK 392
Query: 136 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
S T + ++ KI+ T L HGE V+ Y IG Y+ H+D A +P+ +
Sbjct: 393 GSAHST--VSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 450
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+F++YLS VE GG T F +
Sbjct: 451 KTGNRVATFMIYLSSVEAGGSTAFIY 476
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 60.1 bits (144), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 69/145 (47%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSI--IATAKKRLKPSQLALRQGETVESTKGTRTSS 130
IP + +S P + + + + Q ++I I+ +K P+ L G E+T+ +
Sbjct: 290 IPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQVSIFCC 349
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
T++ + D ++E I T L + E V Y IG Y HYD+ A P
Sbjct: 350 STWLEDAYDP--VVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPEDP 407
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
QRLA+ + YLS+VE GG T+FP
Sbjct: 408 L--QRLATMMFYLSNVEIGGATIFP 430
>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
Length = 487
Score = 60.1 bits (144), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 67/150 (44%), Gaps = 12/150 (8%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
+ +S P F ++ ++ + L PS + L+ G RTS+ ++
Sbjct: 268 METISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWL 327
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEY 188
+S +++ I+ + A +P +H E+ VLRYE Q YD H D F N A+
Sbjct: 328 ESSSHP--VVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRNSADV 385
Query: 189 GPQMSQ----RLASFLLYLSDVEEGGETMF 214
++ R+ + Y+SDV +GG T F
Sbjct: 386 LKKIEHGYKNRMITVFWYMSDVAKGGHTNF 415
>gi|224013910|ref|XP_002296619.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968971|gb|EED87315.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 541
Score = 59.7 bits (143), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 68/147 (46%), Gaps = 9/147 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKR--LKPSQLALRQ--GETVESTKGTRTSSGTFISA- 136
P + F NF S EQ ++IA K+ + + + + G + RTS +
Sbjct: 340 PWIVVFENFVSDEQATALIAAGAKKGYERSADVGIENPDGSHEDDVNDDRTSHNAWCDDE 399
Query: 137 -SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
+ I +IE +IA T + E +L+YE GQ Y H+D E P R
Sbjct: 400 LCNNDPVIAPVIE-RIASVTKTSVNNSEFLQLLQYEPGQYYKQHHDYIEYQEDMP-CGVR 457
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
+ + LYL+DVEEGG T FP +D+ I
Sbjct: 458 MLTLFLYLNDVEEGGGTHFPL-LDITI 483
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 59.7 bits (143), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 8/143 (5%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I +VL P + F + S+ + + A+ L+ S + ++ V+ R S+GT
Sbjct: 310 IKMEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERS-MVVKYRANVQGKH--RISAGT 366
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + + IE +IA L E F V+ Y IG +Y +H+D F
Sbjct: 367 WVERKYN--NLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVE--- 421
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
RLA+ L Y++DVE+GG T+FP
Sbjct: 422 DNRLATVLFYMNDVEQGGATVFP 444
>gi|356980146|gb|AET43625.1| hypothetical protein MPWG_00137 [Micromonas pusilla virus PL1]
Length = 196
Score = 59.7 bits (143), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 65/134 (48%), Gaps = 14/134 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR F NF + ++ II AK LKPS ++ + + E + + T+ F
Sbjct: 21 PRV--FHNFITPDERMHIIRKAKSELKPSTVSTER-KVDEKVRKSETAWLDF------DD 71
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I++ + + R T P + E VLRYE G Y H D A +QR+ +F+L
Sbjct: 72 PIVKGVADRCLRYTDRPLINCEKLQVLRYEEGGHYIPHQDIIANA-----TNQRMYTFIL 126
Query: 202 YLSDVEEGGETMFP 215
L+D EGGET+FP
Sbjct: 127 ALNDDYEGGETVFP 140
>gi|313768105|ref|YP_004061536.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
gi|312599712|gb|ADQ91733.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
Length = 197
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 69/135 (51%), Gaps = 13/135 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR L N S ++C+ I A K+L+ S +++ + + + R S ++ ASED
Sbjct: 23 KPRVL--KNVLSEDECKHIQDIASKKLQTSTVSMSR----DIDEKIRKSETAWLKASED- 75
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
+++ + K T P + E VL+Y+ G Y H D F + ++R+ +F+
Sbjct: 76 -PVVDKLIRKCVSMTDRPLHNCEDLQVLKYKPGGFYKPHQDCFKNDK-----NKRMYTFI 129
Query: 201 LYLSDVEEGGETMFP 215
+ L+D EGGET FP
Sbjct: 130 IALNDEYEGGETEFP 144
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + + +S+ +A+ +K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
+ S K +L+ H + + L + E V Y IG Y+ H+D+F P +
Sbjct: 376 ASFNYS--KNAATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YLSDVE GG T FPF
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF 461
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 8/126 (6%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F+SAE C + + RL+PS + L RTS G +S E+ ++ ++
Sbjct: 138 FSSAE-CAYLQQMSAPRLRPSTI-LDPQTGARRPDPVRTSVGAALSPVEEDL-VVGMLNR 194
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
+IA AT + GE ++LRY Q+Y H+DA E +QR + ++YL+ EG
Sbjct: 195 RIAAATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE-----NQRSHTLIVYLTADYEG 249
Query: 210 GETMFP 215
GET FP
Sbjct: 250 GETAFP 255
>gi|296284739|ref|ZP_06862737.1| hypothetical protein CbatJ_14013 [Citromicrobium bathyomarinum
JL354]
Length = 210
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 67/150 (44%), Gaps = 17/150 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +PF + L F + C +IA +PS LA G+ RTS
Sbjct: 17 MQRLPFPKVE----LLQMRQFLDPDFCGELIAMIDADRRPSTLADHDGDMY-----FRTS 67
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---- 185
+ + +T LE + +A + + HGE RY +GQ++ +H D FNP
Sbjct: 68 ETCDLPMDDPRTQRLEAM---LAELSGIDPRHGEPLQGQRYAVGQEFKAHCDYFNPDGQD 124
Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
+Y QR +F++YL++ E GG T F
Sbjct: 125 WEKYCSVAGQRTWTFMIYLNEPEAGGATRF 154
>gi|284035817|ref|YP_003385747.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
gi|283815110|gb|ADB36948.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
Length = 328
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 79/159 (49%), Gaps = 14/159 (8%)
Query: 64 ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
+T D SI + ++ R P+F SA++C II A+++ ++ L + +
Sbjct: 136 DTDDESIAAAVYK----RECVQIHPHFFSADECAYIIQYAEEKTLFTRSQLEYDDNTVNE 191
Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
TRTS F+ + + + + I ++A + + + E +RY GQ++ H+D+
Sbjct: 192 SDTRTSYSAFL--KDRQHPVFQAIYERVAASLKVDLNYIEPLQCVRYGEGQQFKPHFDSM 249
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
+ + RL + L+YL+D GGET FP E+++ +
Sbjct: 250 S-------ANHRLHTMLVYLNDDFVGGETYFP-ELNMNV 280
>gi|397571457|gb|EJK47807.1| hypothetical protein THAOC_33450 [Thalassiosira oceanica]
Length = 861
Score = 59.3 bits (142), Expect = 1e-06, Method: Composition-based stats.
Identities = 40/146 (27%), Positives = 63/146 (43%), Gaps = 18/146 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----------TRTSS 130
P + +F + E+C +I S+L Q +T +RTS+
Sbjct: 661 PWVITIEDFLTGEECDVLIKHG------SELGYEQSTQYSNTADVGGSHEFIESESRTST 714
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
TF D I+ + +I + + H E ++RYE+GQ Y H+D + E P
Sbjct: 715 NTFCQGDCDADPIVRGVIDRIVEMSGITANHYEKLQLVRYEVGQYYKEHHDYASAHEELP 774
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
R+ +F LYL+D EGG T F +
Sbjct: 775 -FGPRILTFFLYLNDDYEGGGTSFNY 799
>gi|294889729|ref|XP_002772943.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
ATCC 50983]
gi|239877523|gb|EER04759.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
ATCC 50983]
Length = 383
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 70/150 (46%), Gaps = 24/150 (16%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--------RQGETVESTKGTRT 128
V+ P+ P+F + E+C+ +I+ A+ + +PS + + + V RT
Sbjct: 170 VICRSPKVRLVPDFLTPEECEYMISLAEGKWRPSTVGRSSSSISDGKSDKYVNKRSKGRT 229
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE 187
SS + S+D ++ IE + A P H E N+LRYE G+ + H+D AF P
Sbjct: 230 SSSFMLLHSQDD--VVAEIERRAASLVGFPADHVERLNMLRYESGEFFGQHHDGAFRP-- 285
Query: 188 YGPQMSQRLASFLLYLSDVEE--GGETMFP 215
+ + L+D+ GGET+FP
Sbjct: 286 ---------WTVFITLNDIPRGAGGETLFP 306
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 72/154 (46%), Gaps = 15/154 (9%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ I + +P + + + S ++ +++ AK R K + +R +T E
Sbjct: 324 IQPIKMEEALLKPMIVVYHDVMSDDEIETVKKMAKPRFKRA--TIRNSKTGELEPANYRI 381
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
S + SE+ IL+ + ++ T L + E V+ Y IG Y+ H+D
Sbjct: 382 SKSAWLKSEEHDHILK-VTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTE 440
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
AF +G R+A++L Y+SDVE GG T+FP
Sbjct: 441 AFKELGWG----NRIATWLFYMSDVEAGGATVFP 470
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 9/143 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
LS P + + N S + IA ++ +P ++ GE S K RT+ G +I
Sbjct: 322 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 377
Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
+++ I +I T L HG+ +++Y G YD+H+D N + Q +
Sbjct: 378 KNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 437
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A+ L YL+DV+ GG T+FP
Sbjct: 438 GDRMATVLFYLNDVKHGGSTVFP 460
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 70/145 (48%), Gaps = 7/145 (4%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P + + + + + ++ +K +K + + + RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 371
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
++++ E+ ++E +E ++ T + E + ++ Y IG Y H D F P G
Sbjct: 372 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQHRGG 429
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
R+A+ L YLSDV +GG T+FP
Sbjct: 430 --GDRIATVLFYLSDVPQGGATLFP 452
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI AK L + + G E RT+ GT
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKDGSYEEDP--ARTTKGT 331
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + + +++ + T L + F VL Y IG Y +H+D E G
Sbjct: 332 WLV---ENSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADTEMG-NF 387
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
S R+A+ + YLSDV +GG T+FP
Sbjct: 388 SNRIATAVFYLSDVPQGGATIFP 410
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 46/97 (47%), Gaps = 8/97 (8%)
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS TFI + K +L I+ ++A T L + E Y IG Y H D F
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
+ P+M R+A+ L YLSDV +GG T FP
Sbjct: 431 QTTFDAGLVSSPEMGNRIAAVLFYLSDVAQGGGTAFP 467
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 13/108 (12%)
Query: 119 TVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
TV KG+ RTS TFI + K +L+ I+ ++A + L + E Y IG
Sbjct: 359 TVIGAKGSEVSKVRTSQFTFIPKTRHK--VLQTIDQRVADMSNLNMDYAELHQFANYGIG 416
Query: 174 QKYDSHYDAF------NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
Y H D F N P+M R+A+ L YLSDV +GG T FP
Sbjct: 417 GHYAQHNDWFGQDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFP 464
>gi|116181314|ref|XP_001220506.1| hypothetical protein CHGG_01285 [Chaetomium globosum CBS 148.51]
gi|88185582|gb|EAQ93050.1| hypothetical protein CHGG_01285 [Chaetomium globosum CBS 148.51]
Length = 304
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 10/138 (7%)
Query: 85 LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
+Y N + + S++ATA+ + PSQ+ + G ++ T +S+G +D
Sbjct: 78 IYIHNILTPSEITSLLATAEPKFAPSQVT-KHGRKQQTNDRTSSSAGL---PRDDPAVRC 133
Query: 145 ELIEHKIARATMLPQTHGEAF--NVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLAS 198
L + TML E ++RY GQ+++ H+D ++ ++ P + R+AS
Sbjct: 134 VLDRARTFLGTMLRDGWDEMGPPQLVRYSEGQRFNVHHDWYDVPQWAPDGTTRKWNRVAS 193
Query: 199 FLLYLSDVEEGGETMFPF 216
F L D EGGET FPF
Sbjct: 194 FFAILEDQCEGGETHFPF 211
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 46/97 (47%), Gaps = 8/97 (8%)
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
RTS TFI + K +L I+ ++A T L + E Y IG Y H D F
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430
Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
+ P+M R+A+ L YLSDV +GG T FP
Sbjct: 431 QTTFDAGLVSSPEMGNRIATVLFYLSDVAQGGGTAFP 467
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 74/148 (50%), Gaps = 12/148 (8%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSS 130
PF+V LS P YF + S ++ + II K ++ S++ G+T ST RTS
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSDIRTSQ 391
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
T++ + L I+ ++ T L E ++ Y IG +Y+ H+D + AE +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G + RL + L YL+DV GG T FPF
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF 476
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
+ S + +L+ + + L + E V Y IG Y+ H+D+F P +
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YLSDVE GG T FPF
Sbjct: 433 EGDLHGNRMATGIYYLSDVEAGGGTAFPF 461
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 58/140 (41%), Gaps = 11/140 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + + + I + RLK + + G RTS TFI S K
Sbjct: 302 PLLVLYHDVIYQSEIDVIRKLTENRLKRATVT---GHNESVVSNVRTSQFTFIPVSAHK- 357
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQMSQR 195
+L I+ ++A T L + E Y IG Y H D F P+M R
Sbjct: 358 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNR 416
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L YLSDV +GG T FP
Sbjct: 417 IATVLFYLSDVSQGGGTAFP 436
>gi|357010489|ref|ZP_09075488.1| response regulator receiver domain-containing protein
[Paenibacillus elgii B69]
Length = 397
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 76/155 (49%), Gaps = 11/155 (7%)
Query: 74 PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
P V P + S +C+ +I A+ +L+P+++ + + E V S R S F
Sbjct: 4 PSVVFHEEPLVASYEQVVSGPECRQLIELARHQLEPAKV-IGEKEVVASE--FRKSE--F 58
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-----PAEY 188
D ++ + ++A P + E+ + RY +G ++ +H+D ++ +
Sbjct: 59 AWFHHDSHPLVREVSERLAALAGRPLHYAESLQIARYVVGGRFGAHFDTYDLNTVDGKRF 118
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
Q QRL + LLYL+ V+ GGET FP E++L I+
Sbjct: 119 YDQGGQRLYTALLYLNTVDAGGETYFP-ELNLDIA 152
>gi|89092696|ref|ZP_01165649.1| hypothetical protein MED92_15358 [Neptuniibacter caesariensis]
gi|89083208|gb|EAR62427.1| hypothetical protein MED92_15358 [Oceanospirillum sp. MED92]
Length = 441
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 63/140 (45%), Gaps = 11/140 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+ Y NF S E+C +I + +PS G K RTS +S E
Sbjct: 69 KAEVFYVNNFLSPEECAQMIELIQHHQRPSTTTNETGHY----KHYRTSKTCDLSLLE-- 122
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQR 195
+ + I+ +I + + ++ E Y+IG+++ H D F P E+ QR
Sbjct: 123 STFVAEIDQRICKMLGIEPSYSEGIQGQWYDIGEEFKPHTDYFEPKSDEFLEHAEARGQR 182
Query: 196 LASFLLYLSDVEEGGETMFP 215
+F++YL++ +EGG T FP
Sbjct: 183 TWTFMIYLNNTQEGGGTFFP 202
>gi|333894037|ref|YP_004467912.1| prolyl 4-hydroxylase subunit alpha [Alteromonas sp. SN2]
gi|332994055|gb|AEF04110.1| Prolyl 4-hydroxylase subunit alpha [Alteromonas sp. SN2]
Length = 373
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 61/135 (45%), Gaps = 11/135 (8%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLA---LRQGETVESTKGTRTSSGTFISASEDKTGI 143
F + S +C +I L+PS + QG RTS IS E +
Sbjct: 181 FESTLSGFECNYLITKFSALLQPSMVVDPITGQGRI----DKVRTSYVAIISP-EHCDWL 235
Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYGPQMSQRLASFL 200
I+ +A+AT GE N+LRY GQ+Y HYDA N A+ QR + +
Sbjct: 236 TRKIDKLVAKATKTRCCEGEVLNLLRYVPGQEYKPHYDALNRLHDAKTFEDGGQRTKTAI 295
Query: 201 LYLSDVEEGGETMFP 215
+YL+ V EGG T FP
Sbjct: 296 IYLNTVNEGGNTTFP 310
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 69/139 (49%), Gaps = 10/139 (7%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P NF S ++C++ + K +++ +++ + E+ +RT+ ++ S +
Sbjct: 17 PIVYVVNNFLSDDECEAFVEMGKGKMERAKV-ISDDES--EFHASRTNDFCWLEHS--AS 71
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRL 196
++ + + + +P + E F ++ Y G +Y H+DAF+ Q + QR+
Sbjct: 72 DVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGGQRM 131
Query: 197 ASFLLYLSDVEEGGETMFP 215
+ L YL+DVEEGG T FP
Sbjct: 132 VTALAYLNDVEEGGATDFP 150
>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
Length = 422
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 64/133 (48%), Gaps = 15/133 (11%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE-----L 146
SA++C+ ++ A+ L+ SQ+ + T RTS G + ILE
Sbjct: 242 SADECRLLMLLARPHLRASQVVDPNDASTHRTP-IRTSRGATLDP------ILEDFAARA 294
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYL 203
+ ++A LP TH EA +VL Y G+ Y +H D P A P RL + +YL
Sbjct: 295 AQARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACVYL 354
Query: 204 SDVEEGGETMFPF 216
+DV+ GGET FP
Sbjct: 355 NDVDAGGETEFPV 367
>gi|219126074|ref|XP_002183290.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405046|gb|EEC44990.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 509
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 75/159 (47%), Gaps = 6/159 (3%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQ-- 116
+ H E+ D ++ + +V P + +F + E+C +I + + S+ Q
Sbjct: 298 MEHWESRD-TLTNTTTEVHDDGPWVVSLEDFLTPEECAVMIQLGGDQGYEQSKDVGEQKF 356
Query: 117 -GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
G T RTS+ + + D+ +++ I ++ +P + E +LRYE Q
Sbjct: 357 DGTYAAVTSNERTSTNAWCVGACDEHPVVQTIHQRMESLLNIPAVNYEHLQLLRYEETQF 416
Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
Y +H+D F P G + R+ + LYL+DVEEGG T F
Sbjct: 417 YGNHHD-FIPFHVGRRQGPRILTVFLYLNDVEEGGGTHF 454
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 69/141 (48%), Gaps = 11/141 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + + + S E+ + AK RL+ + ++ +E+ + R + ++S ED
Sbjct: 326 KPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQ-YRITKSAWLSGYEDP 384
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ------MSQ 194
++ + +I T L + E V Y IG +Y+ H+D +Y P
Sbjct: 385 --VVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLR--KYEPDAFKKLGTGN 440
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDVE GG T+FP
Sbjct: 441 RVATWLFYMSDVEAGGATVFP 461
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 77/177 (43%), Gaps = 6/177 (3%)
Query: 43 RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
R +LR +E L G ++ + L P + + SA +
Sbjct: 292 REELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDTAELQHL 351
Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
A+ L+ SQ+ R G + RTS GT ++ I++ + H +A + L
Sbjct: 352 ARPELQRSQVYSRTGHE-HISANFRTSQGTTFEYTDHP--IMQKMSHHVAEISGLDMRSA 408
Query: 163 EAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM--SQRLASFLLYLSDVEEGGETMFPF 216
E + Y IG Y+ H D+F + +Y M + RLA+ + YLS+VE GG T FPF
Sbjct: 409 EPLQIANYGIGGHYEPHMDSFPDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFPF 465
>gi|334343683|ref|YP_004552235.1| procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
gi|334100305|gb|AEG47729.1| Procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
Length = 225
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 13/131 (9%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F SAE+C + +PS L + + RTS+ +S + ++ +
Sbjct: 48 FLSAEECAGLRDLIDAGAQPSSLF-----SGSANADYRTSASCNLSPWDP---LVSAVSD 99
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYLS 204
+I T + HGE RY GQ+Y H+D F A Y P M QR + ++YLS
Sbjct: 100 RICALTGIAADHGETLQGQRYHPGQEYKPHWDYFPVTANYWPAMLKTGGQRSWTAMIYLS 159
Query: 205 DVEEGGETMFP 215
VE GGET FP
Sbjct: 160 PVEAGGETHFP 170
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 69/138 (50%), Gaps = 9/138 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + N S ++ + I AK RL ++ +R +T V +T R S ++ +D
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRL--ARATVRDPKTGVLTTAPYRVSKSAWLEGEDDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
+++ + +I T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 393 --VIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRPFDSNLKVDGNRLA 449
Query: 198 SFLLYLSDVEEGGETMFP 215
+FL Y+SDVE GG T+FP
Sbjct: 450 TFLNYMSDVEAGGATVFP 467
>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
Length = 487
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 78/154 (50%), Gaps = 9/154 (5%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P + F + + + + +K L+ + + + + + ++ RT++G
Sbjct: 295 PFKMELLSEDPYMVVFHDVIYESEIEHLNRISKPFLQRATVVV-EDNSEDTLIKFRTANG 353
Query: 132 TFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
F+ + ++L+E R + Q + +AF L+Y+ G YD H D FN +
Sbjct: 354 AFLYRDKISPKDVQLVERIFQRMRDMSDLQINDDAFEYLKYDFGGHYDIHADYFNYTD-D 412
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
R A+F++YL+DV GG T+FP D++I+
Sbjct: 413 QFTDDRFATFVIYLNDVARGGATVFP---DVEIA 443
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 69/146 (47%), Gaps = 6/146 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++ P + + + SA + + A LK + + G E K TRTS
Sbjct: 321 LKMELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVK-TRTSKVA 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGP 190
+ + ++ + E + +IA T E + Y +G YD HYD FN A
Sbjct: 380 WFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLT 437
Query: 191 QMS-QRLASFLLYLSDVEEGGETMFP 215
QM+ R+A+ L YL+DVE+GG T+FP
Sbjct: 438 QMNGDRIATVLFYLTDVEQGGATVFP 463
>gi|255641158|gb|ACU20856.1| unknown [Glycine max]
Length = 195
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 3/94 (3%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+VL+W PR + NF S E+C + A A RL S + + G+ ++S RTSSG F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKS--DVRTSSGMFL 139
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
+ E K +++ IE +I+ + +P +GE VL
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVL 173
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 74/148 (50%), Gaps = 12/148 (8%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSS 130
PF+V LS P YF + S ++ + II K ++ S++ G+T ST RTS
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSEIRTSQ 391
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
T++ + L I+ ++ T L E ++ Y IG +Y+ H+D + AE +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G + RL + L YL+DV GG T FPF
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF 476
>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
Length = 197
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 70/135 (51%), Gaps = 13/135 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR L N S ++C+ I A K+L+ S ++ ++ + + R S ++ ASED
Sbjct: 23 KPRVL--KNVLSEDECKHIQNIASKKLQTSTVS----KSRDIDESIRKSETAWLKASED- 75
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
+++ + K T P + E VL+Y+ G Y H D F P + ++R+ +F+
Sbjct: 76 -PVVDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDTF-PDD----KNKRMYTFI 129
Query: 201 LYLSDVEEGGETMFP 215
+ L+D EGGET FP
Sbjct: 130 IALNDEYEGGETEFP 144
>gi|383640592|ref|ZP_09952998.1| 2OG-Fe(II) oxygenase [Sphingomonas elodea ATCC 31461]
Length = 221
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 60/131 (45%), Gaps = 13/131 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
NF + QC +++A + +PS LA + T T + A + K ILELI
Sbjct: 43 NFLDSGQCAALMARIDEHRRPSTLANAGDDYAFRTSETCDLAAEDPLAIDLKARILELIG 102
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
L H E RY +GQ++ +H D F+P +Y QR + +LYL
Sbjct: 103 --------LDPDHAEPMQGQRYAVGQEFKAHTDYFDPGSVDFDKYCSVAGQRTWTVMLYL 154
Query: 204 SDVEEGGETMF 214
++VE GG T F
Sbjct: 155 NEVEAGGATRF 165
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 37/55 (67%)
Query: 161 HGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+GE+ +L YE G+KY+ HYD F+ R+A+ L+YLSDV +GGET+FP
Sbjct: 8 NGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFP 62
>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
Length = 485
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 72/145 (49%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + F + S + I +++K ++ PS+ + E K + S
Sbjct: 302 IKTEILSIDPFVILFHDMVSPTEGALIRSSSKNQILPSE-TVNAANEFEVAKFRTSKSVW 360
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D + +
Sbjct: 361 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 417
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL+DV +GG T FP
Sbjct: 418 GYIDRLATTLFYLNDVPQGGATHFP 442
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 14/150 (9%)
Query: 73 IPFQVLSWRPRALYFPNFASA---EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+ +++ P + + + SA Q Q + KR + + R+ E V+ TRTS
Sbjct: 286 LKMELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVK----TRTS 341
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
+ + ++ + E + +IA T E + Y +G YD HYD FN A
Sbjct: 342 KVAWFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFN-ASTA 398
Query: 190 PQMSQ----RLASFLLYLSDVEEGGETMFP 215
++Q R+A+ L YL+DVE+GG T+FP
Sbjct: 399 ANLTQMNGDRIATVLFYLTDVEQGGATVFP 428
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 71/141 (50%), Gaps = 11/141 (7%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT---SSGTFI 134
+SW+PR + F + E+C +I+ A+ + S +G+ +S + R +S T +
Sbjct: 61 VSWQPRVFVYKGFLTDEECDHLISLAQGTKETS-----EGKDDDSGRIERNRLFASSTSL 115
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
+D IL IE +++ T+LP+ + + V+ Y I + +++D F
Sbjct: 116 LNMDD--NILSRIEERVSAWTLLPKENSKPLQVMHYGI-EDAKNYFDYFGNKSAIISSEP 172
Query: 195 RLASFLLYLSDVEEGGETMFP 215
+A+ + YLS+V +GGE FP
Sbjct: 173 LMATLVFYLSNVTQGGEIFFP 193
>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 533
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 72/159 (45%), Gaps = 9/159 (5%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
GETG +Q I + + +P + + +IA AK RL+ S+ +
Sbjct: 307 GETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363
Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
RTSS T++ + + + ++ + T+ + E + + Y IG Y H+
Sbjct: 364 GPPPRTSSNTWLDDDDAPVAARVNQYLQSLLGLGTLYGKDEAEKYQLANYGIGGHYVPHH 423
Query: 181 D----AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
D + ++ R+A+ ++Y+SDVEEGG T+FP
Sbjct: 424 DYLEESLTSSKKHRLFGDRVATLMIYMSDVEEGGATVFP 462
>gi|219126281|ref|XP_002183389.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
CCAP 1055/1]
gi|217405145|gb|EEC45089.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
CCAP 1055/1]
Length = 427
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 50/101 (49%), Gaps = 2/101 (1%)
Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
R + E+ G RTS+ + K + ++A T +P+ + E +L+YE Q
Sbjct: 266 RSADVAETNSG-RTSTNAWCQHDCYKDPTARAVMDRVANITSIPEVNSEYLQMLQYEKSQ 324
Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
Y +H D + P + R+ +F YLSDVEEGG T FP
Sbjct: 325 FYQTHSD-YIPYQVNRPTGVRILTFYFYLSDVEEGGGTNFP 364
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 72/144 (50%), Gaps = 10/144 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
+VL+ P + + AS + +I AK R+ S+ +R GE RTS ++
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRI--SRATIRDDGEP--QVSNARTSQNAWL 358
Query: 135 SASEDKTGILELIEHKIARATM-LPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEY-GPQ 191
A +D+ ++ ++ ++ T L Q E V Y +G Y +H+D A Y G +
Sbjct: 359 DAGDDR--VVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLR 416
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
+ R+A+ + YLSDVE GG T+FP
Sbjct: 417 VGNRIATVMFYLSDVEIGGATVFP 440
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 69/150 (46%), Gaps = 7/150 (4%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +V+ +P+ + F + + + + A A RL+ + + +E + R S
Sbjct: 321 LQPAKEEVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAE-YRIS 379
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
++S ED ++ + H+I + T L E V Y +G Y+ H+D E
Sbjct: 380 KSAWLS--EDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEIN 437
Query: 190 P----QMSQRLASFLLYLSDVEEGGETMFP 215
R+A+FL Y+SDVE GG T+FP
Sbjct: 438 AFKSLNTGNRIATFLFYMSDVEAGGATVFP 467
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 9/135 (6%)
Query: 84 ALYFPNFASAE--QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
+Y +SAE + + + + KR + +L + E V+ TRTS + S +
Sbjct: 330 VIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVK----TRTSKVAWFPDSYNSL 385
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFL 200
+ + +I T + E ++ Y +G YD HYD FN E ++ R+A+ L
Sbjct: 386 TLR--LNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRIATVL 443
Query: 201 LYLSDVEEGGETMFP 215
Y+SDVE+GG T+FP
Sbjct: 444 FYMSDVEQGGATVFP 458
>gi|224011483|ref|XP_002295516.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209583547|gb|ACI64233.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 451
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 5/104 (4%)
Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
G E TRTSS ++ K + + + +I TM+PQT+ E+ +LRYE GQ
Sbjct: 285 DGSYTEDVHSTRTSSNSWCLDKCMKDPVAKDVVDRIEHMTMIPQTNSESLQLLRYEEGQY 344
Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVE----EGGETMFP 215
Y H+D + P R+ +F +YL+ E EGG T FP
Sbjct: 345 YGVHHDLIEHQKDRPP-GVRILTFYMYLNGNEDSGLEGGGTKFP 387
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 190 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 249
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
+ S + +L+ + + L + E V Y IG Y+ H+D+F P +
Sbjct: 250 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 306
Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YL+DVE GG T FPF
Sbjct: 307 EGDLHGNRMATGIYYLADVEAGGGTAFPF 335
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ L P + ++ S+ TA+ R+K S + G + RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
+ S + +L+ + + L + E V Y IG Y+ H+D+F P +
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YL+DVE GG T FPF
Sbjct: 433 EGDLHGNRMATGIYYLADVEAGGGTAFPF 461
>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
Length = 448
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 65/143 (45%), Gaps = 6/143 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI AK RL + + G E RT+ GT
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDIVSIRNMAKGRLARAVTVSKDGNYTEDPD--RTTKGT 331
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + + +++ + T + F VL Y IG Y H D AE
Sbjct: 332 WLV---ENSKLIQRLSQLTQDMTNFEIHDADPFQVLNYGIGGFYGIHLDFLGEAELD-NF 387
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
S R+A+ + YLSDV +GG T+FP
Sbjct: 388 SDRIATAVFYLSDVPQGGATIFP 410
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 68/143 (47%), Gaps = 16/143 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPS----QLALRQGETVESTKGTRTSSGTFISAS 137
PR Y NF SA++ ++A + + PS A QG + + TRTS F +
Sbjct: 2 PRVFYVHNFLSADEADELVAFS---MAPSTGGTHKAWNQGGS-NAKLTTRTSMNAF-DIT 56
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQM 192
+ ++ ++ R + + +LRYE+GQ Y +H+D F N + P
Sbjct: 57 TKLSFRIKRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPSK 116
Query: 193 --SQRLASFLLYLSDVEEGGETM 213
S R A+ LYLSDVE GG+T+
Sbjct: 117 GGSNRFATIFLYLSDVEVGGQTL 139
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 68/152 (44%), Gaps = 22/152 (14%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKG-------TRTSSGTF 133
PR Y NF SA + + + P ++A G T ++ +G TRTS F
Sbjct: 205 PRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSENAF 264
Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---------- 183
++ + + ++ R + + +LRY++GQ Y +H+D F
Sbjct: 265 DITTKQSFDVKKRA-FRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKDFNW 323
Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+P G S R A+ LYLSDV GG+T+FP
Sbjct: 324 DPLSGG---SNRFATIFLYLSDVSYGGQTVFP 352
>gi|398385057|ref|ZP_10543083.1| 2OG-Fe(II) oxygenase superfamily enzyme [Sphingobium sp. AP49]
gi|397721148|gb|EJK81697.1| 2OG-Fe(II) oxygenase superfamily enzyme [Sphingobium sp. AP49]
Length = 225
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 61/133 (45%), Gaps = 15/133 (11%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL-I 147
+F SAE+C ++ + +PS L G RTS + D+ L L I
Sbjct: 47 DFLSAEECAALRSLIDDGAQPSTLFSGSGNAE-----YRTSHSCHL----DRHDPLVLAI 97
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQM----SQRLASFLLY 202
+I T L +GE RY GQ+Y H D F P A Y P M QR + ++Y
Sbjct: 98 SDRICALTGLDSDNGETLQGQRYTSGQEYKVHCDYFPPNASYWPDMLKTGGQRSWTAMIY 157
Query: 203 LSDVEEGGETMFP 215
LS V+ GGET FP
Sbjct: 158 LSSVDAGGETHFP 170
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 19/141 (13%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASE 138
R R F FAS E+C+ + K+RL+ + +A G + VE R S+ ++
Sbjct: 337 RQRLQVFRQFASPEECRHLQHAGKRRLERA-VAWTDGRFQPVE----FRISTAAWLQPDH 391
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD----AFNPAEYGPQMSQ 194
D I++ I +I AT + + EA + Y +G Y+ H+D NP +
Sbjct: 392 D--AIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDHSSRGTNPD------GE 443
Query: 195 RLASFLLYLSDVEEGGETMFP 215
RLA+F++YL+ V++GG T FP
Sbjct: 444 RLATFMIYLNPVKQGGFTAFP 464
>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 750
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 75/169 (44%), Gaps = 45/169 (26%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
F +F SA +C ++A A L+ S++ G+ E RTSS TF++ + + ++
Sbjct: 534 FDHFLSAVECDDLVAIAAPDLRRSRVT--DGKLSEG----RTSSSTFLTGCKQEEPLVRA 587
Query: 147 IEHKIARA----TMLP---------QTHG--------------------EAFNVLRYEIG 173
IE ++ RA T++ + HG E V+RY G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
Q Y +HYD +R A+F++YL+DV GG T FP V + +
Sbjct: 648 QMYTAHYDNKQGC------LRRTATFMMYLTDVHSGGATHFPRAVPVSM 690
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 66/133 (49%), Gaps = 16/133 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ + + + S + +++ A+ L SQ G V S RTS F+ ++
Sbjct: 308 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 357
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
G + I +IA T L E +V Y IG +Y H+D G ++++R A+FL+
Sbjct: 358 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 411
Query: 202 YLSDVEEGGETMF 214
Y+SDVE GG T+F
Sbjct: 412 YMSDVEVGGATVF 424
>gi|404253277|ref|ZP_10957245.1| putative oxygenase [Sphingomonas sp. PAMC 26621]
Length = 226
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASFLLY 202
+ IA + HGE RY +GQ + +H D FN A+ Y P+M QR + ++Y
Sbjct: 98 DEAIATLLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157
Query: 203 LSDVEEGGETMFPF 216
L+DVEEGG T FP
Sbjct: 158 LNDVEEGGATWFPL 171
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 71/149 (47%), Gaps = 13/149 (8%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P + + + + + ++ +K +K + + + RTS+
Sbjct: 375 PFKTELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 434
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++++ E+ ++E +E ++ T + E + ++ Y IG Y H D F PQ
Sbjct: 435 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TPQ 488
Query: 192 MSQR-----LASFLLYLSDVEEGGETMFP 215
+ R +A+ L YLSDV +GG T+FP
Sbjct: 489 LEHRGGGDRIATVLFYLSDVPQGGATLFP 517
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 66/133 (49%), Gaps = 16/133 (12%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P+ + + + S + +++ A+ L SQ G V S RTS F+ ++
Sbjct: 326 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 375
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
G + I +IA T L E +V Y IG +Y H+D G ++++R A+FL+
Sbjct: 376 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 429
Query: 202 YLSDVEEGGETMF 214
Y+SDVE GG T+F
Sbjct: 430 YMSDVEVGGATVF 442
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 70/137 (51%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + H++ T L E V Y +G +Y+ H+D + P + G + R+A+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVAT 456
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 457 FLNYMSDVEAGGATVFP 473
>gi|226503309|ref|NP_001141668.1| uncharacterized protein LOC100273794 [Zea mays]
gi|194705482|gb|ACF86825.1| unknown [Zea mays]
gi|414587757|tpg|DAA38328.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
gi|414587758|tpg|DAA38329.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 165
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 49/73 (67%), Gaps = 3/73 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
+V+SW PR + F NF S+E+C ++A A+ RL+ S + + G+ V+S RTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD--VRTSSGMFV 115
Query: 135 SASEDKTGILELI 147
++ E K+ +++L+
Sbjct: 116 NSEERKSPVVQLV 128
>gi|395492951|ref|ZP_10424530.1| putative oxygenase [Sphingomonas sp. PAMC 26617]
Length = 226
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASFLLY 202
+ IA + HGE RY +GQ + +H D FN A+ Y P+M QR + ++Y
Sbjct: 98 DEAIAALLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157
Query: 203 LSDVEEGGETMFPF 216
L+DVEEGG T FP
Sbjct: 158 LNDVEEGGATWFPL 171
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 10/140 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKP--SQLALRQGETVESTKGTRTSSGTFISASE 138
+P+ N S + + I A+ RL+P +Q G + S R S ++ E
Sbjct: 320 KPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSY---RISKNAWLYYWE 376
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQR 195
+ ++ ++ ++ AT L E V+ Y IG Y+ H+D E P R
Sbjct: 377 HR--LINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDR 434
Query: 196 LASFLLYLSDVEEGGETMFP 215
+A+ L Y+SDVE GG T+FP
Sbjct: 435 IATMLFYMSDVEAGGATVFP 454
>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 508
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 75/162 (46%), Gaps = 19/162 (11%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQGETVESTKGTRTSS 130
++ VLS PR +F S + + ++ A KR LK S + + TRTS+
Sbjct: 278 NMTMTVLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTST 337
Query: 131 GTFISASED---------KTGILELIEH--KIARATMLPQ---TH---GEAFNVLRYEIG 173
+I +D +L++ E + R + +P+ +H E ++ Y++G
Sbjct: 338 NDWIPRHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVG 397
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
Q+Y H+D P Q S R A+ L YL+D +GGET FP
Sbjct: 398 QQYTPHHDFTMPGLVNMQPS-RFATLLFYLNDDMDGGETAFP 438
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 90/199 (45%), Gaps = 22/199 (11%)
Query: 35 LSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI------PF--------QVLSW 80
L R+ P+I+PK V + E DA + I PF +++
Sbjct: 227 LIRNEPNIKPKPFNKSVGDFERGCRGEFPALTDAKLYCIYNTTSSPFLRLAPLKMELIGL 286
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
P + + + S + + AK +LK +++ T + +K TRT+ + + ++
Sbjct: 287 DPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSK-TRTAKLAWFLDTFNQ 345
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----RL 196
+ E + +I T E V+ Y +G Y H+D FN + GP ++Q R+
Sbjct: 346 --LTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTK-GPHITQINGDRI 402
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L YL+DVE+GG T+FP
Sbjct: 403 ATVLFYLNDVEQGGATVFP 421
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 57.4 bits (137), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 71/144 (49%), Gaps = 7/144 (4%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ + +P+ + F + S + + + AK L+ + +A +Q E +K + S
Sbjct: 319 PFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSS 378
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
F ++ + I ++A T L E V+ Y +G +YD H+D F+ + +
Sbjct: 379 WF---PDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFHWGKL--K 433
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A+ L Y+SDV GG T+FP
Sbjct: 434 EVNRIATVLFYMSDVSIGGATVFP 457
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 69/146 (47%), Gaps = 6/146 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++S P + + + S + + + A LK + + +Q K TRTS T
Sbjct: 321 LKMELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVK-TRTSKVT 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYG 189
++ + ++ I + +I T E V+ Y +G YD HYD FN A+
Sbjct: 380 WLLDTLNQLTIR--LNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLT 437
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
R+A+ L YL+DVE+GG T+FP
Sbjct: 438 RLNGDRIATVLFYLTDVEQGGATVFP 463
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 64/143 (44%), Gaps = 6/143 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI K +L + + G E RT+ GT
Sbjct: 187 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 244
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + +++ + T + F VL Y IG Y H+D AE
Sbjct: 245 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 300
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
S R+A+ + YLSDV +GG T+FP
Sbjct: 301 SDRIATAVFYLSDVPQGGATIFP 323
>gi|332187533|ref|ZP_08389270.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
gi|332012462|gb|EGI54530.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
Length = 228
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 13/135 (9%)
Query: 86 YFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE 145
Y +F + QC ++IA +PS L + G RTS ++ + ++
Sbjct: 47 YQADFLTPAQCDALIAMIDANRRPSTLL-----SDRPDYGFRTSESCDMNRWSPE---VQ 98
Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASFL 200
I+ IA+ +P GE RY GQ++ +H+D F+ +E Y ++ QR + +
Sbjct: 99 PIDESIAQLLGIPPEQGETMQGQRYAPGQQFRAHHDYFHESESYWEKVKVHGGQRTWTAM 158
Query: 201 LYLSDVEEGGETMFP 215
+YL+DV EGG T FP
Sbjct: 159 IYLNDVPEGGATWFP 173
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 65/143 (45%), Gaps = 6/143 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI AK L + + G E RT+ GT
Sbjct: 310 LKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQDGNDKEDP--ARTTKGT 367
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + + +++ + T + F VL Y IG Y +H+D E G
Sbjct: 368 WLV---ENSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFDFLEDTEMG-HF 423
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
S R+A+ + YLSDV +GG T FP
Sbjct: 424 SDRIATAVFYLSDVPQGGATTFP 446
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 66/142 (46%), Gaps = 14/142 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I+ A + +K +++ ET T RT+ G ++ ++
Sbjct: 327 PYVVLYHEVLSAREISMLISKAAQNMKNTRV---HRETKPKTNRGRTAKGHWLKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQ---- 194
+ I +I T E F V+ Y IG Y H D F+ A GP+ Q
Sbjct: 383 -LTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVL 441
Query: 195 --RLASFLLYLSDVEEGGETMF 214
R+A+ L YLSDVE+GG T+F
Sbjct: 442 GDRIATVLFYLSDVEQGGATVF 463
>gi|219116348|ref|XP_002178969.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409736|gb|EEC49667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 302
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 9/142 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
V+S P + +F S C+++I TA K +R E T +S T
Sbjct: 90 HVVSSEPPLVLIHDFLSTSMCKNLIDTATSTDK----MIRSTTGSEQETSTIRTSTTVWL 145
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ- 194
E +I KI+ + P H E V+RYE GQ + H D + +M +
Sbjct: 146 NDEQVPETSRIIAEKISSISGFPANHMENLQVVRYETGQSFKLHTDTIDAYN---EMDKR 202
Query: 195 -RLASFLLYLSDVEEGGETMFP 215
R+A+ L+YL+ GGET+FP
Sbjct: 203 GRVATCLIYLAAPTIGGETLFP 224
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 68/141 (48%), Gaps = 13/141 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK RL ++ +R +T V +T R S ++ ED
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
+++ + +I T L E V Y +G +Y+ H+D E P +RL
Sbjct: 395 --VIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 450
Query: 197 --ASFLLYLSDVEEGGETMFP 215
A+FL Y+SDVE GG T+FP
Sbjct: 451 RVATFLNYMSDVEAGGATVFP 471
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 67/143 (46%), Gaps = 17/143 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + N S + + I AK RL ++ +R +T V +T R S ++ ED
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++E + +I T L E + Y +G +Y+ H+D AF G
Sbjct: 397 --VIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTG--- 451
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A+FL Y+SDVE GG T+FP
Sbjct: 452 -NRVATFLNYMSDVEAGGATVFP 473
>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
Length = 557
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 24/151 (15%)
Query: 77 VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
+S P F NF +C+ + A K LK S++ T RTSS F+
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRV------TDGKLSNGRTSSSCFLIG 371
Query: 137 SEDKTGILELIEHKI------------ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
++ K +++ IE ++ R L E ++RY +KY SH+D N
Sbjct: 372 AKGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYGKNEKYTSHFD--N 429
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
A +R+A+F+ YLSD EGG T FP
Sbjct: 430 KA----GSFRRVATFMCYLSDQCEGGCTNFP 456
>gi|313768324|ref|YP_004062004.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
gi|312599020|gb|ADQ91044.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
Length = 219
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 18/137 (13%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ--GETVESTKGTRTSSGTFISASE 138
RPR F +F + ++ + I+ A K LKPS ++ + E+V R S ++ +
Sbjct: 45 RPRV--FHDFITPQERKHIMEMASKELKPSTVSTDRILNESV------RKSETAWLGRED 96
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLAS 198
+++ + H+ + P + E VLRY+ G Y+ H D+ N + RL +
Sbjct: 97 ---PVVDAVIHRCLKYIDRPIKNCEKLQVLRYKPGGYYNPHQDSINDGS-----NPRLYT 148
Query: 199 FLLYLSDVEEGGETMFP 215
F+L L+D EGGET FP
Sbjct: 149 FILALNDEYEGGETEFP 165
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D + +
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL+DV +GG T FP
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP 461
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D + +
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL+DV +GG T FP
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP 461
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
++ AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453
Query: 210 GETMFPF 216
G T FPF
Sbjct: 454 GATAFPF 460
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 318 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 376
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D + +
Sbjct: 377 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 433
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL+DV +GG T FP
Sbjct: 434 GYIDRLATTLFYLNDVPQGGATHFP 458
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D + +
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL+DV +GG T FP
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP 461
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K ++ PS+ + E K + S
Sbjct: 319 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 377
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D + +
Sbjct: 378 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 434
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL+DV +GG T FP
Sbjct: 435 GYIDRLATTLFYLNDVPQGGATHFP 459
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
++ AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453
Query: 210 GETMFPF 216
G T FPF
Sbjct: 454 GATAFPF 460
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 77/146 (52%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+VLS +P + + NF + + + I A+ L+ S +A GE ++T R S ++
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA--SGEN-QATVEYRISKSAWLK 369
Query: 136 ASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
SE + I+ ++ +I+ T L + E V+ Y IG Y+ H+D A +P+ +
Sbjct: 370 GSE--SCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+F++YLS VE GG T F +
Sbjct: 428 KTGNRVATFMIYLSSVEAGGSTAFIY 453
>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
Length = 536
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 71/148 (47%), Gaps = 7/148 (4%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ LS P +++ N S + + + K L+ +++ R + + +R++ G
Sbjct: 313 PFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKV-FRVEKGSDEIDPSRSADG 371
Query: 132 TFI---SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
++ + D +L I +I T L G L+Y G + HYD FN +
Sbjct: 372 AWLPHQNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFNSKTF 431
Query: 189 GPQ-MSQRLASFLLYLSDVEEGGETMFP 215
+ + R+A+ L YL++V+ GG T+FP
Sbjct: 432 SLETVGDRIATVLFYLNNVDHGGATVFP 459
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 75/152 (49%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 362 LQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVA--SGEKQLQVE-YRIS 418
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L H E V+ Y IG Y+ H+D A +P+
Sbjct: 419 KSAWLKDTADP--VLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPS 476
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 477 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 508
>gi|448924767|gb|AGE48348.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus AN69C]
gi|448933638|gb|AGE57193.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus NE-JV-4]
Length = 239
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 67/134 (50%), Gaps = 10/134 (7%)
Query: 89 NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+F S +C +I A K+ +K + ++ +R S T+ + E K I++
Sbjct: 57 DFLSDAECDILINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114
Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I++K ++ + + + E V RY+ GQ Y HYD + + P+ QRLA+ ++
Sbjct: 115 IQNKTRELLDSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173
Query: 202 YLSDVEEGGETMFP 215
YL EEGGET FP
Sbjct: 174 YLKAPEEGGETDFP 187
>gi|381200505|ref|ZP_09907642.1| procollagen-proline dioxygenase [Sphingobium yanoikuyae XLDN2-5]
Length = 221
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 59/132 (44%), Gaps = 13/132 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F AE+C + A +PS L T RTS + ED ++ I
Sbjct: 43 DFLGAEECAVMRAIIDAGAQPSTLF-----TGRENADYRTSHSCNLD-REDP--LVHAIS 94
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYL 203
+I T L HGE RY GQ+Y H D F A Y P+M QR + ++YL
Sbjct: 95 ARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPVNASYWPEMRKTGGQRNWTAMIYL 154
Query: 204 SDVEEGGETMFP 215
S VE GGET FP
Sbjct: 155 SPVEGGGETHFP 166
>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
Length = 478
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 84/212 (39%), Gaps = 46/212 (21%)
Query: 39 VPS----IRPKLRTLEVVEKENESGLPHGETG-DASIQSIPFQVLSWRPRALYFPNFASA 93
VPS + P +RT + L ETG DA +S+ LS RP+ F
Sbjct: 114 VPSGVHFVWPTVRTGHATVTD----LADAETGVDAGHRSV-VTTLSMRPQVFRISQFMMG 168
Query: 94 EQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
+ + +I K R+KPS++ L R G+ TRTS+ + +AS + I
Sbjct: 169 HETEKLIERNKPRIKPSEVGLVGRSGDK------TRTSTNAWDTAS-------PVARDVI 215
Query: 152 ARATMLPQTHG-----EAFNVLRYEIGQKYDSHYDAFNPAEYG----------------P 190
RA L + + VL YE Q Y H D F G
Sbjct: 216 GRAFRLLKIDAHRKLEDGLQVLHYERPQWYKPHVDYFTSRNAGGGGASEDAFSNAIPTAN 275
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
+ R A+ LYL++ GGET+FP +I
Sbjct: 276 NGTNRFATVFLYLNNAGSGGETVFPLSTTHEI 307
>gi|400602974|gb|EJP70572.1| 2OG-Fe(II) oxygenase family Oxidoreductase [Beauveria bassiana
ARSEF 2860]
Length = 269
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 70/154 (45%), Gaps = 12/154 (7%)
Query: 71 QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
Q+ ++LS P A+Y NF + + + ++A + KPS++A G V +T R+S
Sbjct: 43 QNYTVEILSIDPLAIYLNNFLNDAEIRYLLALGENIYKPSEVASHSGIIVNTT--VRSSE 100
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI-GQKYDSHYDAFNPAEYG 189
F+ ED LI + + H E+ +++Y G +Y H D A+
Sbjct: 101 SAFL--LEDDAVCNCLISRMKSLLGNVQHEHVESLQMVKYAAGGDRYRLHTDWSVAAKNN 158
Query: 190 P-------QMSQRLASFLLYLSDVEEGGETMFPF 216
+ S+RL + +YL D GGET FP
Sbjct: 159 TDEASGKLRQSRRLGTIFVYLEDSCAGGETYFPL 192
>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 70/148 (47%), Gaps = 11/148 (7%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSS 130
+ ++ S P + + + + Q +I + ++R+ S Q +RQ E E RTS
Sbjct: 141 LKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSK 196
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH---YDAFNPAE 187
+ D +L+ I ++ T E ++L Y+ G +D H +D + E
Sbjct: 197 EAPFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYE 255
Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFP 215
Y P R AS + YL+DVE+GGET+FP
Sbjct: 256 YHP-FGDRQASVVFYLNDVEDGGETVFP 282
>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
Length = 538
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 71/148 (47%), Gaps = 7/148 (4%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ LS P +++ N S + + + K L+ +++ R + + +R++ G
Sbjct: 315 PFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKV-FRVEKGSDEIDPSRSADG 373
Query: 132 TFI---SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
++ + D +L I +I T L G L+Y G + HYD FN +
Sbjct: 374 AWLPHQNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFNSKTF 433
Query: 189 GPQ-MSQRLASFLLYLSDVEEGGETMFP 215
+ + R+A+ L YL++V+ GG T+FP
Sbjct: 434 SLETVGDRIATVLFYLNNVDHGGATVFP 461
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 63/139 (45%), Gaps = 7/139 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
PR + SA++C+ ++ A+ L+ S++ + + RTS G T ED
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLA 197
+ ++A LP H E +VL Y G++Y +H D P A P R
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ +YL+DV GGET FP
Sbjct: 345 TVCVYLNDVGAGGETEFPV 363
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 63/139 (45%), Gaps = 7/139 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
PR + SA++C+ ++ A+ L+ S++ + + RTS G T ED
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLA 197
+ ++A LP H E +VL Y G++Y +H D P A P R
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ +YL+DV GGET FP
Sbjct: 345 TVCVYLNDVGAGGETEFPV 363
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 64/143 (44%), Gaps = 6/143 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++LS P + F + S + SI K +L + + G E RT+ GT
Sbjct: 310 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 367
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ + +++ + T + F VL Y IG Y H+D AE
Sbjct: 368 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 423
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
S R+A+ + YLSDV +GG T+FP
Sbjct: 424 SDRIATAVFYLSDVPQGGATIFP 446
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 14/143 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P + + SA + ++ A + +K +++ Q E +T RT+ G ++ ++
Sbjct: 325 KPYVVLYHEVLSAREISMLMGKAAQNMKNTRV---QSEKAVNTNRERTAKGYWLKKESNE 381
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQ 191
+ I +I T E F V+ Y IG Y H+D F A +
Sbjct: 382 --MTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIV 439
Query: 192 MSQRLASFLLYLSDVEEGGETMF 214
+ R+A+ L YL+DVE+GG T+F
Sbjct: 440 LGDRIATVLFYLTDVEQGGATVF 462
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 56.6 bits (135), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 70/145 (48%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S ++ I +++K + PS+ + V S RTS
Sbjct: 322 IKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFPSETVNAANDFVVSK--FRTSKSV 379
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYGP 190
++ ++ + + ++A AT L H E F V+ Y IG ++SH+D + +
Sbjct: 380 WLDRDANEATVK--LTQRLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVG 437
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
R+A+ L YL+DV +GG T FP
Sbjct: 438 GFIDRIATTLFYLNDVPQGGATHFP 462
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 56.2 bits (134), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 74/150 (49%), Gaps = 16/150 (10%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF ++LS P + F + + + + TA+ L S + E+V S RT+ G
Sbjct: 281 PFKMELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVKKSNNESVVSK--VRTAKG 338
Query: 132 TFISA---SEDKTGILELIEHKIARATMLPQTHGEAFN---VLRYEIGQKYDSHYDAFNP 185
F+ A S + +++ ++ ++ + L E +N L Y+ G Y H D FN
Sbjct: 339 AFMHADRLSPESAQVVQRLKQRMGDLSDL-NIKREGYNEMQYLNYDFGDHYLLHMDYFNI 397
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ M+ R+A+FL+YL+DV GG T+FP
Sbjct: 398 S-----MNDRIATFLIYLNDVTRGGGTIFP 422
>gi|410665706|ref|YP_006918077.1| 2OG-Fe(II) oxygenase [Simiduia agarivorans SA1 = DSM 21679]
gi|409028063|gb|AFV00348.1| 2OG-Fe(II) oxygenase [Simiduia agarivorans SA1 = DSM 21679]
Length = 270
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 64/135 (47%), Gaps = 20/135 (14%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS----GTFISASEDKTGIL 144
+F SA +C+ ++A +L+PS + T G RTSS G +S ++
Sbjct: 88 DFLSAAECEGVMALMDSKLRPSTVT-----TGNLHYGYRTSSTCDLGRMVSP------LI 136
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEY---GPQMSQRLASF 199
I+ KIAR + E +Y +GQ++ +H D F P AEY QR +F
Sbjct: 137 RDIDQKIARTLGVNVAWSEEIQGQKYLVGQEFKAHTDYFQPQSAEYEKFAGARGQRSWTF 196
Query: 200 LLYLSDVEEGGETMF 214
++YL+ E GG T F
Sbjct: 197 MIYLNTTEAGGATRF 211
>gi|87199403|ref|YP_496660.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
gi|87135084|gb|ABD25826.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
Length = 211
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 61/131 (46%), Gaps = 13/131 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F S +C +IA ++ +PS +A G+ T T D I+ L E
Sbjct: 34 DFLSQAECNGLIARIERDRRPSTIADANGDHYFRTSET-------CDLPMDDPEIVALDE 86
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 203
K+ + + + GE RYE GQ++ +H D F+P + QR +F++YL
Sbjct: 87 -KLCALSGIGRPFGEPIQGQRYESGQEFKAHTDYFDPHGADFQRFCSVAGQRTWTFMVYL 145
Query: 204 SDVEEGGETMF 214
+DVE GG T F
Sbjct: 146 NDVEAGGATRF 156
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V P L F N S + +++ A+ RL + + +E R S
Sbjct: 317 PFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKV 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + L ++ ++A T L + E F V+ Y IG Y+ H+D + + P
Sbjct: 376 AWLEDQEHQH--LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PA 431
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ R+ + L YLSDVE+GG T+FP ++Q+S
Sbjct: 432 IGSRIETVLFYLSDVEQGGATVFP---EIQVS 460
>gi|393725345|ref|ZP_10345272.1| putative oxygenase [Sphingomonas sp. PAMC 26605]
Length = 226
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 62/136 (45%), Gaps = 15/136 (11%)
Query: 86 YFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS-SGTFISASEDKTGIL 144
+ P+F A C +++A + S + E+ + RTS S S D
Sbjct: 44 HHPDFLDAATCDTLVAMIDANKRRSTVLAE-----ENVQEFRTSESCDMDRWSPDVRPTD 98
Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASF 199
E I H + + +GE RY +GQ + +H+D FN + Y PQM QR +
Sbjct: 99 EAIAHLLG----IDPVYGETMQGQRYAVGQHFRAHFDYFNEKQAYWPQMIETGGQRTWTA 154
Query: 200 LLYLSDVEEGGETMFP 215
++YL+ V EGG T FP
Sbjct: 155 MIYLNHVAEGGATWFP 170
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V P L F N S + +++ A+ RL + + +E R S
Sbjct: 311 PFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKV 369
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + L ++ ++A T L + E F V+ Y IG Y+ H+D + + P
Sbjct: 370 AWLEDQEHQH--LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PA 425
Query: 192 MSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ R+ + L YLSDVE+GG T+FP ++Q+S
Sbjct: 426 IGSRIETVLFYLSDVEQGGATVFP---EIQVS 454
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 50/88 (56%), Gaps = 14/88 (15%)
Query: 135 SASEDKTGILELIEHKIARATML-PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----- 188
SAS DK + +++ AT L + + E F V Y IG Y+ H+D F+ +Y
Sbjct: 8 SASADK------LSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFD-FSKVKYFTNPV 60
Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFP 215
QM R+A+F++YL+DVE GG T+FP
Sbjct: 61 LNEQMGDRIATFMIYLNDVEAGGRTVFP 88
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 69/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
RPR + F S E+ +++ AK RL S+ + +T + +T R S ++S E+
Sbjct: 342 RPRIVRFVEIISDEEIETVKELAKPRL--SRATVHDPQTGKLTTAHYRVSKSAWLSGYEN 399
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
I+ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 400 P--IVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 455
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 456 --NRIATWLFYMSDVSAGGATVFP 477
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 71/145 (48%), Gaps = 6/145 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
I ++LS P + + S + I +++K ++ PS+ + E K + S
Sbjct: 301 IKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPSE-TVNAANEFEVAKFRTSKSVW 359
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
F S + + T L+L + ++ AT L H E F V+ Y IG ++SH+D + +
Sbjct: 360 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 416
Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL+DV +GG T FP
Sbjct: 417 GYIDRLATTLFYLNDVPQGGATHFP 441
>gi|103487007|ref|YP_616568.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
gi|98977084|gb|ABF53235.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
Length = 218
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 61/133 (45%), Gaps = 13/133 (9%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
P F AE C ++I +PS +A G+ RTSS + + + I+
Sbjct: 32 LPRFLDAETCAALIDLIDSDARPSTIADANGDAA-----FRTSSTCDL---DHRLPIVIA 83
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLL 201
+ +K+ T +P +GE RY+IG+++ +H D F+P Y QR + ++
Sbjct: 84 VNNKLHDLTGIPLAYGEPLQGQRYDIGEEFKAHTDYFDPHGADWDTYCAVPGQRSWTLMI 143
Query: 202 YLSDVEEGGETMF 214
YL+ GG T F
Sbjct: 144 YLNQPAAGGATRF 156
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 68/143 (47%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + F + S E+ + + +K RL+ + ++ +E T R S ++S E+
Sbjct: 343 RPRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYENP 401
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 402 --VVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 456
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 457 -NRIATWLFYMSDVAAGGATVFP 478
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 49/85 (57%), Gaps = 6/85 (7%)
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AFNPAEYGPQMSQRLA 197
+++ ++++I T L +A V Y IG YD HYD + +E + R+A
Sbjct: 393 VVDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIA 452
Query: 198 SFLLYLSDVEEGGETMFPFEVDLQI 222
+FLLY++DV+ GG T+FP +D+++
Sbjct: 453 TFLLYMTDVDAGGATVFPI-IDVRV 476
>gi|427410040|ref|ZP_18900242.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
51230]
gi|425712173|gb|EKU75188.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
51230]
Length = 225
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 58/132 (43%), Gaps = 13/132 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F AE+C + A +PS L T RTS + ED ++ I
Sbjct: 47 DFLGAEECAVMRAIIDAGAQPSTLF-----TGRENADYRTSHSCNLD-REDP--LVHAIS 98
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYL 203
+I T L HGE RY GQ+Y H D F A Y P M QR + ++YL
Sbjct: 99 ARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPVNASYWPDMRKTGGQRNWTAMIYL 158
Query: 204 SDVEEGGETMFP 215
S VE GGET FP
Sbjct: 159 SPVEGGGETHFP 170
>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
Length = 248
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 66/162 (40%), Gaps = 20/162 (12%)
Query: 72 SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
I + +S PR +F S + I+ + + + + TRTS
Sbjct: 34 DITLRTVSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTRTSMN 93
Query: 132 TFISASEDKTGILELIEHKIARATML---------PQTH---------GEAFNVLRYEIG 173
T+I E T I++ I ++A + P H E ++ Y+ G
Sbjct: 94 TWIYREE--TAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVHYDPG 151
Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
++Y +H+D P R + LLYL+DVEEGGET FP
Sbjct: 152 EEYTAHHDFGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP 193
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q+I A+ L+ S +A GE + R S
Sbjct: 334 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIY 480
>gi|156406532|ref|XP_001641099.1| predicted protein [Nematostella vectensis]
gi|156228236|gb|EDO49036.1| predicted protein [Nematostella vectensis]
Length = 410
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 59/134 (44%), Gaps = 15/134 (11%)
Query: 96 CQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED-------------KTG 142
C + T +K LK +LA + E + T GT + TF D +
Sbjct: 173 CFDGVDTLRKNLKELKLAYKVSER-DFTMGTTCLNETFSGKLRDHFKWSHSMAFYTGENK 231
Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
+ +I AT L + +G F + Y IG Y SH D E P+ R A+ L+Y
Sbjct: 232 FSTMYAKRIQAATGLREENGGKFQITGYPIGVGYKSHTDCV-VYEGEPEKRDRYATILVY 290
Query: 203 LSDVEEGGETMFPF 216
L DVEEGGET FP
Sbjct: 291 LQDVEEGGETDFPL 304
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 75/152 (49%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +VL P + + +F S + Q I A L+ S +A GE + + R S
Sbjct: 349 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 405
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 406 KSAWLKDTVDP--MLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 463
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y R+A+F++YLS VE GG T F +
Sbjct: 464 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIY 495
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 68/142 (47%), Gaps = 13/142 (9%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----- 194
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTG 447
Query: 195 -RLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP 469
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 5/148 (3%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
IQ I + +PR + + + S E+ ++I A+ R + + + ++ E ++ R +
Sbjct: 334 IQPIKMEEALLKPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSR-YRIA 392
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
++ E + I ++ T L E V Y IG Y+ HYD E
Sbjct: 393 KSAWLKHEEHD--YVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQ 450
Query: 190 PQMSQ--RLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDVE GG T+FP
Sbjct: 451 QDFGWGGRIATWLFYMSDVEAGGATVFP 478
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 66/142 (46%), Gaps = 7/142 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++L P + + + S + I+ A++R+ + + T + TRT+ G
Sbjct: 310 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTAMGA 366
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ S + + I ++ + L E V+ Y IG Y H D F ++ M
Sbjct: 367 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 422
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
RLA+ L YL+DVE+GG TMF
Sbjct: 423 GNRLATVLFYLTDVEQGGATMF 444
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 68/141 (48%), Gaps = 13/141 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + +F S E+ + I AK +L ++ +R ++ V +T R S ++ ED
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKL--ARATVRDPKSGVLTTASYRVSKSAWLEGEEDP 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
I+ + +I T L E V Y +G +Y+ H+D E P +RL
Sbjct: 397 --IIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 452
Query: 197 --ASFLLYLSDVEEGGETMFP 215
A+FL Y+SDVE GG T+FP
Sbjct: 453 RVATFLNYMSDVEAGGATVFP 473
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 74/146 (50%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+F++YLS VE GG T F +
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIY 478
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q+I A+ L+ S +A GE + R S
Sbjct: 268 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 325 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 382
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y R+A+F++YLS VE GG T F +
Sbjct: 383 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIY 414
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 372
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 373 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 429
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 430 --NRIATWLFYMSDVSAGGATVFP 451
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 74/146 (50%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+F++YLS VE GG T F +
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIY 478
>gi|390166011|ref|ZP_10218284.1| putative oxygenase [Sphingobium indicum B90A]
gi|390168995|ref|ZP_10220942.1| putative oxygenase [Sphingobium indicum B90A]
gi|389588453|gb|EIM66501.1| putative oxygenase [Sphingobium indicum B90A]
gi|389591170|gb|EIM69145.1| putative oxygenase [Sphingobium indicum B90A]
Length = 225
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 13/131 (9%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F SAE+C + +PS L + + RTS+ +S + ++ +
Sbjct: 48 FLSAEECAGLRELIDAGAQPSTLF-----SGSANADYRTSASCHLSP---WSPLVMAVSD 99
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYLS 204
+I L HGE RY+ GQ+Y +H D F A Y P M QR + ++YLS
Sbjct: 100 RICALMGLAGEHGETLQGQRYQAGQEYKAHCDYFPVTASYWPAMLKTGGQRSWTAMIYLS 159
Query: 205 DVEEGGETMFP 215
VE GGET FP
Sbjct: 160 PVEAGGETHFP 170
>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
Length = 296
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 70/150 (46%), Gaps = 12/150 (8%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSS 130
+ ++ S P + + + + Q +I + ++R+ S Q +RQ E E RTS
Sbjct: 73 LKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSK 128
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA----FNPA 186
+ D +L+ I ++ T E ++L Y+ G +D H D ++P
Sbjct: 129 EAPFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWHPQ 187
Query: 187 EYGPQ-MSQRLASFLLYLSDVEEGGETMFP 215
EY R AS + YL+DVE+GGET+FP
Sbjct: 188 EYEYHPFGDRQASVVFYLNDVEDGGETVFP 217
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 71/148 (47%), Gaps = 9/148 (6%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSS 130
PF ++LS P + + + + + ++ +K +K + + V RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 371
Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
++++ E+ ++E +E ++ T + E + ++ Y IG Y H D F + P
Sbjct: 372 SVWLASHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQ-AP 428
Query: 191 QM---SQRLASFLLYLSDVEEGGETMFP 215
+ R+A+ L YLSDV +GG T+FP
Sbjct: 429 EHRGGGDRIATVLFYLSDVPQGGATLFP 456
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 493
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 494 FLNYMSDVEAGGATVFP 510
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S E+ +++ AK RL S+ + ET + +T R S ++S E
Sbjct: 289 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 345
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 346 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 402
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 403 --NRIATWLFYMSDVSAGGATVFP 424
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 66/142 (46%), Gaps = 7/142 (4%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ ++L P + + + S + I+ A++R+ + + T + TRT+ G
Sbjct: 335 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTALGA 391
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
++ S + + I ++ + L E V+ Y IG Y H D F ++ M
Sbjct: 392 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 447
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
RLA+ L YL+DVE+GG TMF
Sbjct: 448 GNRLATVLFYLTDVEQGGATMF 469
>gi|410447164|ref|ZP_11301266.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
gi|409980151|gb|EKO36903.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
Length = 214
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 12/143 (8%)
Query: 79 SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 138
S P NF S +C + I A+ RL+ S + G E G RTS +I
Sbjct: 18 SVNPIVYLVKNFLSDLECDAFINEAEGRLQDSTVI---GANDEIKLGARTSQNCWIE--H 72
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN------PAEYGPQM 192
D ++ + +++ +P + E + + YE ++Y +D+F+ + P
Sbjct: 73 DANELVHEVSKRLSILAQIPIRNAEQYQLACYEKDEEYKPRFDSFDFDTLEGKKNWEPG- 131
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
QR+ + ++YL+DV+ GG T FP
Sbjct: 132 GQRMLTIIVYLNDVQSGGGTDFP 154
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 74/146 (50%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S E+ Q I A+ L+ S +A GE + R S ++
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 256
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 257 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 314
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+F++YLS VE GG T F +
Sbjct: 315 KSGNRVATFMIYLSSVEAGGATAFIY 340
>gi|340924033|gb|EGS18936.1| hypothetical protein CTHT_0055490 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 311
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 73/151 (48%), Gaps = 18/151 (11%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++S P +Y +F + + +++ + R KPSQ+A + G + + T +S+G
Sbjct: 78 ELVSLDPVIIYIHDFITKPEIDALLEVGEPRFKPSQVA-KYGVLMNTNDRTSSSAGL--- 133
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGE--AFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+D+ + + + TM+ + A ++RY G+K++ H+D F PQM+
Sbjct: 134 PRDDQAVMCVMRRARNFLGTMIRDGWDDMGAPQLVRYREGEKFNHHHDWFE----NPQMA 189
Query: 194 --------QRLASFLLYLSDVEEGGETMFPF 216
R+ASF L D GGET FP+
Sbjct: 190 YDGTSRKWNRVASFFAILQDNCTGGETYFPY 220
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 72/157 (45%), Gaps = 15/157 (9%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I + S P + + + Q + + + +L S +G+ V S RTS
Sbjct: 337 LQPIKVEEQSLDPMIVVLHDLITERQTEILRQLGEPKLATSLHRGGEGKFVRSM--IRTS 394
Query: 130 SGTFISASEDKTGILELIEHKIARATML---PQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
++ E+ + L I H++ AT L P+T E F + Y IG Y +H D
Sbjct: 395 KNAWLQEHENAS--LPAIRHRMELATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHP 452
Query: 187 EYGPQ--------MSQRLASFLLYLSDVEEGGETMFP 215
+ P+ + R+A+ ++YLSDVE GG T+FP
Sbjct: 453 DVRPEDQDPWNLYVGDRIATLMVYLSDVEAGGATVFP 489
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 68/143 (47%), Gaps = 9/143 (6%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
LS P + + N S + IA ++ +P ++ GE S K RT+ G +I
Sbjct: 328 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 383
Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
+++ I +I T L G+ +++Y G YD+H+D N + Q +
Sbjct: 384 ENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 443
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A+ L YL+DV+ GG T+FP
Sbjct: 444 GDRMATVLFYLNDVKHGGSTVFP 466
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 484 FLNYMSDVEAGGATVFP 500
>gi|241598365|ref|XP_002404734.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215500465|gb|EEC09959.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 524
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 70/160 (43%), Gaps = 9/160 (5%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
GE+G +Q I + + +P + + +IA AK R++ QL+ R V S
Sbjct: 301 GESGFFKLQPIKLEEYNLKPYVVVLRDLLQDRDLADMIAFAKPRVRKLQLSRRI--LVYS 358
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
TS+ + + + ++ + T+ + E + + Y IG Y H+D
Sbjct: 359 KHYCDTSTWLNDDDAPVAARVNQYLQSLLGLGTLYSKDEAEKYQLANYGIGGHYVPHHDY 418
Query: 183 FNPAEYGPQMS-------QRLASFLLYLSDVEEGGETMFP 215
+S R+A+ ++Y+SDVEEGG T+FP
Sbjct: 419 LEETLTSRHVSIVTRLFGDRVATLMIYMSDVEEGGATVFP 458
>gi|294013234|ref|YP_003546694.1| putative oxygenase [Sphingobium japonicum UT26S]
gi|292676564|dbj|BAI98082.1| putative oxygenase [Sphingobium japonicum UT26S]
Length = 225
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 13/131 (9%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F SAE+C + +PS L + + RTS+ +S + ++ +
Sbjct: 48 FLSAEECAGLRELIDAGAQPSTLF-----SGSANADYRTSASCHLSP---WSPLVMAVSD 99
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYLS 204
+I L HGE RY+ GQ+Y +H D F A Y P M QR + ++YLS
Sbjct: 100 RICALMGLAGEHGETLQGQRYQAGQEYKAHCDYFPVTASYWPAMLKTGGQRSWTAMIYLS 159
Query: 205 DVEEGGETMFP 215
VE GGET FP
Sbjct: 160 AVEAGGETHFP 170
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 484 FLNYMSDVEAGGATVFP 500
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYLSDVEEG 209
+ AT L T+ E V Y +G Y+ H+D F + + P R+A+ + YLSDVE+G
Sbjct: 393 VGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 452
Query: 210 GETMFPF 216
G T FPF
Sbjct: 453 GATAFPF 459
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ I ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 454 FLNYMSDVEAGGATVFP 470
>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
pv. vasculorum NCPPB 702]
Length = 301
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 61/129 (47%), Gaps = 7/129 (5%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDKTGILELIEHK 150
SA++C+ ++ A+ L+ SQ+ + + RTS G T ED + + +
Sbjct: 121 SADECRLLMLLARPHLRDSQV-IDPNDASTQRAPVRTSRGATLDPIIEDFAA--RVAQAR 177
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSDVE 207
+A L TH E +VL Y G++Y +H D P A P R + +YL+ V+
Sbjct: 178 LAACAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNVVD 237
Query: 208 EGGETMFPF 216
GGET FP
Sbjct: 238 AGGETEFPL 246
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 346
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 347 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 406
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 407 FLNYMSDVEAGGATVFP 423
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 474
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 475 FLNYMSDVEAGGATVFP 491
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 63/142 (44%), Gaps = 13/142 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ QG V RT+ G + ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 422
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 423 FLNYMSDVEAGGATVFP 439
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|390358555|ref|XP_003729286.1| PREDICTED: transmembrane prolyl 4-hydroxylase-like
[Strongylocentrotus purpuratus]
Length = 508
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 63/126 (50%), Gaps = 26/126 (20%)
Query: 116 QGETVESTKG-TRTSSGTFISASEDKTGILELIEHKIARATMLP----QTHGEAFNVLRY 170
+G V+ KG TR S T+IS++ L+ I+++I T L QT E+ V+RY
Sbjct: 222 EGRMVKKWKGRTRVSDQTWISSNTVNGHPLQKIQNRIIALTRLSPEIVQT-SESMQVVRY 280
Query: 171 EIGQKYDSHYDA---------------FNPAEYGPQMSQR-----LASFLLYLSDVEEGG 210
G Y +H+D+ FN +E P +R + + L YL+DV++GG
Sbjct: 281 RPGGHYHAHFDSSEVTSDLECRHTAITFNQSEETPFHEERCRYCRMITILYYLNDVDQGG 340
Query: 211 ETMFPF 216
ET FPF
Sbjct: 341 ETAFPF 346
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 75/152 (49%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 304 LQPVRKEVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 360
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 361 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 418
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y R+A+F++YLS VE GG T F +
Sbjct: 419 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIY 450
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
++ T L T+ E V Y +G Y+ H+D F NP Y + R+A+ + YLS+VE+G
Sbjct: 396 LSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYYLSEVEQG 455
Query: 210 GETMFPF 216
G T FPF
Sbjct: 456 GATAFPF 462
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 63/142 (44%), Gaps = 13/142 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ QG V RT+ G + ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKATQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 41/67 (61%), Gaps = 1/67 (1%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-QMSQRLASFLLYLSDVEEG 209
++ AT L T+ E V Y +G Y+ H+D F +++ P + R+A+ + YLSDVE+G
Sbjct: 398 LSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFYLSDVEQG 457
Query: 210 GETMFPF 216
G T FPF
Sbjct: 458 GATAFPF 464
>gi|255085194|ref|XP_002505028.1| predicted protein [Micromonas sp. RCC299]
gi|226520297|gb|ACO66286.1| predicted protein [Micromonas sp. RCC299]
Length = 439
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 61/155 (39%), Gaps = 19/155 (12%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
+S PR NF S E+ +I+ A L PS + Q T G T+ +A
Sbjct: 205 VSRHPRLAVIHNFISKEEAAAIVDVAAPELHPSLVVRHQTAKRGDTAGGDTAVHGEATAG 264
Query: 138 EDKTGILELIEHKIARATM--------LPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
H I RA + L +H E V+RY Q+Y H+D F+ A
Sbjct: 265 RTSHNCRVSSSHPIVRAAIQRAAYLCGLEPSHAEPAQVVRYLPSQEYKPHHDWFDRAH-- 322
Query: 190 PQM---------SQRLASFLLYLSDVEEGGETMFP 215
P+ QR + L YL + E GG T FP
Sbjct: 323 PESFRAKTEGRGGQRAVTCLAYLVEPERGGRTYFP 357
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 92/201 (45%), Gaps = 30/201 (14%)
Query: 43 RPKLRTLEVVE---KENESGLPHGETGDASIQS------IPF-----QVLSWRPRALYFP 88
RPKL++ + E + + LP ++ +++ +PF ++L+ +P + F
Sbjct: 263 RPKLKSTKAYEALCRGEQLKLPDVDSEQQALKCRYKPGILPFVRYKEEMLNRKPHIVLFH 322
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+ S + +++ A +L+ + +A + + S R S +++ I +L
Sbjct: 323 DVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKRISQVSWLWDDHANKTIHQL-S 381
Query: 149 HKIARATMLPQT-------HGEAFNVLRYEIGQKYDSHYDAF-------NPAEYGPQMSQ 194
++A T L QT E F +L Y IG +Y+ H D F + E+
Sbjct: 382 RRVADITGL-QTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHSSLPEHVRASGN 440
Query: 195 RLASFLLYLSDVEEGGETMFP 215
RLA+F+ YL+DV GG T+FP
Sbjct: 441 RLATFMFYLNDVHAGGATVFP 461
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 254 KPRIIRFHDIISDAENEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 311
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 312 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 367
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 368 --NRIATWLFYMSDVSAGGATVFP 389
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|149186836|ref|ZP_01865146.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
gi|148829503|gb|EDL47944.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
Length = 211
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 68/152 (44%), Gaps = 21/152 (13%)
Query: 70 IQSIPFQVLSWRPRALYFP--NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
IQ +P P+A F +FAS C+ ++A +K +PS LA + R
Sbjct: 18 IQRVP------TPKAELFQLRDFASPAMCEQLVALIEKDRRPSTLA-----DAGNDHYFR 66
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
TS + E ++ IE +A + GE RY++GQ++ H D FN
Sbjct: 67 TSETCDLDPDEP---VVCEIEALLAALNGIDPKFGEPLQGQRYDVGQEFKPHCDYFNRGG 123
Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMF 214
+Y QR +F++YL+ VE GG T F
Sbjct: 124 QDWEKYCSVAGQRTWTFMVYLNAVEAGGATRF 155
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 7/148 (4%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ LS P S++ + I A+ ++K S + G RTS G
Sbjct: 316 PFKLEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQG 375
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
+ S + +++ + + L E V Y IG Y+ H+D+F + +
Sbjct: 376 ASFNYS--RNAATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSFPENHIYDE 433
Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
G R+A+ + YLSDVE GG T FPF
Sbjct: 434 GDDRGNRIATGIYYLSDVEAGGGTAFPF 461
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
+ AT L T+ E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+G
Sbjct: 397 LKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAIFYLSEVEQG 456
Query: 210 GETMFPF 216
G T FPF
Sbjct: 457 GATAFPF 463
>gi|448930897|gb|AGE54460.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus KS1B]
Length = 239
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 67/134 (50%), Gaps = 10/134 (7%)
Query: 89 NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+F S +C +I A K+ +K + ++ +R S T+ + E K I++
Sbjct: 57 DFLSDAECDILINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114
Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I++K ++ + + + E V RY+ GQ Y HYD + + P+ QRLA+ ++
Sbjct: 115 IQNKTRELLDSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173
Query: 202 YLSDVEEGGETMFP 215
YL +EGGET FP
Sbjct: 174 YLKAPKEGGETDFP 187
>gi|341614920|ref|ZP_08701789.1| hypothetical protein CJLT1_08183 [Citromicrobium sp. JLT1363]
Length = 210
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 69/150 (46%), Gaps = 17/150 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +P+ L L F AE C +I + + +PS LA G+ RTS
Sbjct: 17 MQRLPYDKLE----LLQKRGFLDAEFCAELITMIEAKRRPSTLADFDGDEF-----FRTS 67
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---- 185
+ + ++ L+ + +A + + +GE RY +GQ++ +H D FNP
Sbjct: 68 ETCDLPMDDPRSQRLDAM---LADLSGIDPAYGEPLQGQRYAVGQEFKAHCDYFNPDGQD 124
Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
+Y QR +F++YL++ E GG T F
Sbjct: 125 WEKYCSVAGQRTWTFMIYLNEPEAGGVTRF 154
>gi|323444954|gb|EGB01828.1| tentative prolyl-4-hydroxylase [Aureococcus anophagefferens]
Length = 230
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 63/150 (42%), Gaps = 13/150 (8%)
Query: 68 ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
AS+ VLSW PR + +IA + R S+L V + R
Sbjct: 11 ASVSRAHITVLSWEPRIHLHHGMLDENEIAHLIALSSSRFAASEL------EVGANDQRR 64
Query: 128 TSSGTFI-SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
TS + A E ++ ++ A +P +GEA V RY + Y +N
Sbjct: 65 TSESWMVPRAVERDDAVVAGAVKRMHAAVGIPPRYGEALQVARYNSTRNY------YNFH 118
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
+ + R+A+ L+ L +EGGET+FPF
Sbjct: 119 ADSAKSTARVATVLISLERADEGGETIFPF 148
>gi|94495931|ref|ZP_01302510.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
gi|94424623|gb|EAT09645.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
Length = 229
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 61/132 (46%), Gaps = 13/132 (9%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
+F S ++C + +PS L + + RTS +S + ++E I
Sbjct: 52 DFLSPDECAELRRLIDANAQPSTLF-----SGSANADYRTSHSGNLSPRDP---LVERIT 103
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM----SQRLASFLLYL 203
+I T LP +GE RY GQ+Y H D F A+Y +M QR + ++YL
Sbjct: 104 QRICALTGLPAINGETLQGQRYTPGQEYKVHCDYFPATADYWQRMRGTGGQRTWTAMIYL 163
Query: 204 SDVEEGGETMFP 215
S VE GGET FP
Sbjct: 164 SAVEAGGETHFP 175
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 71/137 (51%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V +T R S +++ ++D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTTASYRVSKSSWLEETDDP 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 --VVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|85708137|ref|ZP_01039203.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
gi|85689671|gb|EAQ29674.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
Length = 211
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 13/130 (10%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F S E C +I +PS +A G+ RTS + A+E T + EL E
Sbjct: 34 FISPELCAELIRLIDLGRRPSTIADANGDDY-----FRTSETCDLDANE--TAVKEL-EA 85
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYLS 204
+ + +GE RY++GQ++ +H D F P ++ + QR +F++YL+
Sbjct: 86 RFFSLNGIDPKYGEPVQGQRYDVGQEFKAHTDYFTPGGADFEKFCAESGQRTWTFMIYLN 145
Query: 205 DVEEGGETMF 214
+VE GG T F
Sbjct: 146 EVEAGGATRF 155
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 451
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 452 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 511
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 512 FLNYMSDVEAGGATVFP 528
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP 471
>gi|341893923|gb|EGT49858.1| CBN-PHY-3 protein [Caenorhabditis brenneri]
Length = 275
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 69/151 (45%), Gaps = 13/151 (8%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++SW P + + N + Q + +KR Q G+++E+T R ++G+
Sbjct: 41 VDMEIVSWSPTLVIYRNLLTPRQASEFLDFIEKRDMVMQKTSDNGKSIETTH--RRANGS 98
Query: 133 FIS--ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--- 187
FI A+E + + ++ +I L T+ E F+ L Y G Y HYD +
Sbjct: 99 FIDHGATEVTSEVHNWVQKRIP---ALNFTNAELFSALSYLPGGHYSVHYDYLSYRSEKE 155
Query: 188 ---YGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ M R+ + + L E+GG T+FP
Sbjct: 156 HDWWMKNMRNRIGTLIFVLKTAEKGGGTVFP 186
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 66/139 (47%), Gaps = 9/139 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
++ + H++ T L E V Y +G +Y+ H+D E R+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRV 456
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+FL Y+SDVE GG T+FP
Sbjct: 457 ATFLNYMSDVEAGGATVFP 475
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP 471
>gi|224006045|ref|XP_002291983.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972502|gb|EED90834.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 168
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 14/139 (10%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
NF + +C +I + L P+ + G+ +RTSS ++ A ED L
Sbjct: 8 NFLTPAECDFLIHASNDALGPAPVV---GKGAGEVSPSRTSSTCYL-AREDLPEYLR--- 60
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQRLASFLLYL 203
K+ T P H E V RY Q+Y H+DAF+ + + QR + L YL
Sbjct: 61 -KVGVLTGKPAEHCELPQVGRYLPSQQYLQHFDAFDLSNEDGRRFAANGGQRTITVLTYL 119
Query: 204 SDVEEGGETMFPFEVDLQI 222
+DV GG T FP +++Q+
Sbjct: 120 NDVPRGGATSFP-NLNIQV 137
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 453 FLNYMSDVEAGGATVFP 469
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S E+ +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+ ++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP 471
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 454 FLNYMSDVEAGGATVFP 470
>gi|156405954|ref|XP_001640996.1| predicted protein [Nematostella vectensis]
gi|156228133|gb|EDO48933.1| predicted protein [Nematostella vectensis]
Length = 182
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 11/97 (11%)
Query: 129 SSGTFISASED-KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
SS ++ ED K IL I + + E + +Y++GQKY HYD+
Sbjct: 9 SSSLYLKNKEDSKITILRDIAQLAGKLSNTQWRFAEPVALTKYKVGQKYSLHYDS----- 63
Query: 188 YGPQMSQR----LASFLLYLSDVEEGGETMFPFEVDL 220
G M+QR A+FL+YL+DV+ GGET+FP ++
Sbjct: 64 -GFLMNQRRVKRTATFLVYLNDVKSGGETIFPLATNI 99
>gi|448930198|gb|AGE53763.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus IL-3A]
gi|448931603|gb|AGE55164.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus MA-1E]
Length = 239
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 89 NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+F S +C +I A K+ +K + ++ +R S T+ + E K I++
Sbjct: 57 DFLSDAECDVLINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114
Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ K ++ + + + E V RY+ GQ Y HYD + + P+ QRLA+ ++
Sbjct: 115 IQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173
Query: 202 YLSDVEEGGETMFP 215
YL +EGGET FP
Sbjct: 174 YLKAPKEGGETDFP 187
>gi|406859917|gb|EKD12978.1| hypothetical protein MBM_08740 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 466
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/204 (24%), Positives = 84/204 (41%), Gaps = 30/204 (14%)
Query: 18 LLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV 77
L +C + L+ T VP++ L N S + E +
Sbjct: 11 LYICVYLPLSSIILGTSHKPAVPAVTSML---------NNSLIATNEALSCPSHKYNTHI 61
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
LS P +Y +F SAE+ + ++ ++ + + S + + G RTS + S
Sbjct: 62 LSLEPLVIYIEDFLSAEESRHLLEISEDKFEASTV----------STGARTSIQRDVRDS 111
Query: 138 E----DKTGILELIEHKIAR--ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
E D+ ++ IEH+ AR P H E RY +G Y HYD + +
Sbjct: 112 EVALVDRDDVVRCIEHR-ARDFQGWRPDLHIERLRTQRYGVGGHYAHHYDWSGAS----R 166
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
+ R+++F++Y+ EGG T FP
Sbjct: 167 DADRVSTFMVYVDADCEGGGTEFP 190
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 11/141 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 448
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 449 RIATWLFYMSDVSAGGATVFP 469
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 75/152 (49%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 338 LQPVRKEVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 394
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P
Sbjct: 395 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 452
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 453 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 484
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 55/102 (53%), Gaps = 13/102 (12%)
Query: 120 VESTKGTRTSSGTFISASEDKTGI--LELIEHKIARATMLPQT--HGEAFNVLRYEIGQK 175
++ RTS+ F+ ++TGI LE I + A T L T E V+ Y +G +
Sbjct: 361 IDQADVDRTSNSVFM----EETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQ 416
Query: 176 YDSHYDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y H D F+ AE G RLA+ L YL+DV++GG T+FPF
Sbjct: 417 YTPHCDYFDENAENG----DRLATVLFYLTDVQQGGATVFPF 454
>gi|448927821|gb|AGE51393.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CviKI]
Length = 239
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 89 NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+F S +C +I A K+ +K + ++ +R S T+ + E K I++
Sbjct: 57 DFLSDAECDVLINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114
Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
I+ K ++ + + + E V RY+ GQ Y HYD + + P+ QRLA+ ++
Sbjct: 115 IQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173
Query: 202 YLSDVEEGGETMFP 215
YL +EGGET FP
Sbjct: 174 YLKAPKEGGETDFP 187
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 383
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 384 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 439
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 440 --NRIATWLFYMSDVSAGGATVFP 461
>gi|392546611|ref|ZP_10293748.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas rubra ATCC 29570]
Length = 932
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 72/148 (48%), Gaps = 11/148 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
R + F S ++CQS+I + +PS + +QG +S +G RTS + +
Sbjct: 743 RAQIFRFEGALSEQECQSLITRLEGHFQPSTVVDQQG---DSNQGYRTSCTACLEQISHQ 799
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-----MSQR 195
I I++KI+ + + E+ + RY GQ+Y H D F+ + Q + QR
Sbjct: 800 ENI--KIKNKISALMGIDSAYLESLQIQRYGTGQEYKVHCDFFHESSPSYQKCIDSVGQR 857
Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ ++YL+D GG T F ++D+ ++
Sbjct: 858 TWTCVVYLNDDFSGGNTHFT-QLDITVT 884
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 85/199 (42%), Gaps = 22/199 (11%)
Query: 35 LSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI------PF--------QVLSW 80
L R+ P+I+PK V + E DA + I PF +++
Sbjct: 264 LIRNEPNIKPKPFNKSVGDFERGCRGEFPALTDAKLYCIYNTTSSPFLRLAPLKMELIGL 323
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
P + + + S + + AK LK + + T + K TRT+ + + ++
Sbjct: 324 DPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVK-TRTAKVAWFLDTFNQ 382
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----RL 196
+ E + +I T E V+ Y +G Y H+D FN P +SQ R+
Sbjct: 383 --LTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTT-NPHISQINGDRI 439
Query: 197 ASFLLYLSDVEEGGETMFP 215
A+ L YL+DVE+GG T+FP
Sbjct: 440 ATVLFYLNDVEQGGATVFP 458
>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 531
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 64/142 (45%), Gaps = 16/142 (11%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS-ASE 138
P L F + E + I A RL+PS++ + Q T R S F A E
Sbjct: 342 PDVLVFHEMITEEVAEKIRDVANPRLRPSEVIDPIIQKHV---TASYRVSKNVFFDDAFE 398
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------EYGPQM 192
++ I + + AT L E V Y +G +Y+ H D +P E+G
Sbjct: 399 EELEISRKLRPLVEDATDLNDDFSEQLQVNNYGLGGQYEFHVDFGDPGSPLDKHEHG--- 455
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
R+A+ L+YLSDVE GG+T+F
Sbjct: 456 -NRIATLLIYLSDVERGGDTVF 476
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 68/138 (49%), Gaps = 9/138 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
PR + + N S E+ I AK +L ++ +R +T V S R S ++ ++D
Sbjct: 338 PRIVRYLNALSDEEIAKIKELAKPKL--ARATVRDPKTGVLSVANYRVSKSAWLEENDDP 395
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 396 --VIARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTDGNRLA 452
Query: 198 SFLLYLSDVEEGGETMFP 215
+FL Y+SDVE GG T+FP
Sbjct: 453 TFLNYMSDVEAGGATVFP 470
>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
Length = 536
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 70/148 (47%), Gaps = 7/148 (4%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF++ LS P + + N S + + + +K L+ +++ R + + +R++ G
Sbjct: 313 PFRMEELSLDPYVVLYHNVLSDPEIEKLKPMSKPFLERAKV-FRVEKGSDEIAPSRSADG 371
Query: 132 TFISASE---DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAE 187
++ + D +L I +I T L G L+Y G + HYD FN
Sbjct: 372 AWLPHQDTDPDDLEVLRRIGRRIKDLTGLNTRSGSQMQFLKYGFGGHFVPHYDYFNSKTS 431
Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFP 215
Y ++ R+A+ L YL++V+ GG T FP
Sbjct: 432 YLERVGDRIATVLFYLNNVDHGGATAFP 459
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 273 LQPVRKEVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 329
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 330 KSAWLKDTADP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 387
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 388 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 419
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 68/143 (47%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP + + + S ++ +++ AK RL+ + ++ +E T R S +++ E
Sbjct: 348 RPYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLE-TASYRISKSAWLTGYEHP 406
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++E+I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 407 --VIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 461
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 462 -NRIATWLFYMSDVAAGGATVFP 483
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
Length = 550
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
T R S ++ ED+ ++E + + A T L E V+ Y IG Y+ H+D
Sbjct: 386 TANYRISKSAWLKTQEDR--VIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDF 443
Query: 183 FNPAEY----GPQMSQRLASFLLYLSDVEEGGETMF 214
E G + R+A+ L Y+SDVE+GG T+F
Sbjct: 444 ARKEEERAFEGINLGNRIATVLFYMSDVEQGGATVF 479
>gi|428184271|gb|EKX53127.1| hypothetical protein GUITHDRAFT_53615, partial [Guillardia theta
CCMP2712]
Length = 110
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/64 (48%), Positives = 39/64 (60%), Gaps = 5/64 (7%)
Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMSQRLASFLLYLSDVEE--GGETMFPFEV 218
EA V+ YE GQ+YD H D F P Y ++ QR+ SF YLS+V E GG T FP E+
Sbjct: 34 EAVQVVHYERGQRYDVHNDWFQPGTPYYHDRVWQRIISFFCYLSEVPEGQGGCTFFP-EL 92
Query: 219 DLQI 222
DL+
Sbjct: 93 DLRF 96
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
T R S ++ ED+ ++E + + A T L E V+ Y IG Y+ H+D
Sbjct: 386 TANYRISKSAWLKTQEDR--VIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDF 443
Query: 183 FNPAEY----GPQMSQRLASFLLYLSDVEEGGETMF 214
E G + R+A+ L Y+SDVE+GG T+F
Sbjct: 444 ARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVF 479
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 68/138 (49%), Gaps = 9/138 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK RL ++ +R +T V + R S +++ ED
Sbjct: 342 PHIVRYYDVMSDEEIEKIKQLAKPRL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 397
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 398 DPVVAKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 456
Query: 198 SFLLYLSDVEEGGETMFP 215
+FL Y+SDVE GG T+FP
Sbjct: 457 TFLNYMSDVEAGGATVFP 474
>gi|163955099|ref|YP_001648203.1| hypothetical protein OsV5_126r [Ostreococcus virus OsV5]
gi|163638548|gb|ABY27907.1| hypothetical protein OsV5_126r [Ostreococcus virus OsV5]
Length = 206
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 63/126 (50%), Gaps = 11/126 (8%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
F + E+ + II A+K+L+ S +A E K R S ++ AS+ +++ +
Sbjct: 28 FITEEERKHIIRKAQKKLEVSTVA----ENRVVDKKIRDSETAWLDASD---PVVKRVME 80
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
K T P + E VLRY+ G Y H D F+ + ++R+ + +L L+D EG
Sbjct: 81 KCVSLTDRPLVNCEHIQVLRYKPGGHYSPHQDTFSDT----KGNKRMYTVILALNDDYEG 136
Query: 210 GETMFP 215
GET FP
Sbjct: 137 GETEFP 142
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + + S + +++ AK RL S+ + ET + +T R S ++S ED
Sbjct: 293 KPRIVRYHEIISDAEIETVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 350
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 351 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 406
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 407 --NRIATWLFYMSDVSAGGATVFP 428
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 11/141 (7%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 307 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVH-YRISKSAWLSGYEDP 365
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
++ I +I T L + E V Y +G +Y+ H+D E P Q
Sbjct: 366 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 421
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 422 RIATWLFYMSDVSAGGATVFP 442
>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
Length = 550
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
T R S ++ ED+ ++E + + A T L E V+ Y IG Y+ H+D
Sbjct: 386 TANYRISKSAWLKTQEDR--VIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDF 443
Query: 183 FNPAEY----GPQMSQRLASFLLYLSDVEEGGETMF 214
E G + R+A+ L Y+SDVE+GG T+F
Sbjct: 444 ARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVF 479
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
PR + + + S E+ + I AK RL ++ +R +T V + R S ++ +D
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTVANYRVSKSAWLEEYDDP 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ T L + E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 --VIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTEGNRLA 450
Query: 198 SFLLYLSDVEEGGETMFP 215
++L Y+SDVE GG T+FP
Sbjct: 451 TYLNYMSDVEAGGATVFP 468
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +VL P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 324 LQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 380
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 381 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 438
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 439 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 470
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 68/142 (47%), Gaps = 15/142 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + + + S + + + AK RL+ + ++ +E T R S +++A ED
Sbjct: 345 RPRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLE-TAHYRISKSAWLTAYEDP 403
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
+++ I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 404 --VVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 458
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
R+A++L+Y+SDV GG T+F
Sbjct: 459 -NRIATWLIYMSDVPSGGATVF 479
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +VL P + + +F S + Q I A L+ S +A GE + + R S
Sbjct: 311 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 367
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D IL ++ +IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 368 KSAWLKDTVDP--ILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 425
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y R+A+F++YLS VE GG T F +
Sbjct: 426 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIY 457
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 76/146 (52%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+++S +P + + +F + + + I + A L+ S +A GE ++T R S ++
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVA--AGEK-QATADYRISKSAWLK 369
Query: 136 ASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
S I+ ++ +I+ T L H GE V+ Y IG Y+ H+D A +P+ +
Sbjct: 370 GS--AQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+F++YLS VE GG T F +
Sbjct: 428 KTGNRVATFMIYLSPVEAGGSTAFIY 453
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 44 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 101
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 102 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 157
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 158 --NRIATWLFYMSDVSAGGATVFP 179
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 69/146 (47%), Gaps = 8/146 (5%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L+ P YF ++ + + II ++ S++ Q T T RTS+
Sbjct: 329 PFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQSQNAT---TSEIRTSAN 385
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
T++ +E+ L I+ ++ T L E ++ Y IG +Y+ H+D P +
Sbjct: 386 TWLWYNENPW--LSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEPHFDFVEEPQKVFG 443
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
R+ + L Y++DV GG T FPF
Sbjct: 444 WKGNRMLTALFYINDVALGGATAFPF 469
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 209
++ AT L T E V Y +G Y+ H+D F + + P R+A+ + YLSDVE+G
Sbjct: 394 VSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 453
Query: 210 GETMFPF 216
G T FPF
Sbjct: 454 GATAFPF 460
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 50 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 109
Query: 213 MFPF 216
FPF
Sbjct: 110 AFPF 113
>gi|224013902|ref|XP_002296615.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968967|gb|EED87311.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 490
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 51/98 (52%), Gaps = 6/98 (6%)
Query: 122 STKGTRTSSGTFI---SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
S +RTS T+ S ED + +IE +IA T + E +L+YE GQ Y
Sbjct: 334 SVSESRTSHNTWCQEPSCYEDPL-VAPIIE-RIANVTGTKVENSEYLQMLQYEPGQYYRQ 391
Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
H+D + P +G R+ + +YL+DVEEGG T FP
Sbjct: 392 HHD-YIPHHFGMPCGVRILTLFIYLNDVEEGGGTSFPL 428
>gi|363543307|ref|NP_001241869.1| prolyl 4-hydroxylase 6-4 precursor [Zea mays]
gi|347978826|gb|AEP37755.1| prolyl 4-hydroxylase 6-4 [Zea mays]
Length = 145
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 52/89 (58%), Gaps = 5/89 (5%)
Query: 78 LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
LS RPRA + F S +C I++ AK ++ S +A G++V S RTSSGTF++
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVAS--QARTSSGTFLAK 95
Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAF 165
ED+ I+ IE ++A T LP+ + E+
Sbjct: 96 REDE--IVSAIEKRVAAWTFLPEENAESL 122
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 13/142 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ + V RT+ G ++ ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 70/137 (51%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLEEEDDP 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ + T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 --VVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 13/142 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + SA + +I A + +K +++ + V RT+ G ++ ++
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
+ + I +I T E F V+ Y IG Y H D F+ A Y +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 336 PHIVRYYEVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + H++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 --VVARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFD-FSRRPFDITLKTEGNRLA 450
Query: 198 SFLLYLSDVEEGGETMFP 215
+FL Y+SDVE GG T+FP
Sbjct: 451 TFLNYMSDVEAGGATVFP 468
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 66/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RP + + + S + I AK RL+ + ++ +E T R S +++A ED
Sbjct: 350 RPYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLE-TAPYRISKSAWLTAYEDP 408
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 409 --VVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 463
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 464 -NRIATWLFYMSDVSAGGATVFP 485
>gi|72000637|ref|NP_507251.2| Protein PHY-3 [Caenorhabditis elegans]
gi|30145726|emb|CAB04736.2| Protein PHY-3 [Caenorhabditis elegans]
Length = 318
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 9/149 (6%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
+ +++SW P + + N S Q S + ++R Q G ++E+T R ++G+
Sbjct: 83 VDMEIISWAPTLVIYRNLMSPRQTASFLNFIEQRDLEIQKTSDFGTSIETTH--RRANGS 140
Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYG 189
FI ED +E+ R L T E F+ L Y G Y HYD + +Y
Sbjct: 141 FIPP-EDSNVTVEIKMQAQKRIPGLNLTVAEHFSALSYLPGGHYAVHYDYLDYRSKQDYD 199
Query: 190 PQMSQ---RLASFLLYLSDVEEGGETMFP 215
M++ R+ + + L E+GG T+FP
Sbjct: 200 WWMNKTGNRIGTLIFVLKPAEKGGGTVFP 228
>gi|424864983|ref|ZP_18288870.1| oxidoreductase, 2OG-Fe(II) oxygenase family [SAR86 cluster
bacterium SAR86B]
gi|400759083|gb|EJP73273.1| oxidoreductase, 2OG-Fe(II) oxygenase family [SAR86 cluster
bacterium SAR86B]
Length = 418
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 7/132 (5%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
PNF + E+C +I S + + G RTSS + + + I++
Sbjct: 4 IPNFITHEECDELIKMIDAHHTRSSVVV-GGTDRSDITDHRTSSTSNLDPN---NVIIKS 59
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFLLYL 203
I KIA L + GE+ YE GQ + H+D F Y R + ++YL
Sbjct: 60 IHKKIADHLNLSISKGESLQGQLYEPGQYFKPHHDFFTGPAYDMHCLASGNRTHTLMIYL 119
Query: 204 SDVEEGGETMFP 215
+D EGGET FP
Sbjct: 120 NDDFEGGETNFP 131
>gi|149185530|ref|ZP_01863846.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
gi|148830750|gb|EDL49185.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
Length = 241
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 14/142 (9%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
P F + +C +I + L PS L + + G RTS + SA E + L
Sbjct: 53 MPGFLAPAECTRLIELIESNLLPSPLF-----SDPTGTGARTSQTHYFSAEEPEVAALGA 107
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLL 201
K+ L + E RY++GQ+Y H D F Q+ QR S ++
Sbjct: 108 ---KLDDLLGLERRQAETVQGQRYDVGQEYRHHRDFFRVEREHWQLERRRGGQRTWSAMV 164
Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
YL+ +E GGET FP +DL ++
Sbjct: 165 YLNAIEAGGETDFPV-LDLSVA 185
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 6/152 (3%)
Query: 69 SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-R 127
S + + + L P + N S + + A+ ++ SQ+ + E+ R
Sbjct: 316 SYEVLELEELHQDPFVVQVHNIVSQKDMNLLQKIARPNIQRSQVYAQDHNANETVAAAYR 375
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPA 186
TS G E ++ +EL+ +A + L E + Y IG Y+ H+D F +
Sbjct: 376 TSKGATFEYFEHRS--MELLSRHVADLSGLDMNSAELLQIANYGIGGHYEPHWDCFPDHH 433
Query: 187 EYGP--QMSQRLASFLLYLSDVEEGGETMFPF 216
Y P + R+A+ + YLS+VE GG T FPF
Sbjct: 434 VYLPDDRDGNRIATGIYYLSEVEAGGGTAFPF 465
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 394 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 454 FLNYMSDVEAGGATVFP 470
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 213 MFPF 216
FPF
Sbjct: 460 AFPF 463
>gi|448928822|gb|AGE52391.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CvsA1]
Length = 239
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 12/135 (8%)
Query: 89 NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
+F S +C +I A K+ +K + ++ +R S T+ + E + +++
Sbjct: 57 DFLSDAECDVLINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHE--VIDK 114
Query: 147 IEHKIARATMLPQTHG------EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
I++K R + + H E V RY+ GQ Y HYD + + P+ QRLA+ +
Sbjct: 115 IQNK-TRELLNNKKHCIDKYIFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLM 172
Query: 201 LYLSDVEEGGETMFP 215
+YL EEGGET FP
Sbjct: 173 VYLKAPEEGGETDFP 187
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 213 MFPF 216
FPF
Sbjct: 460 AFPF 463
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459
Query: 213 MFPF 216
FPF
Sbjct: 460 AFPF 463
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 406 FLNYMSDVEAGGATVFP 422
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 406 FLNYMSDVEAGGATVFP 422
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 75/152 (49%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEK-QLQVDYRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 302 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 359
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 360 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 415
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 416 --NRIATWLFYMSDVSAGGATVFP 437
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 394
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 395 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 450
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 451 --NRIATWLFYMSDVSAGGATVFP 472
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 6/93 (6%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFN 184
R S ++ +D+ I+ + +I T L T E VL Y +G +Y+ H+D
Sbjct: 126 RISQQAWLHDKDDE--IVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYMT 183
Query: 185 PAE--YGPQMSQRLASFLLYLSDVEEGGETMFP 215
E +G + R+A+FL+YLSDV GG T+FP
Sbjct: 184 AEEKMWGTILGNRMATFLMYLSDVTAGGATVFP 216
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 149 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 208
Query: 213 MFPF 216
FPF
Sbjct: 209 AFPF 212
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P Y + R+A+ + YLS+VE+GG T
Sbjct: 330 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 389
Query: 213 MFPF 216
FPF
Sbjct: 390 AFPF 393
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 345
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 346 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 401
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 402 --NRIATWLFYMSDVSAGGATVFP 423
>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
Length = 421
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 74/179 (41%), Gaps = 30/179 (16%)
Query: 60 LPHGETGD--------ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ 111
P E GD A+ SI +V S PR L +F + E+C +I++AK + S
Sbjct: 172 FPTCEVGDTFDIPLPSANRNSIA-KVRSISPRVLEVEDFLTPEECHELISSAKPLMSRST 230
Query: 112 LALRQGETVESTKGTRTSSGTFISASE--------DKTGILELI-----EHKIARATMLP 158
++ V + +RTSS ++ D+ L I EH +
Sbjct: 231 VSAEGDSAVSLQESSRTSSTAWLPPHSHTLANKLYDRVSSLVGIDFRKHEHVVVEDLQAI 290
Query: 159 QTHG----EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEG 209
G A+ VLRYE+ Q Y H+D F+P + + R + YL+DVE G
Sbjct: 291 DKRGGSSVTAWQVLRYEVNQHYHIHHDYFDPVLHRGFLQGDGRNRFITAFFYLTDVERG 349
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
AT L T E V Y +G Y+ H+D F +P+ Y R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGNRIATAIFYLSEVEQGGAT 459
Query: 213 MFPF 216
FPF
Sbjct: 460 AFPF 463
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 338 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 394 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 452
Query: 198 SFLLYLSDVEEGGETMFP 215
+FL Y+SDVE GG T+FP
Sbjct: 453 TFLNYMSDVEAGGATVFP 470
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 46/89 (51%), Gaps = 3/89 (3%)
Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
RTS GT+I D + + IE +I L + E F V+ Y +G Y +H D +
Sbjct: 345 RTSKGTWIE--RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDT 402
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
+ R+A+ L YL+DVE+GG T+F
Sbjct: 403 WADKKEEDDRIATVLFYLTDVEQGGATVF 431
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 66/147 (44%), Gaps = 9/147 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A++F + + E+ I A RL+ + + +E T RTS
Sbjct: 322 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + I+ I +I T L Q E V Y IG YD H+D E
Sbjct: 381 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438
Query: 192 MS----QRLASFLLYLSDVEEGGETMF 214
S RLA+ L Y++ E GG T+F
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF 465
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F S + + + AK RL S+ + ET + +T R S ++S ED
Sbjct: 185 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 242
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 243 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 298
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 299 --NRIATWLFYMSDVSAGGATVFP 320
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 66/147 (44%), Gaps = 9/147 (6%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A++F + + E+ I A RL+ + + +E T RTS
Sbjct: 323 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 381
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ E + I+ I +I T L Q E V Y IG YD H+D E
Sbjct: 382 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 439
Query: 192 MS----QRLASFLLYLSDVEEGGETMF 214
S RLA+ L Y++ E GG T+F
Sbjct: 440 QSLNTGNRLATLLFYMTQPESGGATVF 466
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ ED
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
++ + ++ + T L E V Y +G +Y+ H+D F+ + + RLA
Sbjct: 393 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 451
Query: 198 SFLLYLSDVEEGGETMFP 215
+FL Y+SDVE GG T+FP
Sbjct: 452 TFLNYMSDVEAGGATVFP 469
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 9/150 (6%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE---TVESTKGT-R 127
I ++LS P + + S E+ + +K + PS+ A L E E G+ R
Sbjct: 322 IKTEILSIDPFIVLLHDMVSVEEGALLRTFSKNMISPSETAELSDSEEKSIFEFEVGSFR 381
Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NP 185
TS ++ ++ + + ++ AT L +H E F V+ Y IG ++SH+D +
Sbjct: 382 TSKSVWLDNDANEATLK--LTQRLGDATGLDISHSEPFQVINYGIGGIFESHFDTSLQDE 439
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ RLA+ L YL+DV +GG T FP
Sbjct: 440 NRFLDGYMDRLATTLFYLNDVPQGGATHFP 469
>gi|219113801|ref|XP_002186484.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583334|gb|ACI65954.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 431
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 75/159 (47%), Gaps = 13/159 (8%)
Query: 65 TGDASIQSIP----FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
+ D + ++P F++L P L +F +A +C +A A + K + L R
Sbjct: 185 SADLPLDALPPFEKFRILHVDPLVLGIDDFFTAAECDRYVALATQNQKHNILESRSPTVG 244
Query: 121 E--STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
+ + K RTS+ T+ + E G+ EL+ + E +RY +K+
Sbjct: 245 KDAAAKAQRTST-TYYNHFE---GVPELMAKASRLLGLQSIDRWEEPQTVRYRKNEKFTW 300
Query: 179 HYDAFNPAEYGPQM-SQRLASFLLYLSDV--EEGGETMF 214
H DA P E P + QRLA+ L+Y +D+ +EGG TMF
Sbjct: 301 HLDALGPNENNPGLGGQRLATLLVYHTDLQSDEGGATMF 339
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 7/94 (7%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
TRTS T+++ S + + + +I+ T E V+ Y +G YD H+D FN
Sbjct: 372 TRTSKVTWLTDSLNPLTVR--LNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFN- 428
Query: 186 AEYGPQMSQ----RLASFLLYLSDVEEGGETMFP 215
A +++ R+A+ L YL+DVE+GG T+FP
Sbjct: 429 ATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFP 462
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 71/160 (44%), Gaps = 12/160 (7%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
H T I + Q ++ P + + + S ++ +II+ +K + S + + V
Sbjct: 296 HNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDDHEKAVS 355
Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
T RTSS ++ + ++ + + T L T E V Y IG Y HYD
Sbjct: 356 KT---RTSSNAWLD--DVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYD 410
Query: 182 AFNPAEYGPQM------SQRLASFLLYLSDVEEGGETMFP 215
+ AE G ++ R+A+ + YLSDV GG T+FP
Sbjct: 411 -YAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFP 449
>gi|431904119|gb|ELK09541.1| Prolyl 4-hydroxylase subunit alpha-1 [Pteropus alecto]
Length = 507
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|260665980|ref|YP_003212934.1| hypothetical protein H665_p111 [Ostreococcus tauri virus 1]
gi|260160998|emb|CAY39699.1| hypothetical protein OTV1_111 [Ostreococcus tauri virus 1]
Length = 185
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 65/134 (48%), Gaps = 13/134 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + F + E+ + II A+K+L+ S +A E K R S ++ +D
Sbjct: 10 PRVI--KEFITEEERKHIIRKAQKKLEVSTVA----ENRVVDKKVRDSETAWL---DDSD 60
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
+++ + K T P + E VLRY+ G Y H D F+ + ++R+ + +L
Sbjct: 61 PVVKRVMEKCVSLTDRPLVNCEHIQVLRYKPGGHYSPHQDTFSDT----KGNKRMYTVIL 116
Query: 202 YLSDVEEGGETMFP 215
L+D EGGET FP
Sbjct: 117 GLNDDYEGGETEFP 130
>gi|384252051|gb|EIE25528.1| hypothetical protein COCSUDRAFT_40745 [Coccomyxa subellipsoidea
C-169]
Length = 456
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 64/141 (45%), Gaps = 32/141 (22%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
++S PR L +F S +CQ +A A+ L+ S +A G V S RTSS F+
Sbjct: 283 ISLVSTSPRMLMIKDFLSPRECQEFMAVAEPLLQNSLVAC--GSCVPS----RTSSSMFL 336
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
++ + IE+ R E GQ YD+H+D + +
Sbjct: 337 QGRHEREPCV--IEYD------------------RKE-GQFYDAHFD-----NKSNECWR 370
Query: 195 RLASFLLYLSDVEEGGETMFP 215
R A+ + YLSDVEEGG T FP
Sbjct: 371 RAATLITYLSDVEEGGATFFP 391
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 66/154 (42%), Gaps = 17/154 (11%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
I I ++L + P A+ F N S + + I A +LK + TV+++K
Sbjct: 313 IAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRA--------TVQNSKTGELE 364
Query: 130 SGTF-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
T+ IS S D +++ + +I T L Q E V Y +G YD H+D
Sbjct: 365 HATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFAR 424
Query: 185 PAEYGP----QMSQRLASFLLYLSDVEEGGETMF 214
E R+A+ L Y+S E GG T+F
Sbjct: 425 KEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF 458
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 71/154 (46%), Gaps = 15/154 (9%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q +V+ RP F + S ++ Q++I + RLK + + + +E R S
Sbjct: 314 LQPSKLEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELE-VANYRIS 372
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
++ + + ++E + + T L E V+ Y IG Y++H+D
Sbjct: 373 KSAWLKNHDHE--VVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKD 430
Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
AF G R+A+++ Y+SDV+ GG T+FP
Sbjct: 431 AFKQLGTG----NRIATWINYMSDVKAGGATVFP 460
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 62/121 (51%), Gaps = 19/121 (15%)
Query: 95 QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
+C +++ + ++ S LA T G R SS I ED ++ IE +I+
Sbjct: 2 ECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI---EDI--VVSKIEDRISLW 47
Query: 155 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
+ LP+ +GE+ VL+Y + + + + RLA+ L+YLSDV++GGET+F
Sbjct: 48 SFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVF 102
Query: 215 P 215
P
Sbjct: 103 P 103
>gi|224001508|ref|XP_002290426.1| prolyl 4-hydroxylase alpha subunit-like protein [Thalassiosira
pseudonana CCMP1335]
gi|220973848|gb|EED92178.1| prolyl 4-hydroxylase alpha subunit-like protein, partial
[Thalassiosira pseudonana CCMP1335]
Length = 180
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 62/137 (45%), Gaps = 13/137 (9%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + NF S+ + + ++ S + + ++ TRTSS +
Sbjct: 1 PWVITLENFVSSTEMEILMEWG------SHFGYDRSQAGDTISETRTSSQAWCFNDCYDD 54
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLAS 198
I++ + +I T +P+ + EA +L Y+ GQ Y H D +GP RL +
Sbjct: 55 PIVKGLRQRIHEVTGIPEKNYEALQLLHYDEGQFYKPHDDFIRKHVVQGHGP----RLLT 110
Query: 199 FLLYLSDVEEGGETMFP 215
F +Y ++VEEGG T FP
Sbjct: 111 FFMYFNEVEEGGGTRFP 127
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 75/150 (50%), Gaps = 12/150 (8%)
Query: 74 PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF +V+ P + + +F + + Q I A+ L+ S +A GE + R S
Sbjct: 336 PFRKEVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVA--SGEK-QLQVDYRISKS 392
Query: 132 TFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE- 187
++ S D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 393 AWLKDSVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSP 450
Query: 188 -YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 451 LYRMKSGNRVATFMIYLSAVEAGGATAFIY 480
>gi|323448532|gb|EGB04429.1| hypothetical protein AURANDRAFT_67177 [Aureococcus anophagefferens]
Length = 327
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 46/92 (50%), Gaps = 3/92 (3%)
Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
G RTS ++ + D L+ A P+ EA V RYE G K+ +H DA
Sbjct: 172 GGRTSEAVALNGAGDAALAAPLVARLAAALGASPRDF-EAPVVTRYERGAKFATHNDASA 230
Query: 185 PAE--YGPQMSQRLASFLLYLSDVEEGGETMF 214
E +G QRLA+ +LYL+DV GGET F
Sbjct: 231 DPERDWGDAGGQRLATVILYLNDVAAGGETSF 262
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 65/142 (45%), Gaps = 15/142 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
RPR + + S + +++ AK RL+ + ++ +E T R S ++S E
Sbjct: 336 RPRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLE-TAPYRISKSAWLSGYEHS 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
T +E I +I T L E V Y +G +Y+ H+D AF G
Sbjct: 395 T--IERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449
Query: 193 SQRLASFLLYLSDVEEGGETMF 214
R+A++L Y+SDV GG T+F
Sbjct: 450 -NRIATWLFYMSDVSAGGATVF 470
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 179 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 236
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 237 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 292
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 293 --NRIATWLFYMSDVSAGGATVFP 314
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I ++L P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 65/127 (51%), Gaps = 7/127 (5%)
Query: 92 SAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHK 150
S E+ + I AK +L ++ +R +T V + R S +++ ED ++ + +
Sbjct: 2 SDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRR 57
Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLASFLLYLSDVEE 208
+ T L E V Y +G +Y+ H+D + P + G + RLA+FL Y+SDVE
Sbjct: 58 MQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEA 117
Query: 209 GGETMFP 215
GG T+FP
Sbjct: 118 GGATVFP 124
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 53.1 bits (126), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 159
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 160 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 215
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 216 --NRIATWLFYMSDVSAGGATVFP 237
>gi|345324764|ref|XP_001505668.2| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
[Ornithorhynchus anatinus]
Length = 495
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 53/115 (46%), Gaps = 17/115 (14%)
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---HGEAFNVLRYEIGQ 174
+ V+ + R S T++ E ++ I+ ++ R T LPQ H E V+RY+ G
Sbjct: 259 QKVKMSDLVRNSQHTWLYQGEGAHQVMRSIQQRVLRLTRLPQEIVEHSEPLQVVRYDQGG 318
Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP 215
Y +H D+ F E P + S R + L YL++V GGET FP
Sbjct: 319 HYHAHMDSGPVFPETACSHTKFITNETAPFETSCRYVTVLFYLNNVTGGGETTFP 373
>gi|355752458|gb|EHH56578.1| hypothetical protein EGM_06023, partial [Macaca fascicularis]
Length = 586
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 321 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 377
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 378 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 435
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 436 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 467
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 161 HG-EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
HG E ++ Y +G YD HYD FN + R+A+ L YL+DVE+GG T+FP
Sbjct: 407 HGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRIATVLFYLTDVEQGGATVFP 464
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 66/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F S + + + AK RL+ + ++ +E T R S ++S ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYEDP 395
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 396 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 450
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 451 -NRIATWLFYMSDVSAGGATVFP 472
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 22/146 (15%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+P+ N + + + I A+ RL+ ++ VES T G S K
Sbjct: 321 KPKLWVLHNILTDPEMEVIKKLAQPRLRRAR--------VESPT---TGEGELASYRISK 369
Query: 141 TGILELIEHKIAR--------ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YG 189
+ L EH++ R T L E V+ Y IG Y+ H+D E
Sbjct: 370 SAWLYDWEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALD 429
Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
P R+A+ L Y+SDVE GG T+FP
Sbjct: 430 PNEGDRIATMLFYMSDVEAGGATVFP 455
>gi|241044301|ref|XP_002407178.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215492128|gb|EEC01769.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 554
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 16/165 (9%)
Query: 63 GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVE 121
G+ G +Q I + + +P + N + ++A AK RL+ S +R E +
Sbjct: 316 GQHGFFKLQPIKVEEANLKPYIVVMHNVIQDRDIEDLMAFAKPRLQRSTHYGVRGMEASQ 375
Query: 122 STKGTRTSSGTFISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
RTSS +++ + T + + + T E + + Y IG +Y SH
Sbjct: 376 ----VRTSSNAWLNDLDAPVATRLNRFLRSLLGLGTTYLGGEAEQYQLANYGIGGQYMSH 431
Query: 180 YDAFNPAEYGPQM---------SQRLASFLLYLSDVEEGGETMFP 215
+D + P R+A+ ++Y+SDVEEGG T+FP
Sbjct: 432 HDYLQDTYHIPNRVTDDFEKTSGDRIATLMVYMSDVEEGGATVFP 476
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V+ RP + +F S + Q I A+ L+ S +A GE + R S ++
Sbjct: 305 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 361
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
+ D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+ Y
Sbjct: 362 DTVDP--MLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 419
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+F++YLS VE GG T F +
Sbjct: 420 KSGNRVATFMIYLSAVEAGGATAFIY 445
>gi|347527209|ref|YP_004833956.1| putative oxygenase [Sphingobium sp. SYK-6]
gi|345135890|dbj|BAK65499.1| putative oxygenase [Sphingobium sp. SYK-6]
Length = 229
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 54/131 (41%), Gaps = 14/131 (10%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
S +C +I KPS+L + G RTSS + + + I
Sbjct: 49 LVSPAECAKLIDKIDAGCKPSKL------FSGTASGYRTSSSCNLDIYDP---FVIAITK 99
Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM----SQRLASFLLYLS 204
+I + HGE RYE Q+Y H D F Y P M QR + + YL
Sbjct: 100 RIDALMGMEGDHGELLQGQRYEQTQQYHLHCDYFPGNVHYWPAMRVSGGQRCWTAMAYLC 159
Query: 205 DVEEGGETMFP 215
DVEEGGET FP
Sbjct: 160 DVEEGGETHFP 170
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP 469
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 71/146 (48%), Gaps = 10/146 (6%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+ LS +P + + +F S + + I A+ L+ S +A R + T R S ++
Sbjct: 81 ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQV---TAEYRISKSAWLK 137
Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
S + ++ +I+ T L HGE V+ Y IG Y+ H+D A +P+ +
Sbjct: 138 GSAQSA--VSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 195
Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
+ R+A+ ++YLS VE GG T F +
Sbjct: 196 KTGNRVATVMIYLSSVEAGGSTAFIY 221
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL+ + + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLR--RATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469
>gi|372272594|ref|ZP_09508642.1| Procollagen-proline dioxygenase [Marinobacterium stanieri S30]
Length = 217
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 74/149 (49%), Gaps = 16/149 (10%)
Query: 75 FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
++L+ P + + + ++ + +I A+++L ++++ V T G R+ ++
Sbjct: 1 MEILNHHPLVMTLDHILTPKEAKGLIDLAREKLDRAKVSFDDAYGV--TDG-RSGQNCWL 57
Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN--------PA 186
++ + + + +IA+ +P + E+ VL Y Q+Y +HYDA++
Sbjct: 58 RYAD--YPLAKQVGDRIAKLAGIPLENAESLQVLHYGPEQEYRAHYDAYDLSTARGQRCC 115
Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
YG QRL + L+YL+ VE GG T FP
Sbjct: 116 RYG---GQRLVTALVYLNAVEAGGGTAFP 141
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 69/137 (50%), Gaps = 7/137 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
P + + + S E+ + I AK +L ++ +R +T V + R S +++ +D
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
++ + ++ T L E V Y +G +Y+ H+D + P + G + RLA+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452
Query: 199 FLLYLSDVEEGGETMFP 215
FL Y+SDVE GG T+FP
Sbjct: 453 FLNYMSDVEAGGATVFP 469
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 66/142 (46%), Gaps = 6/142 (4%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+V++ P + + AS + +I + ++ S + + V + RTS ++++
Sbjct: 314 EVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMVGDAAKKEVSKS---RTSQNSWLT 370
Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
D + L A L +T E+ V Y IG Y HYD P+++
Sbjct: 371 -DYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGIGGHYLPHYDWSREENPYPELNTG 429
Query: 194 QRLASFLLYLSDVEEGGETMFP 215
R+A+ + YLSDVEEGG T+FP
Sbjct: 430 NRIATLMFYLSDVEEGGATVFP 451
>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
pulchellus]
Length = 548
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 68/165 (41%), Gaps = 14/165 (8%)
Query: 62 HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
+G G +Q + + + +P + F + ++A A RL S GE
Sbjct: 316 YGRNGFLRLQPVKIEEANLKPYIITFHDIIGDRDINDLLAYATPRLFRST---HYGEHGT 372
Query: 122 STKGTRTSSGTFISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
T RTSS ++ + T + +E + + + E + + Y +G +Y +H
Sbjct: 373 ETSLIRTSSTAWLGDQDAPVATRLNRFVESLLGLGSQYLKGEAEYYQLANYGVGGQYIAH 432
Query: 180 YDAFNPAEYGPQ---------MSQRLASFLLYLSDVEEGGETMFP 215
+D P R+A+ + YLSDVEEGG T+FP
Sbjct: 433 HDFLADIYADPNRKLDDFERSAGDRIATLMFYLSDVEEGGATVFP 477
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 325 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 381
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 382 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 439
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 440 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 471
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 55/103 (53%), Gaps = 10/103 (9%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
+RT+S +I D + I+ + + + +P + E F ++ Y G +Y H+DAF+
Sbjct: 39 SRTNSYAWIQ--HDASEIIHEVSKRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDK 96
Query: 186 A-EYGPQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
+ E G QR+ + L YL+DVE+GG T FP D+ +S
Sbjct: 97 STEEGRNNWFPGGQRMVTALAYLNDVEDGGATDFP---DIHVS 136
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 268 LQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+I T L + E V+ Y IG Y+ H+D A +P
Sbjct: 325 KSAWLKDTVDP--LLVNLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 382
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 383 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 414
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 69/149 (46%), Gaps = 8/149 (5%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
++ + +VL P + + ++ + II AK L+ + + + G+ + + R
Sbjct: 321 LKPLKMEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYA--DYRV 378
Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNP 185
S T+I+ ED I I ++ T L + E V Y I +Y+ H+D P
Sbjct: 379 SKNTWIA--EDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRP 436
Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
+ R+A+ LLYLSDV+ GG T+F
Sbjct: 437 KHFDRWGGNRIATMLLYLSDVDWGGRTVF 465
>gi|355566863|gb|EHH23242.1| hypothetical protein EGK_06672, partial [Macaca mulatta]
Length = 583
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 318 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 374
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 375 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 432
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 433 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 464
>gi|401397395|ref|XP_003880042.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
gi|325114451|emb|CBZ50007.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
Length = 308
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 19/156 (12%)
Query: 65 TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ----LALRQGETV 120
TG + I + P P + C ++ + R + S+ A +
Sbjct: 102 TGQLEERVIQILAIHENPGVYLIPELLTDSDCAHLLQLCEGRWERSKTSTGYATAEPRDY 161
Query: 121 ESTKG-TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
S K +RTS ++ E TG++E IE ++ +P H E V+RYE GQ + H
Sbjct: 162 MSKKSPSRTSWSVPLAIGE--TGVVENIERIVSAFAGMPVEHLEPLAVVRYEEGQYFKQH 219
Query: 180 YD-AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
+D F P + LLYL+DVE GGET F
Sbjct: 220 HDGGFRP-----------KTVLLYLNDVEAGGETSF 244
>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
melanoleuca]
Length = 539
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q + +V+ P + + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 329 LQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 385
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+I T L + E V+ Y IG Y+ H+D A +P
Sbjct: 386 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 443
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 444 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 475
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 78/179 (43%), Gaps = 9/179 (5%)
Query: 43 RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
R +L+ +++ L G +S Q + L P + + SAE+ +
Sbjct: 292 REELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQL 351
Query: 103 AKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
A+ L+ S + +L E + + R S GTF E I++ + + + L
Sbjct: 352 ARPELQRSMVYSLSNSEHI--STNFRISQGTFFEYHEHP--IMQRMSQHLENISGLDMRS 407
Query: 162 GEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM---SQRLASFLLYLSDVEEGGETMFPF 216
E V Y IG Y+ H D+F+ YG + R+A+ + YLS+VE GG T FPF
Sbjct: 408 AEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF 466
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 69/151 (45%), Gaps = 16/151 (10%)
Query: 73 IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
I ++LS P L F + S ++ I +++K+ + PS E T RTS
Sbjct: 328 IKTEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKS 387
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA-------FN 184
+ S++ + T + I ++ AT L E F V+ Y +G +++H D FN
Sbjct: 388 VWYSSTSNDTT--KRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFN 445
Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
RLA+ L YL++V +GG T FP
Sbjct: 446 GTR------DRLATTLFYLNEVRQGGGTHFP 470
>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
Length = 545
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 76/152 (50%), Gaps = 9/152 (5%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + + +F + + Q I A+ + +++ + GE + R S
Sbjct: 334 LQPIRKEVIHLEPYVVLYHDFVTDAEAQKIRGLAEPWVT-AEILVASGEKQLPVE-YRIS 391
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D +L ++H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 392 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 449
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 450 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 481
>gi|254513749|ref|ZP_05125810.1| oxidoreductase, 2OG-Fe(II) oxygenase family [gamma proteobacterium
NOR5-3]
gi|219675992|gb|EED32357.1| oxidoreductase, 2OG-Fe(II) oxygenase family [gamma proteobacterium
NOR5-3]
Length = 375
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 55/133 (41%), Gaps = 12/133 (9%)
Query: 87 FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
P F AEQC +I + PSQ+ + + R+S + E +
Sbjct: 190 LPEFLKAEQCDRLIDIIRSNAHPSQV-----DGYQQQSDMRSSRTCNLDVREHP--YIAE 242
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLL 201
I+ I+RA + E YE GQ+Y H D F P A + QR +F++
Sbjct: 243 IDDAISRALGISLGWSEINQGQWYEPGQQYKPHPDYFPPGTPEYARFAATSGQRTWTFMI 302
Query: 202 YLSDVEEGGETMF 214
YL+ E GG T F
Sbjct: 303 YLNKTERGGGTHF 315
>gi|126736198|ref|ZP_01751941.1| response regulator receiver domain protein (CheY-like) [Roseobacter
sp. CCS2]
gi|126714364|gb|EBA11232.1| response regulator receiver domain protein (CheY-like) [Roseobacter
sp. CCS2]
Length = 217
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 67/150 (44%), Gaps = 20/150 (13%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTS- 129
VL P + + Q +I+ Q AL + V+S G +RT+
Sbjct: 7 NVLCLNPLVAVIDDVFDEDLAQHVISLG-------QEALVRATVVDSAGGGKLDESRTND 59
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
SGT S+ K L + I+ LP + E +LRYE QK+D H DAF+ G
Sbjct: 60 SGTIDQWSDPK---LASLVTTISDLVRLPPENSEPSQLLRYEGEQKFDPHTDAFDNTVGG 116
Query: 190 PQM----SQRLASFLLYLSDVEEGGETMFP 215
QRL + + YL++V +GGET FP
Sbjct: 117 RDFISRGGQRLFTTICYLNNVGKGGETEFP 146
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
+PR + F + S + + + AK RL S+ + ET + +T R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391
Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447
Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP 469
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 71/162 (43%), Gaps = 9/162 (5%)
Query: 60 LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGE 118
L G +S Q + L P + + SAE+ + A+ L+ S + +L E
Sbjct: 11 LHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSE 70
Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
+ + R S GTF E I++ + + + L E V Y IG Y+
Sbjct: 71 HISTN--FRISQGTFFEYHEHP--IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEP 126
Query: 179 HYDAFNPAE-YGPQM---SQRLASFLLYLSDVEEGGETMFPF 216
H D+F+ YG + R+A+ + YLS+VE GG T FPF
Sbjct: 127 HMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF 168
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 43/79 (54%), Gaps = 1/79 (1%)
Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-MSQRLA 197
+ T +++ + ++ T L +A ++ Y +G YD HYD+ N +E + R+A
Sbjct: 351 NDTAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHNYSEANRLILGDRIA 410
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ L Y+ +V+ GG T FP+
Sbjct: 411 TVLFYVGEVDSGGATTFPY 429
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|326928035|ref|XP_003210190.1| PREDICTED: WD repeat-containing protein 6-like [Meleagris
gallopavo]
Length = 900
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 52/116 (44%), Gaps = 17/116 (14%)
Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP---QTHGEAFNVLRYEIGQ 174
+ V+++ R S T++ E ++ I ++ R T LP H E V+RY+ G
Sbjct: 134 QKVKTSDAVRNSQHTWLYQGEGAHQVMRAIRQRVMRLTRLPPEIVEHSEPLQVVRYDQGG 193
Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFPF 216
Y +H D+ E P + S R + L YL++V GGET+FP
Sbjct: 194 HYHAHMDSGPVFPETACSHTKLVANESAPFETSCRYVTVLFYLNNVTGGGETVFPI 249
>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
Length = 511
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 4/92 (4%)
Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
TRTS + S + + + +IA T E ++ Y +G YD HYD FN
Sbjct: 372 TRTSKVAWFPDSYNPLTVR--LNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNT 429
Query: 186 --AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
+ R+A+ L YL+DVE+GG T+FP
Sbjct: 430 INSNLTAMSGDRIATVLFYLTDVEQGGATVFP 461
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
Length = 239
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 14/131 (10%)
Query: 89 NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
NF + E+C+ I+ + +L S++ K R S ++S + +++ +
Sbjct: 61 NFINKEKCKEIMNNTQNKLFDSEV------ISGKNKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 203
KI++ +P + E V+RY GQ Y+ H+DA E+ + QR + L+YL
Sbjct: 112 QKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVYL 171
Query: 204 SDVEEGGETMF 214
++ EGG T F
Sbjct: 172 NNEFEGGHTFF 182
>gi|341615768|ref|ZP_08702637.1| hypothetical protein CJLT1_12463 [Citromicrobium sp. JLT1363]
Length = 223
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 68/141 (48%), Gaps = 20/141 (14%)
Query: 90 FASAEQCQSIIATAKKRLKPSQLALRQGETVEST--KGTRTS-SGTFISASEDKTGILEL 146
F ++ +C+ + +PS L E T G RTS SG F S ++++
Sbjct: 47 FLTSNECRKLAGMIDMVARPSTL-------YEGTYKDGFRTSYSGNFDSQDP----MVKM 95
Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLL 201
I ++ A LP GE RY GQ++ H+D F P E +M QR + +L
Sbjct: 96 ISRRMDDALGLPGKIGETMQGQRYLPGQQFKDHHDYFYPEEEYWKMERKNGGQRSWTAML 155
Query: 202 YLSDVEEGGETMFPFEVDLQI 222
+L++V+ GG T FP E+ L+I
Sbjct: 156 FLNNVKAGGATSFP-ELGLRI 175
>gi|429856435|gb|ELA31343.1| 2og-fe oxygenase family protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 279
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 78/172 (45%), Gaps = 24/172 (13%)
Query: 60 LPHGETGD---ASIQSIP----FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
LPH +T + S++ +P Q++S P +Y NF S +++A + KPS +
Sbjct: 22 LPHLQTLNLPSPSLKCLPQNYTTQIVSIDPLVIYIHNFLSESDIPALLAAGEPAFKPSYV 81
Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH----KIARATMLPQTHGEAFNVL 168
+ G + T +S+G + + + +L E ++ R M P ++
Sbjct: 82 -YKDGRRQGTPDRTSSSAGLPLDDAAVRC-VLARAERFMGARLGRDEMGPP------QLV 133
Query: 169 RYEIGQKYDSH---YDAFNPAEYG--PQMSQRLASFLLYLSDVEEGGETMFP 215
RY GQ+++ H YD F P G + R+ASF L D GGET FP
Sbjct: 134 RYTAGQRFNVHHDWYDTFQPDRAGGRDRRWNRVASFFAILEDECVGGETWFP 185
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 69/149 (46%), Gaps = 13/149 (8%)
Query: 74 PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
PF+V L + P A+ F + S E+ + I A RLK + + + +E T R S
Sbjct: 319 PFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELE-TAAYRISKS 377
Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
++ + + +++ I +I T L Q E + Y +G YD H+D E P+
Sbjct: 378 AWLKGGDHE--LIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEE--PK 433
Query: 192 ------MSQRLASFLLYLSDVEEGGETMF 214
RLA+ L YL++ E GG T+F
Sbjct: 434 AFESLGTGNRLATVLFYLTEPEIGGGTVF 462
>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_a
[Homo sapiens]
Length = 657
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 72/150 (48%), Gaps = 10/150 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 366 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 422
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 423 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 480
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMF 214
Y + R+A+F++YLS VE GG T F
Sbjct: 481 SPLYRMKSGNRVATFMIYLSSVEAGGATAF 510
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 307 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 363
Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 364 KSAWLKDTVDPK--LVALNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 421
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 422 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 453
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|307108823|gb|EFN57062.1| hypothetical protein CHLNCDRAFT_143806 [Chlorella variabilis]
Length = 514
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 62/142 (43%), Gaps = 19/142 (13%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P L F S E+C + A +K S++ + RTS GTF++ +
Sbjct: 287 PCLLLADAFLSPEECGEVRALGAPHMKRSKV------SAGDETPLRTSWGTFLTGPLAQQ 340
Query: 142 GILELIEHKIARATMLPQTH--------GEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
+ +E ++ + L GEA ++RY+ GQ Y H D S
Sbjct: 341 PVAARLEGRVRQLAALACEAEGRRALQLGEATQIVRYDPGQFYALHLD-----NRAGDSS 395
Query: 194 QRLASFLLYLSDVEEGGETMFP 215
+R A+ ++Y+SDVE GG T FP
Sbjct: 396 RRAATVMIYISDVEAGGATHFP 417
>gi|219109949|ref|XP_002176727.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411262|gb|EEC51190.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 137
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 9/136 (6%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISASEDK 140
P P F S +C ++ A +R+ + + +T K +RTS+ T + E
Sbjct: 8 PVVWIVPGFLSESECDALQRKASRRM--TSCVTKNADTSRVYKDLSRTSTNTNVPRREVP 65
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLAS 198
T + +L + + E VLRY+ GQ + H D F+ G S RL +
Sbjct: 66 TVVSKLKNLTLCD----DEARFEILQVLRYQAGQHFSPHTDGFSGPITACGFWDSGRLVT 121
Query: 199 FLLYLSDVEEGGETMF 214
YL+DVE+GG T F
Sbjct: 122 VFCYLNDVEKGGTTRF 137
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 64/125 (51%), Gaps = 7/125 (5%)
Query: 94 EQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
E+ + I AK +L ++ +R +T V + R S +++ ED ++ + ++
Sbjct: 2 EEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRRMQ 57
Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLASFLLYLSDVEEGG 210
T L E V Y +G +Y+ H+D + P + G + RLA+FL Y+SDVE GG
Sbjct: 58 HITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGG 117
Query: 211 ETMFP 215
T+FP
Sbjct: 118 ATVFP 122
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 81 RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
+PR + F + S + + + AK RL+ + ++ +E+ R S ++S E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392
Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447
Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
R+A++L Y+SDV GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 318 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 374
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 375 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 432
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 433 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 464
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 7/139 (5%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
PR + + + + + I A+ RLK + + + +E R S ++ ED
Sbjct: 44 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHEDV- 101
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
++ + ++ T L E V+ Y +G YD HYD E S R+A
Sbjct: 102 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 160
Query: 198 SFLLYLSDVEEGGETMFPF 216
+ L Y+SDV +GG T+FP+
Sbjct: 161 TVLFYMSDVAQGGATVFPW 179
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 69/141 (48%), Gaps = 15/141 (10%)
Query: 82 PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
P + + + AS ++ +++ AK RL+ + + Q + +T R S ++ + E
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKL-TTAQYRVSKSAWLGSHEHP- 381
Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 193
I++ I +I T L + E V Y +G +Y+ H+D AF G
Sbjct: 382 -IVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTG---- 436
Query: 194 QRLASFLLYLSDVEEGGETMF 214
R+A++LLY+SDV+ GG T+F
Sbjct: 437 NRIATWLLYMSDVQAGGNTVF 457
>gi|407917017|gb|EKG10344.1| Oxoglutarate/iron-dependent oxygenase [Macrophomina phaseolina MS6]
Length = 289
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 67/145 (46%), Gaps = 19/145 (13%)
Query: 76 QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
+LS P +Y P+F SA++ + ++ ++ + KPS + E ++ + R S
Sbjct: 67 HILSRDPLVIYIPDFLSADEAKHMVDISEGKFKPSTVWTGGEERLDES--VRVSE----K 120
Query: 136 ASEDKTGILELIEHKIAR--ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
A D+ +++ IE + AR P T E RY G Y H+D GP S
Sbjct: 121 AEIDRDTVVQCIEER-ARLFQGWRPYTFIEKLWTQRYGPGGHYVHHFD-------GPTTS 172
Query: 194 Q---RLASFLLYLSDVEEGGETMFP 215
R++SF++YL GG T FP
Sbjct: 173 HQGGRISSFMVYLDANCTGGGTNFP 197
>gi|194373965|dbj|BAG62295.1| unnamed protein product [Homo sapiens]
Length = 604
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 70 IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
+Q I +V+ P + +F S + Q I A+ L+ S +A GE + R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390
Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
++ + D L + H+IA T L + E V+ Y IG Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448
Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
Y + R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480
>gi|428183403|gb|EKX52261.1| hypothetical protein GUITHDRAFT_84780 [Guillardia theta CCMP2712]
Length = 199
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 60/140 (42%), Gaps = 19/140 (13%)
Query: 82 PRALYFPNFASAEQCQSI----IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
PR N S E+C+++ IA KR S + G +
Sbjct: 42 PRIFVLHNLLSKEECENLRDLGIARGMKRNAQSPVL-----------GDDPRKHEVATLD 90
Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQ 194
++ + +E K+A T +HGEAF ++RY Y H D +PA+ G S+
Sbjct: 91 FNENDFVRRLEDKLANLTRTSSSHGEAFQIIRYAQSDFYPEHVDYIDPAKSDLLGKGKSE 150
Query: 195 RLASFLLYLSDVEEGGETMF 214
+A+ ++YL + GGET F
Sbjct: 151 -IATVIIYLKSADSGGETFF 169
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,495,929,268
Number of Sequences: 23463169
Number of extensions: 135441501
Number of successful extensions: 306749
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 598
Number of HSP's successfully gapped in prelim test: 1048
Number of HSP's that attempted gapping in prelim test: 304506
Number of HSP's gapped (non-prelim): 1790
length of query: 227
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 90
effective length of database: 9,144,741,214
effective search space: 823026709260
effective search space used: 823026709260
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)