BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027195
(226 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 179/226 (79%), Positives = 207/226 (91%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+ HG++G+DSVT+IPFQVLSW PRALYFPNFAT EQC+SIINMAK NL PST+ALR GE
Sbjct: 60 LAHGESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVGEI 119
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
NT+GIRTSSGVFISA+ED++GTLDLIE+KIA+V M+PR +GEAFN+LRY+IGQ+YNSH
Sbjct: 120 RGNTEGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSH 179
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP EYGPQKS R+A+FLVYL+D+EEGGETMFPFENG+N D YD+Q+CIGLKVKP
Sbjct: 180 YDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCIGLKVKPH 239
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYS+ PNGTIDPTS+HGSCPV+KGEKWVATKWIRDQEQ D
Sbjct: 240 QGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 285
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 178/226 (78%), Positives = 206/226 (91%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M G+ GDDS+++IPFQVLSW PRALYFP FAT EQC+SI+N+AK LRPSTLALRKGET
Sbjct: 66 MSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGET 125
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+G+RTSSGVF SA+EDESGTL +IEEKIA+ TM+PR +GEA+NILRY+IGQKYNSH
Sbjct: 126 AESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSH 185
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y++Q CIGLKVKPR
Sbjct: 186 YDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPR 245
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYS+ PNGTIDPTS+HGSCPV+KG+KWVATKWIRDQ Q D
Sbjct: 246 QGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 391 bits (1005), Expect = e-107, Method: Compositional matrix adjust.
Identities = 177/226 (78%), Positives = 205/226 (90%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M G+ GDDS+++IPFQVLSW PRALYFP FAT EQC+SI+N+AK LRPSTLALRKGET
Sbjct: 66 MSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGET 125
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+G+RTSSGVF SA+EDESGTL +IEEK A+ TM+PR +GEA+NILRY+IGQKYNSH
Sbjct: 126 AESTKGVRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQKYNSH 185
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y++Q CIGLKVKPR
Sbjct: 186 YDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKVKPR 245
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYS+ PNGTIDPTS+HGSCPV+KG+KWVATKWIRDQ Q D
Sbjct: 246 QGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 179/226 (79%), Positives = 205/226 (90%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+P G GDD +T IPFQVLSW PRALYFPNFAT EQC+S+INMAK NL PSTLALRKGET
Sbjct: 65 LPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLALRKGET 124
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+GIRTSSG+F+SA+ED++G LD IEEKIA+ TMLPR NGEAFNILRY+IGQKYNSH
Sbjct: 125 EENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKYNSH 184
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQKSQRVASFL+YL+D+EEGGETMFPFEN ++ D SYD++KCIGL+V+PR
Sbjct: 185 YDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIGLQVRPR 244
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+GDGLLFYSL PN TIDPTS+HGSCPV+KGEKWVATKWIRDQEQ D
Sbjct: 245 RGDGLLFYSLFPNNTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 290
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 389 bits (999), Expect = e-106, Method: Compositional matrix adjust.
Identities = 178/226 (78%), Positives = 208/226 (92%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MP G+ GDDS+T+IPFQVLSW PRA+YFPNFAT EQC+SII++AK L+PSTLALR+GET
Sbjct: 68 MPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQGET 127
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
DNT+GIRTSSGVF+SA+ED++ TLD+IEEKIA+ TM+PR +GEAFNILRY++ Q+YNSH
Sbjct: 128 EDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSH 187
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQKSQR+ASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ CIGLKVKPR
Sbjct: 188 YDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVKPR 247
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYSLL NGTIDPTS+HGSCPV+KGEKWVATKWIRDQE D
Sbjct: 248 QGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELDD 293
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 389 bits (998), Expect = e-106, Method: Compositional matrix adjust.
Identities = 178/226 (78%), Positives = 208/226 (92%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MP G+ GDDS+T+IPFQVLSW PRA+YFPNFAT EQC+SII++AK L+PSTLALR+GET
Sbjct: 68 MPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQGET 127
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
DNT+GIRTSSGVF+SA+ED++ TLD+IEEKIA+ TM+PR +GEAFNILRY++ Q+YNSH
Sbjct: 128 EDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSH 187
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQKSQR+ASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ CIGLKVKPR
Sbjct: 188 YDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEGCIGLKVKPR 247
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYSLL NGTIDPTS+HGSCPV+KGEKWVATKWIRDQE D
Sbjct: 248 QGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELDD 293
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 382 bits (981), Expect = e-104, Method: Compositional matrix adjust.
Identities = 176/226 (77%), Positives = 206/226 (91%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MP GDDS+T+IPFQVLSW PRALYFPNFAT EQC++II++AK L+PSTLALR+GET
Sbjct: 68 MPVRDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQGET 127
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+GIRTSSGVF+SA+ D++GTL +IEEKIA+ TM+PR +GEAFNILRY++ Q+YNSH
Sbjct: 128 EENTKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYNSH 187
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQKSQR+ASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ CIGLKVKPR
Sbjct: 188 YDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVKPR 247
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYSLL NGTIDPTS+HGSCPV+KGEKWVATKWIRDQEQ D
Sbjct: 248 QGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 293
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 171/226 (75%), Positives = 206/226 (91%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M G+ GDDS+T+IPFQVLSW PRALYFPNFAT EQC++I+++AK L+PS+LALRKGET
Sbjct: 70 MTAGEFGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGET 129
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+GIRTSSGVF+SA+ D++ TL+ IEEKIA+ TM+PR +GEAFNILRY++GQ+YNSH
Sbjct: 130 TENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSH 189
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ C+GL+VKPR
Sbjct: 190 YDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDCVGLRVKPR 249
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYSLLPNGTID TS+HGSCPV+KGEKWVATKWIR+ +Q D
Sbjct: 250 QGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQED 295
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 169/226 (74%), Positives = 204/226 (90%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M G+ GDDS+T+IPFQVLSW PRALYFPNFAT EQC++I+++AK L+PS+LALRKGET
Sbjct: 70 MTAGEFGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGET 129
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+GIRTSSGVF+SA+ D++ TL+ IEEKIA+ TM+PR +GEAFNILRY++GQ+Y SH
Sbjct: 130 TENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSH 189
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ +GL+VKPR
Sbjct: 190 YDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDRVGLRVKPR 249
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDGLLFYSLLPNGTID TS+HGSCPV+KGEKWVATKWIR+ +Q D
Sbjct: 250 QGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQED 295
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 372 bits (955), Expect = e-101, Method: Compositional matrix adjust.
Identities = 165/224 (73%), Positives = 196/224 (87%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG G++SV +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+PS LALRKGET
Sbjct: 63 MPHGVTGEESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGET 122
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+G RTSSG FISA+ED +G LD +E KIA+ TM+PR +GE+FNILRY++GQKY+SH
Sbjct: 123 AENTKGTRTSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSH 182
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YD F+P EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY++CIGLKVKPR
Sbjct: 183 YDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPR 242
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+GDGLLFYS+ PNGTID TS+HGSCPV KGEKWVATKWIRDQ+Q
Sbjct: 243 KGDGLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQ 286
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 167/226 (73%), Positives = 200/226 (88%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+P G+ G+ S+ +IPFQVLSW PRALYFPNFA+ EQC+SII MAK N+ PS+LALR GET
Sbjct: 61 LPIGETGEHSLISIPFQVLSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLALRTGET 120
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+ T+GIRTSSG FISA+ED++G LDLIEEKIAK TM+P+ +GEAFN+LRY+IGQ+Y SH
Sbjct: 121 EETTKGIRTSSGTFISASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIGQRYQSH 180
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQKSQR ASFL+YL+D+EEGGET+FP+ENG N D SYD+ KCIGLKVKPR
Sbjct: 181 YDAFDPAQYGPQKSQRAASFLLYLSDVEEGGETVFPYENGQNMDASYDFSKCIGLKVKPR 240
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+GDGLLFYSL PNGTID TS+HGSCPV++GEKWVATKWIR+Q+Q D
Sbjct: 241 RGDGLLFYSLFPNGTIDLTSLHGSCPVIRGEKWVATKWIRNQDQDD 286
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 163/224 (72%), Positives = 196/224 (87%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG G++S+ +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+PS LALRKGET
Sbjct: 63 MPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGET 122
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+G RTSSG FISA+E+ +G LD +E KIA+ TM+PR +GE+FNILRY++GQKY+SH
Sbjct: 123 AENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSH 182
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YD F+P EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY++CIGLKVKPR
Sbjct: 183 YDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPR 242
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+GDGLLFYS+ PNGTID TS+HGSCPV KGEKWVATKWIRDQ+Q
Sbjct: 243 KGDGLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQ 286
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 167/226 (73%), Positives = 197/226 (87%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MP G G+ + +IPFQVLSW PRA+YFP+FATPEQCK+II MAKL L+PS LALRKGET
Sbjct: 61 MPRGVTGESYIESIPFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGET 120
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+G RTSSG F+SA+ED +GTLD IE KIA+ TM+PR +GEAFNILRY+IGQKY+SH
Sbjct: 121 AESTKGTRTSSGTFLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSH 180
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YD+F+P EYGPQ SQRVASFL+YL+D+E+GGETMFPFENG+ YDY+KC GLKVKPR
Sbjct: 181 YDSFNPAEYGPQMSQRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCAGLKVKPR 240
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
QGDG+LFYSLLPNGTID TS+HGSCPV++GEKWVATKWIRDQ Q D
Sbjct: 241 QGDGILFYSLLPNGTIDQTSLHGSCPVIEGEKWVATKWIRDQVQMD 286
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 163/224 (72%), Positives = 196/224 (87%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG G++S+ +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+PS LALRKGET
Sbjct: 63 MPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGET 122
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+G RTSSG FISA+E+ +G LD +E KIA+ TM+PR +GE+FNILRY++GQKY+SH
Sbjct: 123 AENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSH 182
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YD F+P EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY++CIGLKVKPR
Sbjct: 183 YDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPR 242
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+GDGLLFYS+ PNGTID TS+HGSCPV KGEKWVATKWIRDQ+Q
Sbjct: 243 KGDGLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQ 286
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 369 bits (946), Expect = e-100, Method: Compositional matrix adjust.
Identities = 168/226 (74%), Positives = 198/226 (87%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG+ G+ SV IPFQVLSW PRALYFP FAT EQC+SII MAK +LRPSTLALR+GET
Sbjct: 51 MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 110
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+G RTSSG FISA+ED++G LD +E KIAK TM+PR +GEAFNILRY+IGQ+YNSH
Sbjct: 111 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 170
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQ SQRVASFL+YL+D+EEGGETMFPFE+ +N YDY+KCIGLKVKP+
Sbjct: 171 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQ 230
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+GDGLLFYS+ PNGTID TS+HGSCPV+ GEKWVATKWIRD++Q D
Sbjct: 231 RGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 276
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 368 bits (945), Expect = e-100, Method: Compositional matrix adjust.
Identities = 168/226 (74%), Positives = 198/226 (87%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG+ G+ SV IPFQVLSW PRALYFP FAT EQC+SII MAK +LRPSTLALR+GET
Sbjct: 59 MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 118
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+G RTSSG FISA+ED++G LD +E KIAK TM+PR +GEAFNILRY+IGQ+YNSH
Sbjct: 119 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 178
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQ SQRVASFL+YL+D+EEGGETMFPFE+ +N YDY+KCIGLKVKP+
Sbjct: 179 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQ 238
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+GDGLLFYS+ PNGTID TS+HGSCPV+ GEKWVATKWIRD++Q D
Sbjct: 239 RGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 284
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 364 bits (934), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 167/222 (75%), Positives = 193/222 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+P G+ GD+ +T IPFQVLSW P ALYFPNFAT EQC+SII AK L+PSTL LR GET
Sbjct: 77 LPAGETGDNFITTIPFQVLSWNPHALYFPNFATAEQCESIIETAKEGLKPSTLVLRVGET 136
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T GIRTSSGVFISA ED++G LD+IEEKIA+ T +PR +GEAFN+LRYK+GQKY+SH
Sbjct: 137 DESTTGIRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQKYSSH 196
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDA P YGPQKSQR+ASFL+YL+D+ EGGETMFPFENG+N DGSY Y+KCIGLKVKPR
Sbjct: 197 YDALHPDIYGPQKSQRMASFLLYLSDVPEGGETMFPFENGLNMDGSYYYEKCIGLKVKPR 256
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+GDGLLFYSL PNGTIDP S+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 257 KGDGLLFYSLFPNGTIDPMSLHGSCPVIKGEKWVATKWIRDQ 298
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 360 bits (925), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 164/222 (73%), Positives = 192/222 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG G+ SV +IPFQVLSW PRALYFP FATPEQC+SII M + L+PSTLALRKGET
Sbjct: 67 MPHGVTGEASVESIPFQVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTLALRKGET 126
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+ RTSSG F+S +EDE+GTLD IE+KIAK TM+P+ +GEAFNILRY+IGQKY+SH
Sbjct: 127 AESTKDTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSH 186
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYG Q SQR ASFL+YL+++EEGGETMFPFENG +DY++C+GLKVKPR
Sbjct: 187 YDAFNPDEYGQQSSQRTASFLLYLSNVEEGGETMFPFENGSAVIPGFDYKQCVGLKVKPR 246
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
QGDGLLFYSL PNGTIDPTS+HGSCPV+KG KWVATKWIRDQ
Sbjct: 247 QGDGLLFYSLFPNGTIDPTSLHGSCPVIKGVKWVATKWIRDQ 288
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 356 bits (914), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 160/223 (71%), Positives = 194/223 (86%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
G++GD+ +T+IPFQVLSW PRALYFPNFA+ EQC II MAK L PS L LR+GET +
Sbjct: 71 GKSGDNFITSIPFQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLMLREGETEEG 130
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
T+GIRTSSG+FISA+ED++G L++I+EKIA+ +P+ +G A+NILRYK+GQKYNSHYDA
Sbjct: 131 TKGIRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDA 190
Query: 124 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
F+P EYGPQ+SQRVASFL+YLTD+ EGGETMFPFENG N D SY+++ CIGLK+KP +GD
Sbjct: 191 FNPAEYGPQESQRVASFLLYLTDVPEGGETMFPFENGSNMDSSYNFEDCIGLKIKPLKGD 250
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
GLLFYSL PNGTIDPTS+HGSCPV+KGEKWVATKWIR+Q YD
Sbjct: 251 GLLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWIREQLHYD 293
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 355 bits (912), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 167/226 (73%), Positives = 194/226 (85%), Gaps = 2/226 (0%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+ G +GDD VT IPFQVLSW PRALYFPNFA+ EQC+SII MA+ L+ STLALRKGET
Sbjct: 74 LKAGDSGDDYVTLIPFQVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTLALRKGET 133
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+GIRTSSGVF+SA+EDE+G LD IEEKIAK T +PR +GEAFNILRY++GQKYNSH
Sbjct: 134 EESTKGIRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQKYNSH 193
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFD EYGP +SQRVASFL+YLTD+ EGGETMFP+ENG N DG + + CIGL+V+PR
Sbjct: 194 YDAFDEAEYGPLQSQRVASFLLYLTDVPEGGETMFPYENGFNRDG--NVEDCIGLRVRPR 251
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+GD LLFYSLLPNGTID TS HGSCPV+KGEKWVATKWIR+Q Q D
Sbjct: 252 KGDALLFYSLLPNGTIDQTSAHGSCPVIKGEKWVATKWIRNQVQDD 297
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 355 bits (911), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 161/219 (73%), Positives = 192/219 (87%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
G +GDD +T IPFQVLSW PRALYFPNF + EQC++II MA+ L+PSTL LRKGET ++
Sbjct: 77 GDSGDDYITLIPFQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRKGETEES 136
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
T+GIRTS GVF+SA+EDE+G LD IEEKIAK T +PR +GEAFNILRY++GQKY+ HYDA
Sbjct: 137 TKGIRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPHYDA 196
Query: 124 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
FD E+GP +SQR ASFL+YLTD+ EGGET+FP+ENG N DGSYD++ CIGL+V+PR+GD
Sbjct: 197 FDEAEFGPLQSQRAASFLLYLTDVPEGGETLFPYENGFNRDGSYDFEDCIGLRVRPRKGD 256
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
GLLFYSLLPNGTID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 257 GLLFYSLLPNGTIDQTSVHGSCPVIKGEKWVATKWIRDQ 295
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 353 bits (905), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 160/222 (72%), Positives = 192/222 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M +G++GD + + IP+Q+LSW PRALYFP FAT EQC++++ AK LRPSTLALRKGE+
Sbjct: 124 MAYGESGDPAPSLIPYQILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRKGES 183
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+ T+GIRTSSG F+SA ED +G L IE KIAK TM+PR +GE FN+LRY+IGQKY SH
Sbjct: 184 EETTKGIRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASH 243
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N + YDY++CIGLKVKPR
Sbjct: 244 YDAFDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPR 303
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+GDGLLFYSL+ NGTIDPTS+HGSCPVV+GEKWVATKWIRD+
Sbjct: 304 KGDGLLFYSLMVNGTIDPTSLHGSCPVVRGEKWVATKWIRDK 345
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 351 bits (900), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 160/224 (71%), Positives = 194/224 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M +G++G+ + IP+Q+LSW PRALYFP FAT +QC++I+ AK L PSTLALRKGET
Sbjct: 55 MAYGESGEPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGET 114
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+GIRTSSG F+S+ ED +GTL +E+KIAK TM+PR +GE FNILRY+IGQ+Y SH
Sbjct: 115 EESTKGIRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASH 174
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N D YDY+KCIGLKVKPR
Sbjct: 175 YDAFDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPR 234
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+GDGLLFYSL+ NGTIDPTS+HGSCPV+KGEKWVATKWIRD+ +
Sbjct: 235 KGDGLLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 278
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 350 bits (899), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 160/224 (71%), Positives = 194/224 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M +G++G+ + IP+Q+LSW PRALYFP FAT +QC++I+ AK L PSTLALRKGET
Sbjct: 85 MAYGESGEPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGET 144
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+GIRTSSG F+S+ ED +GTL +E+KIAK TM+PR +GE FNILRY+IGQ+Y SH
Sbjct: 145 EESTKGIRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASH 204
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N D YDY+KCIGLKVKPR
Sbjct: 205 YDAFDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPR 264
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+GDGLLFYSL+ NGTIDPTS+HGSCPV+KGEKWVATKWIRD+ +
Sbjct: 265 KGDGLLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 308
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 350 bits (898), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 162/226 (71%), Positives = 194/226 (85%), Gaps = 1/226 (0%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG+ G+ SV IPFQVLSW PRA YFP+FAT EQC+SII MAK L PSTL LRKGET
Sbjct: 58 MPHGETGESSVDLIPFQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRKGET 117
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+GIRTSSG FISA+ED++G LD IE KIAK TM+PR +GE FNILRY+IGQ+YNSH
Sbjct: 118 EESTKGIRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSH 177
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDA P EYG Q SQR+ASFL+YL+D+EEGGETMFPFE+ +N + +++ +KCIGLKVKPR
Sbjct: 178 YDAISPAEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNIN-TFNSRKCIGLKVKPR 236
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+GDGLLFYS+ PNGTID TS+HGSCPV++GEKWVATKWIRD++Q D
Sbjct: 237 RGDGLLFYSVFPNGTIDWTSMHGSCPVIEGEKWVATKWIRDEQQED 282
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 348 bits (893), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 159/221 (71%), Positives = 192/221 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MP+G++G+ + + IP+Q+LSW PRALYFP FAT EQC++I+ AK L+PSTLALRKGET
Sbjct: 71 MPYGESGEAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGET 130
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+GIRTSSG F+SA ED + TL IE+KIA+ TM+PR +GE FN+LRY IGQ+Y SH
Sbjct: 131 AESTKGIRTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASH 190
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQKSQRVASFL+YLT++EEGGETMFP+ENG N D YDY+KCIGLKVKPR
Sbjct: 191 YDAFDPVQYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPR 250
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+GDGLLFYSL+ NGTID TS+HGSCPV+KGEKWVATKWIRD
Sbjct: 251 KGDGLLFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRD 291
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 348 bits (892), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 159/221 (71%), Positives = 191/221 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MP+G++G+ + + IP+Q+LSW PRALYFP FAT EQC++I+ AK L+PSTLALRKGET
Sbjct: 71 MPYGESGEAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGET 130
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+GIRTSSG F+SA ED + TL IE+KIA+ TMLPR +GE FN+LRY IGQ+Y SH
Sbjct: 131 AESTKGIRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASH 190
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQK+QRVASFL+YLTD+EEGGETMFP+EN N D YDY+KCIGLKVKPR
Sbjct: 191 YDAFDPAQYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPR 250
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+GDGLLFYSL+ NGTID TS+HGSCPV+KGEKWVATKWIRD
Sbjct: 251 KGDGLLFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRD 291
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 347 bits (891), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 159/222 (71%), Positives = 191/222 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M +G +GD + + IP+Q+LSW PRALYFP FAT EQC++++ AK LRPSTLALRKGET
Sbjct: 72 MAYGDSGDPAPSLIPYQILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGET 131
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+ T+GIRTSSG F+SA ED + TL +E+KIAK TM+PR +GE FN+LRY+IGQKY SH
Sbjct: 132 EETTKGIRTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASH 191
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N D YDY++CIGLKVKPR
Sbjct: 192 YDAFDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPR 251
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+GDGLLFYSL+ NGTID TS+HGSCPV+KGEKWVATKWIR++
Sbjct: 252 KGDGLLFYSLMVNGTIDLTSLHGSCPVIKGEKWVATKWIRNK 293
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 347 bits (891), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 159/221 (71%), Positives = 191/221 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MP+G++G+ + + IP+Q+LSW PRALYFP FAT EQC++I+ AK L+PSTLALRKGET
Sbjct: 71 MPYGESGEAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGET 130
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+GIRTSSG F+SA ED + TL IE+KIA+ TMLPR +GE FN+LRY IGQ+Y SH
Sbjct: 131 AESTKGIRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASH 190
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAFDP +YGPQK+QRVASFL+YLTD+EEGGETMFP+EN N D YDY+KCIGLKVKPR
Sbjct: 191 YDAFDPAQYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPR 250
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+GDGLLFYSL+ NGTID TS+HGSCPV+KGEKWVATKWIRD
Sbjct: 251 KGDGLLFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRD 291
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 345 bits (885), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 152/223 (68%), Positives = 190/223 (85%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
HG GDD ++ IPFQVLSW PRAL FP FA+P QC++II++AK L PS+LALRKGET
Sbjct: 69 HGVTGDDQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETAT 128
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
TQ +RTS G F+S+ +D++GTL +EEK+AK TM+P+ +GEAFN+LRY+IGQKYNSHYD
Sbjct: 129 ETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYD 188
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 182
F+P EYGPQKSQR+ASFL+YL+D+EEGGETMFPFEN + + +YDY++CIGLKVKP+QG
Sbjct: 189 VFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQG 248
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
D LLFYS+ PNGT D T++HGSCPV+KGEKWVATKWIRD+E +
Sbjct: 249 DALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKEDW 291
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 344 bits (883), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 156/225 (69%), Positives = 191/225 (84%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+ HG++G+ V +IP Q+LSW PRA++FPNF + E C+ II MAK L PS LALRKGET
Sbjct: 61 LEHGESGEPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGET 120
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+ RTSSG FISA+ED+SG LDL+E KIAKVTM+PR +GE FNIL+Y++GQKY+SH
Sbjct: 121 AESTKDTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSH 180
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYG +SQR+ASFL+YL+++E GGETMFP+E G+N D YDYQKCIGLKVKPR
Sbjct: 181 YDAFNPDEYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDRGYDYQKCIGLKVKPR 240
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
QGDGLLFYSLLPNG ID TS+HGSCPV+KGEKWVATKWI D+EQ+
Sbjct: 241 QGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 343 bits (881), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 151/223 (67%), Positives = 189/223 (84%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
HG G+D + IPFQVLSW PRAL FP FA+P QC++II++AK L PS+LALRKGET
Sbjct: 8 HGVTGEDQLAFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETAT 67
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
TQ +RTS G F+S+ +D++GTL +EEK+AK TM+P+ +GEAFN+LRY+IGQKYNSHYD
Sbjct: 68 ETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYD 127
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 182
F+P EYGPQKSQR+ASFL+YL+D+EEGGETMFPFEN + + +YDY++CIGLKVKP+QG
Sbjct: 128 VFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQG 187
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
D LLFYS+ PNGT D T++HGSCPV+KGEKWVATKWIRD+E +
Sbjct: 188 DALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKEDW 230
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 342 bits (876), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 155/226 (68%), Positives = 191/226 (84%), Gaps = 1/226 (0%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+ HG++G+ + +IPFQ+LSW PRA+YFPNF + E C+ II MAK L PS LALRKGET
Sbjct: 60 LEHGESGEPFLNSIPFQILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGET 119
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++T+ RTSSG FISA+ED+SG LD +E KIAKVTM+PR +GE FNIL+Y++ QKY+SH
Sbjct: 120 AESTKDTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSH 179
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD-GSYDYQKCIGLKVKP 179
YDAF+P EYG +SQR+ASFL+YL+++E GGETMFP+E G+N D G YDY+KCIGLKVKP
Sbjct: 180 YDAFNPDEYGTVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDKGYYDYKKCIGLKVKP 239
Query: 180 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
RQGDGLLFYSLLPNG ID TS+HGSCPV+KGEKWVATKWI D+EQ+
Sbjct: 240 RQGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 338 bits (866), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 153/224 (68%), Positives = 188/224 (83%), Gaps = 1/224 (0%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+P G++GDD +T IPFQVLSW PRALY+P F T EQC+ IINMAK +L+PSTLALRKGET
Sbjct: 74 LPSGESGDDFITLIPFQVLSWRPRALYYPGFITAEQCQHIINMAKPSLQPSTLALRKGET 133
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+ T+GIRTSSG+F+ ++ED++G L +IEEKIA+ TM+P +GEAFN+LRY+IGQKY++H
Sbjct: 134 AETTKGIRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQKYDAH 193
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YDAF+P EYGPQ SQRVA+FL+YL++ EEGGET FP EN N +G YD QKC GL+VKP
Sbjct: 194 YDAFNPAEYGPQTSQRVATFLLYLSNFEEGGETTFPIENDENFEG-YDAQKCNGLRVKPH 252
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
QGD +LFYS+ PN TIDP S+H SC V+KGEKWVATKWIRDQ Q
Sbjct: 253 QGDAILFYSIFPNNTIDPASLHASCHVIKGEKWVATKWIRDQVQ 296
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 333 bits (854), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 149/223 (66%), Positives = 185/223 (82%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
HG+ GD SVT+IPFQVLSW PRAL +PNFA+ EQC++II +A+ L PS LALRKGE+
Sbjct: 62 HGRTGDSSVTDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEA 121
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
T+ IRTSSG F+ A+ED++ +L +EEK+A+ TM+PR NGEAFN+LRY GQKY+ HYD
Sbjct: 122 TTKEIRTSSGTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYD 181
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 182
FDP EYGPQ SQR+ASFL+YL+D+EEGGETMFPFEN N + Y+Y+ CIGLKVKPRQG
Sbjct: 182 VFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCIGLKVKPRQG 241
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
D LLFYS+ PNGT D T++HGSCPV+KGEKWVATKWIR+ +++
Sbjct: 242 DALLFYSMHPNGTFDKTALHGSCPVIKGEKWVATKWIRNTDKF 284
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 330 bits (845), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 144/225 (64%), Positives = 186/225 (82%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+ HG GD+ + +IPFQVLSW PRAL +P FA+ EQC++I+ +A+ L PS LALRKGE+
Sbjct: 75 LDHGSTGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALALRKGES 134
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
D+T+ IRTSSG F+ A ED + +L+ +EEK+AK TM+PR NGEAFN+L+Y +GQKY+ H
Sbjct: 135 EDSTKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCH 194
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YD FDP EYGPQ SQR+ASFL+YL+D+EEGGETMFPFEN N + +DY+KCIG+KVKPR
Sbjct: 195 YDVFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCIGMKVKPR 254
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
QGD LLFYS+ PNGT D +++HGSCPV+KGEKWVATKWIR+ +++
Sbjct: 255 QGDALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKWIRNTDKF 299
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 316 bits (809), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 140/197 (71%), Positives = 171/197 (86%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
MPHG G++S+ +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+PS LALRKGET
Sbjct: 12 MPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGET 71
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+G RTSSG FISA+E+ +G LD +E KIA+ TM+PR +GE+FNILRY++GQKY+SH
Sbjct: 72 AENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSH 131
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
YD F+P EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY++CIGLKVKPR
Sbjct: 132 YDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPR 191
Query: 181 QGDGLLFYSLLPNGTID 197
+GDGLLFYS+ PNGTID
Sbjct: 192 KGDGLLFYSVFPNGTID 208
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 305 bits (780), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 141/213 (66%), Positives = 168/213 (78%), Gaps = 3/213 (1%)
Query: 10 SVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRT 69
SV+NIPF LSW PR Y PNFAT +QC+++I+MAK L+PSTLALRKGET + TQ R+
Sbjct: 62 SVSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRS 121
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
+ EDESG L IEEKIA T P+ E+FNILRY++GQKY+SHYDAF EY
Sbjct: 122 ---LHQHTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEY 178
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
GP SQRV +FL++L+ +EEGGETMFPFENG N +G YDY+KC+GLKVKPRQGD + FY+
Sbjct: 179 GPLISQRVVTFLLFLSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYN 238
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
L PNGTID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 239 LFPNGTIDQTSLHGSCPVIKGEKWVATKWIRDQ 271
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 304 bits (778), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 141/213 (66%), Positives = 168/213 (78%), Gaps = 3/213 (1%)
Query: 10 SVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRT 69
SV+NIPF LSW PR Y PNFAT +QC+++I+MAK L+PS LALRKGET + TQ +RT
Sbjct: 60 SVSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRT 119
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
EDESG L IEEKIA T +P E+FNILRY++GQKY+SHYDAF P EY
Sbjct: 120 R---LKKTDEDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEY 176
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
GPQ SQRV +F+++L+ +EEGGETMFPFENG N +G YDY+ CIGL+VKPRQGD + FY+
Sbjct: 177 GPQISQRVVTFILFLSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYN 236
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
LLPN TID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 237 LLPNRTIDQTSLHGSCPVIKGEKWVATKWIRDQ 269
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 279 bits (714), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 130/184 (70%), Positives = 153/184 (83%), Gaps = 9/184 (4%)
Query: 43 MAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRIN 102
MAK L+PSTLALRKGET ++T FI +ED++GTLD IE KIAK TM+P+ +
Sbjct: 1 MAKSKLKPSTLALRKGETTEST---------FIGGSEDKTGTLDFIERKIAKATMIPQSH 51
Query: 103 GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN 162
GEAFNILRY+IGQKY+SHYDAF+P EYGPQ SQRVASFL+YL+ +EEGGETMFPFENG
Sbjct: 52 GEAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFENGSA 111
Query: 163 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
++Y++C+GLKVKPRQGDGLLFYSL PNGTID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 112 VSSGFEYKQCVGLKVKPRQGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 171
Query: 223 EQYD 226
+ D
Sbjct: 172 MELD 175
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 272 bits (696), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 129/193 (66%), Positives = 148/193 (76%), Gaps = 33/193 (17%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGE---------------------- 104
IR SGVFISA+ED++GTLDLIE+KIA+V M+PR +GE
Sbjct: 90 IRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVM 149
Query: 105 -----------AFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGET 153
AFNILRY+IGQ+YNSHYDAFDP EYGPQKS R+A+FLVYL+D+EEGGET
Sbjct: 150 KRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGET 209
Query: 154 MFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKW 213
MFPFENG+N D YD+Q+CIGLKVKP QGDGLLFYS+ PNGTIDPTS+HGSCPV+KGEKW
Sbjct: 210 MFPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKW 269
Query: 214 VATKWIRDQEQYD 226
VATKWIRDQEQ D
Sbjct: 270 VATKWIRDQEQDD 282
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 259 bits (662), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 121/223 (54%), Positives = 163/223 (73%), Gaps = 3/223 (1%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
G GD+ +T + FQ+LS PR++ + NFA+ C +I+ A+ L S LAL++GET++
Sbjct: 116 GALGDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALKRGETLET 175
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
T+ IRTSSG F+++ ++SG L +EEK+A+ T +P +GEA+NILRY+IGQKY+SHYD
Sbjct: 176 TKNIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDM 235
Query: 124 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC-IGLKVKPR 180
FDP +YGPQ+SQRVASFL+YLT +EGGET+FP E NG+ DY C GLKVKPR
Sbjct: 236 FDPSQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSCEAGLKVKPR 295
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+GD LLF+S+ PN T D +S+HG CPV+ G K+VATKWI D
Sbjct: 296 KGDALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIHDNR 338
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 246 bits (629), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 115/224 (51%), Positives = 153/224 (68%), Gaps = 5/224 (2%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
++G+D + P Q++SW PR + +P F PE+CK + +AK L PS LALR E
Sbjct: 97 AESGNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQE 156
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
T+ +RTS G F+S +D +G + +EEK A+VT LP +GE FN+LRY+ GQ Y+SHYD
Sbjct: 157 TENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDI 216
Query: 124 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD----GSYDYQKC-IGLKVK 178
F+P+ YGPQ SQR+A+ L YLTD+EEGGET+FP E D ++Y+ C G K K
Sbjct: 217 FEPESYGPQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYK 276
Query: 179 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
PR GD L+FYS+ PNGT D ++HG CPV+ GEKWVATKWIRD+
Sbjct: 277 PRMGDALMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIRDK 320
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 234 bits (597), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 112/226 (49%), Positives = 154/226 (68%), Gaps = 5/226 (2%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+P ++G D+V IPFQ+LSW PR + FP F + + +I +A + PS LA R GET
Sbjct: 2 LPAAESGSDNVYVIPFQILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYRPGET 61
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
VD +Q RTS+G F++AA D G L +E++IA T+LP NGEAFN+L Y+ Q Y+SH
Sbjct: 62 VDPSQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSH 121
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLK 176
YD FDP+E+GPQ SQR+A+ L+YL+++ EGGET+F E G++ + D++ C K
Sbjct: 122 YDTFDPKEFGPQPSQRIATVLLYLSEVLEGGETVFKRE-GVDGENRVIGDWRNCDDGSFK 180
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
PR GD +LF+ PNG IDP ++HG CPV +GEKWVATKWIR +
Sbjct: 181 YMPRMGDAVLFWGTKPNGDIDPHALHGGCPVKRGEKWVATKWIRSR 226
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 226 bits (576), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 109/225 (48%), Positives = 148/225 (65%), Gaps = 3/225 (1%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+P +G D IPFQ+LSW PR + FP F + + I+ +A + PS LA R GE
Sbjct: 2 LPAAASGSDVTYIIPFQILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQ 61
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
V+++Q RTS+G F+S+ D G L +E++IA T+LP NGEAFN+L Y+ Q Y+SH
Sbjct: 62 VESSQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSH 121
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY-DYQKCI--GLKV 177
D+FDP+++GPQ SQR+A+ L+YL+++ EGGET+F E AD D++ C K
Sbjct: 122 MDSFDPKDFGPQPSQRIATVLLYLSEVLEGGETVFKKEGVDGADRPIQDWRNCDDGSFKY 181
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
PR GD +LF+ PNG IDP S+HG CPV KGEKWVATKWIR +
Sbjct: 182 APRMGDAVLFWGTRPNGEIDPHSLHGGCPVKKGEKWVATKWIRSR 226
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 219 bits (558), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 110/225 (48%), Positives = 145/225 (64%), Gaps = 15/225 (6%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+ HG++G T P Q+LS PRA P F + +QC +I MA+ L PS LA + G+T
Sbjct: 116 LEHGESGHSFYTVQPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDT 175
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+NT+ ED G L IE+K+A VTM+P +GE FN+LRY+ Q Y+SH
Sbjct: 176 AENTRD------------EDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSH 223
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC-IGLKV 177
YD+F +EYGPQ SQR+A+ L+YL D+EEGGET+F E G+ DY+ C G+KV
Sbjct: 224 YDSFSEEEYGPQFSQRIATVLLYLADVEEGGETVFLLEGKGGLARLERIDYKACDTGIKV 283
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
KPRQGD LLF+S+ NGT+D S+HG CPVV G KW TKWIR++
Sbjct: 284 KPRQGDALLFFSVSVNGTLDKHSLHGGCPVVAGTKWAMTKWIRNR 328
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 216 bits (549), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 97/120 (80%), Positives = 111/120 (92%), Gaps = 1/120 (0%)
Query: 105 AFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD 164
A+N+LRY++GQKYNSHYDAF P EYGPQKSQR+ASFL+YL+D+EEGGETMFP+EN N D
Sbjct: 1 AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYEND-NID 59
Query: 165 GSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+YDY +CIGLKVKPRQGDGLLFYSL NGTIDPTSIHGSCPV+KGEKWVATKWIR++EQ
Sbjct: 60 SNYDYVQCIGLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWIRNEEQ 119
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 200 bits (508), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 142/231 (61%), Gaps = 7/231 (3%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
M G +G+ IPFQ+LS PR FPNF + + II +A + PS LA R GE
Sbjct: 200 MTPGDSGEAFYRTIPFQILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQ 259
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
V+ Q +RTS G F+ D S L +E KIA VT +PR NGE +N+L YK Q Y+SH
Sbjct: 260 VEAEQQVRTSKGTFLGG--DSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSH 317
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLE-EGGETMFPFENGMNADGSY-DYQKCI---GL 175
D+FDP+EYG Q SQR+A+ +V L+D GGET+F E N D ++ C GL
Sbjct: 318 MDSFDPKEYGQQYSQRIATVIVVLSDEGLVGGETVFKREGKANIDKPITNWTDCDADGGL 377
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+ KPR GD +LF+S P+G +D ++HGSCPVV G KWVA KWIR++ Y+
Sbjct: 378 RYKPRAGDAVLFWSAFPDGRLDQHALHGSCPVVTGNKWVAVKWIRNKGSYN 428
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
Q++SW PRA + NF T ++C+ +IN AK PS ++K VDN G +RTS
Sbjct: 68 QIVSWEPRAFLYHNFLTKKECEHLINTAK----PS---MQKSSVVDNETGKSKDSSVRTS 120
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F+ DE + IE++IA T +P NGE+FN+LRY++GQKY+ H D F
Sbjct: 121 SGTFLDRGGDE--IVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNT 178
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP G + + + C GL +KP+ GD LL
Sbjct: 179 VNGGQRIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSIKPKMGDALL 238
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+GT+DP+S+HG+CPV+KG+KW TKW+R E
Sbjct: 239 FWSMKPDGTLDPSSLHGACPVIKGDKWSCTKWMRINE 275
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 184 bits (467), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 91/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD+ G +RTS
Sbjct: 96 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 148
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F
Sbjct: 149 SGTFLRRGQDK--VIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 206
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP ++ + + +C G+ VKP+ GD LL
Sbjct: 207 KNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 266
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+GT+DPTS+HG CPV+KG+KW +TKWIR E
Sbjct: 267 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 303
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 183 bits (465), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 139/222 (62%), Gaps = 9/222 (4%)
Query: 7 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQ 65
GDD +V+SW PRA + NF T E+C+ +IN+AK N++ ST+ G + D+
Sbjct: 67 GDDGKGERWAEVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSETGRSKDSR- 125
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
+RTSSG F+S D+ + IE++IA + +P +GE +L Y++GQKY H+D F+
Sbjct: 126 -VRTSSGTFLSRGRDKK--IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFN 182
Query: 126 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQ 181
+ QRVA+ L+YL+D+EEGGET+FP G + + + +C GL VKP
Sbjct: 183 DEFNTKNGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNM 242
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
GD LLF+S+ P+ T+DP+S+HG CPV+ G KW ATKW+R E
Sbjct: 243 GDALLFWSMKPDATLDPSSLHGGCPVINGNKWSATKWMRVNE 284
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 183 bits (464), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 140/221 (63%), Gaps = 9/221 (4%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQG 66
DDS +++SW PRA + NF T E+CK +I +AK ++ ST+ K G++ D+
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 127
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQKY HYD F
Sbjct: 128 VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 185
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQG 182
+ QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VKP+ G
Sbjct: 186 EYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMG 245
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
D LLF+S+ P+ T+DP+S+HG C V+KG KW +TKW+R E
Sbjct: 246 DALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHE 286
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 182 bits (462), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 143/222 (64%), Gaps = 14/222 (6%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
G+ GD V + +SW PRA + NF + E+C +I++AK ++ ST+ ET N
Sbjct: 32 GKRGDQWV-----EFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVV--DNETGKN 84
Query: 64 TQ-GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+ +RTSSG+F++ +D+ + IE++IA T +P +GE IL Y++GQKY++HYD
Sbjct: 85 VEDSVRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 142
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVK 178
FD + + QR+A+ L+YL+D+EEGGET+FP G + + + KC GL VK
Sbjct: 143 FFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVK 202
Query: 179 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+ GD LLF+S+ P+ T+DPTS+HG+CPV++G KW TKWI
Sbjct: 203 PKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIH 244
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 91/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD+ G +RTS
Sbjct: 90 EVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 142
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F
Sbjct: 143 SGTFLRRGQDK--IVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 200
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP ++ + + +C G+ VKP+ GD LL
Sbjct: 201 KNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 260
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+GT+DPTS+HG CPV+KG+KW +TKWIR E
Sbjct: 261 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 297
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 128/208 (61%), Gaps = 9/208 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
+SW PRA + F T E+C +I++AK L+ S +A + T +RTSSG+FI A
Sbjct: 36 ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGN-SKTSEVRTSSGMFIPKA 94
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+D + IEEKIA T LP+ NGE +LRY+ GQKY HYD F + + R+A
Sbjct: 95 KDP--IVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDYFVDKVNIARGGHRLA 152
Query: 139 SFLVYLTDLEEGGETMFP------FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
+ L+YLT++E+GGET+FP M AD S G+ VKPR+GD LLFYSL P
Sbjct: 153 TVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGDALLFYSLHP 212
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
N T DP S+HG CPV++GEKW ATKWI
Sbjct: 213 NATPDPLSLHGGCPVIQGEKWSATKWIH 240
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 6/204 (2%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA + F + +C ++ MAK L+ S +A + G++V IRTSSG+F+S
Sbjct: 45 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSV--LSNIRTSSGMFLSK 102
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE ++ IEE+IA T LP+ NGEA +LRY+ G+KY HYD F + R+
Sbjct: 103 GQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRI 160
Query: 138 ASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
A+ L+YL+D+ +GGET+FP E+ D S+ G+ VKPR+GD LLFYSL P+ T
Sbjct: 161 ATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDATP 220
Query: 197 DPTSIHGSCPVVKGEKWVATKWIR 220
D +S+HG CPV++GEKW ATKWI
Sbjct: 221 DESSLHGGCPVIEGEKWSATKWIH 244
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 143/227 (62%), Gaps = 12/227 (5%)
Query: 5 QAGDDSVTNIPFQ---VLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGET 60
Q+ + V N P Q +LSW PRA + NF + E+C+ +I +AK + + S + + G++
Sbjct: 65 QSAESLVENPPEQWTEILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKS 124
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
++ +RTSSG+F+ +D+ + IE++IA T +P NGE IL Y++GQKY H
Sbjct: 125 TESR--VRTSSGMFLKRGKDK--IVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPH 180
Query: 121 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLK 176
YD F + QR+A+ L+YL+D+EEGGET+FP N + + D +C GL
Sbjct: 181 YDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLS 240
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
VKP+ GD LLF+S+ P+ T+DP+S+HG CPV+KG KW +TKW+ +E
Sbjct: 241 VKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHLRE 287
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+ +SW PRA F NF + E+C +I +A+ N++ S + VDN G +RTS
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAV-------VDNQTGKSKDSRVRTS 99
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F+ +DE + IEE+IAK T +P+ +GE +L Y++GQKY++H+D F +
Sbjct: 100 SGTFLRRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNT 157
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
QRVA+ L+YL+D+EEGGET+FP N + + +C G+ VKPR+GD LL
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALL 217
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ +DP S+HG CPV+KG KW ATKW+ +E
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 147/223 (65%), Gaps = 16/223 (7%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVD 62
G+ GD V + +SW PRA + NF + E+C +I++AK ++ ST+ K GE+VD
Sbjct: 67 GKRGDQWV-----EFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVD 121
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+ +RTSSG+F++ +D+ + IE++IA T +P +GE IL Y++GQKY++HYD
Sbjct: 122 SR--VRTSSGMFLNRGQDK--IIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 123 AFDPQEYGPQKS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 177
F EY +K QR+A+ L+YL+D+EEGGET+FP G + + + +C GL V
Sbjct: 178 YF-VDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSV 236
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
KP+ GD LLF+S+ P+ T+DPTS+HG+CPV++G KW TKW+
Sbjct: 237 KPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+ +SW PRA F NF + E+C +I +A+ N++ S + VDN G +RTS
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAV-------VDNQTGKSKDSRVRTS 99
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F+ +DE + IEE+IAK T +P+ +GE +L Y++GQKY++H+D F +
Sbjct: 100 SGTFLRRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNT 157
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC--IGLKVKPRQGDGLL 186
QRVA+ L+YL+D+EEGGET+FP N + + +C G+ VKPR+GD LL
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALL 217
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ +DP S+HG CPV+KG KW ATKW+ +E
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 136/222 (61%), Gaps = 29/222 (13%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
Q++SW PRA + NF T E+C+ +IN+AK PS + K E +D G IRTS
Sbjct: 67 QIISWEPRAFLYHNFLTKEECEHLINIAK----PS---MHKSEVIDEKTGKSLNSSIRTS 119
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F+ DE + IE++IA T +P +GE+FN+L Y++GQKY HYD F
Sbjct: 120 SGTFLDREGDE--IVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYDYFLDTFST 177
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY-------DYQKCI--GLKVKPRQ 181
QR+A+ L+YL+D+EEGGET+FP NA G++ + C GL +KP+
Sbjct: 178 RHAGQRIATMLMYLSDVEEGGETVFP-----NAKGNFSSVPWWNELSDCGKGGLSIKPKM 232
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
G+ +LF+S+ P+ T+DP+S+HG+CPV+KG+KW KW+ E
Sbjct: 233 GNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADE 274
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 180 bits (457), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/205 (45%), Positives = 130/205 (63%), Gaps = 7/205 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA + F + +C ++ MAK L+ S +A + G++V IRTSSG+F+S
Sbjct: 31 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSV--LSNIRTSSGMFLSK 88
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE ++ IEE+IA T LP+ NGEA +LRY+ G+KY HYD F + R+
Sbjct: 89 GQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRI 146
Query: 138 ASFLVYLTDLEEGGETMFPF--ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
A+ L+YL+D +GGET+FP E+ D S+ G+ VKPR+GD LLFYSL P+ T
Sbjct: 147 ATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDAT 206
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIR 220
D +S+HG CPV++GEKW ATKWI
Sbjct: 207 PDESSLHGGCPVIEGEKWSATKWIH 231
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 126/206 (61%), Gaps = 9/206 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ--GIRTSSGVFIS 76
+SW PRA +P+F + ++ ++++A+ L+ S +A ET +Q +RTSSG FIS
Sbjct: 54 ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVA---DETSGKSQLSEVRTSSGTFIS 110
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
+D + IE+KIA T LP+ NGE +LRYK G+KY HYD F R
Sbjct: 111 KGKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHR 168
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNG 194
VA+ L+YLTD+ EGGET+FP G +C G+ VKPR+GD LLF++L P+
Sbjct: 169 VATVLLYLTDVAEGGETVFPLAKGRKGSHHKGLSECAQKGIAVKPRKGDALLFFNLRPDA 228
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
DPTS+HG C V+KGEKW ATKWIR
Sbjct: 229 ATDPTSLHGGCEVIKGEKWSATKWIR 254
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 179 bits (455), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD + G +RTS
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDASTGGSKDSRVRTS 163
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F +
Sbjct: 164 SGMFLGRGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNT 221
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 222 KNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALL 281
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+G++D TS+HG CPV+KG KW +TKW+R E
Sbjct: 282 FWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 318
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 179 bits (455), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 96/223 (43%), Positives = 144/223 (64%), Gaps = 16/223 (7%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
G+ GD V + +SW PRA + NF + E+C +I++AK ++ ST+ ET N
Sbjct: 67 GKRGDQWV-----EFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVV--DNETGKN 119
Query: 64 TQ-GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+ +RTSSG+F++ +D+ + IE++IA T +P +GE IL Y++GQKY++HYD
Sbjct: 120 VEDSVRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 123 AFDPQEYGPQKS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 177
F EY +K QR+A+ L+YL+D+EEGGET+FP G + + + KC GL V
Sbjct: 178 YF-VDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSV 236
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
KP+ GD LLF+S+ P+ T+DPTS+HG+CPV++G KW TKW+
Sbjct: 237 KPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 179 bits (454), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ +D+ + IE++IA T +P GE +L Y++GQKY H+D F
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+E+GGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 267
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+G++DPTS+HG CPV+KG KW +TKW+R E
Sbjct: 268 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 133/216 (61%), Gaps = 28/216 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + F + +C +IN+AK ++ S +A DN G +RTSSG
Sbjct: 38 LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVA-------DNDSGKSLMSQVRTSSG 90
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F++ EDE + IE+++A T LP N E+ +LRY+IGQKY++H+D F +
Sbjct: 91 AFLAKHEDE--IVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKH 148
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDG 184
QR A+ L+YLTD+++GGET+FP NA+GS+ + +C GL VKP++GD
Sbjct: 149 GGQRFATVLMYLTDVKKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDA 203
Query: 185 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LLF+ L N T D +S+HGSCPV++GEKW ATKWI
Sbjct: 204 LLFFGLHLNATTDTSSLHGSCPVIEGEKWSATKWIH 239
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD + G +RTS
Sbjct: 12 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDASTGGSKDSRVRTS 64
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F +
Sbjct: 65 SGMFLGRGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNT 122
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 123 KNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALL 182
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+G++D TS+HG CPV+KG KW +TKW+R E
Sbjct: 183 FWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 219
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 154 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 206
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ +D+ + IE++IA T +P GE +L Y++GQKY H+D F
Sbjct: 207 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 264
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+E+GGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 265 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 324
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+G++DPTS+HG CPV+KG KW +TKW+R E
Sbjct: 325 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 361
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 139/212 (65%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PRA + NF T E+C+ +I++AK +++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSR--VRTSSGTFL 134
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ T+ IE++++ + +P +GE +L Y++GQKY H+D F + Q
Sbjct: 135 PRGRDK--TVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLL 191
R+A+ L+YL+D+EEGGET+FP G + + + C GL VKP++GD LLF+S+
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ ++DP+S+HG CPV+KG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWVRVEE 284
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 135/212 (63%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PRA + NF T E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSR--VRTSSGTFL 136
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ D+ + IE+KIA T +P +GE +L Y++GQKY HYD F + Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 136 RVASFLVYLTDLEEGGETMFPFENG--MNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 191
R+A+ L+YLTD+EEGGET+FP G N + C GL +KP++GD LLF+S+
Sbjct: 195 RIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMK 254
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ T+D +S+HG CPV+KG KW +TKWIR E
Sbjct: 255 PDATLDASSLHGGCPVIKGNKWSSTKWIRVNE 286
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
+SW PRA + NF T E+C I +AK L S +A + G++V++ +RTSSG+F
Sbjct: 65 ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESE--VRTSSGMFFRK 122
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
A+D+ + +E +IA T LP NGE+ IL Y+ GQKY H+D F + RV
Sbjct: 123 AQDQ--VVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRV 180
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGS-YDYQKCI--GLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+D+E+GGET+FP D+ C G VKPR+GD LLF+SL P+
Sbjct: 181 ATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGDALLFFSLHPDA 240
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T DP S+HGSCPV++GEKW ATKWI
Sbjct: 241 TTDPLSLHGSCPVIEGEKWSATKWIH 266
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/207 (44%), Positives = 131/207 (63%), Gaps = 9/207 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLAL--RKGETVDNTQGIRTSSGVFIS 76
LSW PRA + F + E+C +IN+AK L S + GE++D+ + RTSSGVF++
Sbjct: 35 LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEE--RTSSGVFLT 92
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
+D+ + +E K+A T LP NGEA IL Y+ GQKY+ H+D + +E R
Sbjct: 93 KRQDD--IVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHR 150
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPN 193
+A+ L+YL+++ +GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 151 IATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECAKQGYAVKPRKGDALLFFNLHPN 210
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIR 220
T DPTS+HGSCPV++GEKW AT+WI
Sbjct: 211 ATTDPTSLHGSCPVIEGEKWSATRWIH 237
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 139/212 (65%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PRA + NF T E+C+ +I++AK +++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSR--VRTSSGTFL 134
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ T+ IE++++ + +P +GE +L Y++GQKY H+D F + Q
Sbjct: 135 PRGRDK--TVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLL 191
R+A+ L+YL+D+EEGGET+FP G + + + C GL VKP++GD LLF+S+
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ ++DP+S+HG CPV+KG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWMRVEE 284
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/226 (40%), Positives = 140/226 (61%), Gaps = 9/226 (3%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETV 61
H DD +V+SW PRA + NF T E+C+ +I++AK N+ S++ G++
Sbjct: 66 HTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSK 125
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
D+ +RTSSG F++ D+ + IE++IA + +P +GE +L Y++GQKY HY
Sbjct: 126 DSR--VRTSSGTFLARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHY 181
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 177
D F QR+A+ L+YLTD+EEGGET+FP G + + + +C GL +
Sbjct: 182 DYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSI 241
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
KP++GD LLF+S+ P+ T+DP+S+HG CPV+KG KW +TKW+R E
Sbjct: 242 KPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVSE 287
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/216 (43%), Positives = 132/216 (61%), Gaps = 28/216 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + F + +C ++ +AK L+ S +A DN G +RTSSG
Sbjct: 34 LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVA-------DNDSGKSVMSQVRTSSG 86
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F++ EDE + IE+++A T LP N E+ +L Y++GQKY++H+D F +
Sbjct: 87 TFLNKHEDE--IISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKL 144
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKCI--GLKVKPRQGDG 184
RVA+ L+YLTD+++GGET+FP NA+G + + +C GL VKPR+GD
Sbjct: 145 GGHRVATVLMYLTDVKKGGETVFP-----NAEGRHLQHKDETWSECARSGLAVKPRKGDA 199
Query: 185 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LLF+SL N T DP+S+HGSCPV++GEKW ATKWI
Sbjct: 200 LLFFSLHINATTDPSSLHGSCPVIEGEKWSATKWIH 235
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 122/184 (66%), Gaps = 7/184 (3%)
Query: 48 LRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFN 107
+ PS LA R GE + Q +RTS G F+ D S L +E+KIA VT+LPR NGE +N
Sbjct: 1 MYPSGLAYRPGEKAEAEQQVRTSKGTFLGG--DSSPALRWLEDKIAAVTLLPRTNGEFWN 58
Query: 108 ILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLE-EGGETMFPFENGMNADGS 166
+L YK Q Y+SH D+FDP+EYGPQ SQR+A+ +V L+D GGET+F E + +
Sbjct: 59 VLNYKHSQHYDSHMDSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKP 118
Query: 167 Y-DYQKCI---GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
++ C GLK KPR GD +LF+S P+G +DP ++HGSCPVV G KWVA KW+R++
Sbjct: 119 ISNWTDCDADGGLKYKPRAGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLRNK 178
Query: 223 EQYD 226
+YD
Sbjct: 179 GEYD 182
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGASKDSRVRTS 149
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ +D+ + IE++IA T +P +GE +L Y++GQKY H+D F
Sbjct: 150 SGMFLRRGQDK--IIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+E+GGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 267
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+G++D TS+HG CPV+KG KW +TKW+R E
Sbjct: 268 FWSMKPDGSMDSTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 176 bits (447), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 127/207 (61%), Gaps = 11/207 (5%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
+SW PRA F +C+ +I+ AK ++ ST+ G+++D+T +RTS+G F
Sbjct: 86 VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDST--VRTSTGTFFGR 143
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 135
EDE + IE +I+ +T LP +NGE IL Y+ GQKY +H+D F D P+ Q
Sbjct: 144 EEDE--VIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGGQ 201
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPN 193
R+A+ L+YLT EEGGET+FP N + +C G VK R+GD LLFYSLLPN
Sbjct: 202 RIATVLMYLTTAEEGGETVFPM--AANKVTGPQWSECARGGAAVKSRRGDALLFYSLLPN 259
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIR 220
G DPTS+HGSCP KGEKW ATKWI
Sbjct: 260 GETDPTSLHGSCPTTKGEKWSATKWIH 286
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 176 bits (447), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 133/213 (62%), Gaps = 11/213 (5%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VLSW PRA + NF + E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSR--VRTSSGTFL 134
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ + IE++IA T +P +GE +L Y+ GQKY HYD F + Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-----GLKVKPRQGDGLLFYSL 190
R+A+ L+YL+D+EEGGET+FP N MN Y + GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSM 251
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ T+DPTS+HG CPV++G KW +TKWI E
Sbjct: 252 RPDATLDPTSLHGGCPVIRGNKWSSTKWIHVGE 284
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 130/211 (61%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + F + ++C ++N+AK + S +A DN G +RTSSG
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVA-------DNDSGKSIMSQVRTSSG 92
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+S ED+ + IE+++A T LP N E+ IL Y++GQKY++H+D F + +
Sbjct: 93 TFLSKHEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKR 150
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMN---ADGSYDYQKCIGLKVKPRQGDGLLFYS 189
RVA+ L+YLTD+++GGET+FP G + D ++ GL VKP++GD LLF+S
Sbjct: 151 GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFS 210
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L N T DP S+HGSCPV++GEKW ATKWI
Sbjct: 211 LHVNATTDPASLHGSCPVIEGEKWSATKWIH 241
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 130/211 (61%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + F + ++C ++N+AK + S +A DN G +RTSSG
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVA-------DNDSGKSIMSQVRTSSG 92
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+S ED+ + IE+++A T LP N E+ IL Y++GQKY++H+D F + +
Sbjct: 93 TFLSKHEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKR 150
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMN---ADGSYDYQKCIGLKVKPRQGDGLLFYS 189
RVA+ L+YLTD+++GGET+FP G + D ++ GL VKP++GD LLF+S
Sbjct: 151 GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFS 210
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L N T DP S+HGSCPV++GEKW ATKWI
Sbjct: 211 LHVNATTDPASLHGSCPVIEGEKWSATKWIH 241
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 176 bits (446), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 133/213 (62%), Gaps = 11/213 (5%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VLSW PRA + NF + E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSR--VRTSSGTFL 134
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ + IE++IA T +P +GE IL Y+ GQKY HYD F + Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-----GLKVKPRQGDGLLFYSL 190
R+A+ L+YL+D+EEGGET+FP N MN Y + GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSM 251
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ T+DPTS+HG CPV++G KW +TKW+ E
Sbjct: 252 RPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGE 284
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 176 bits (445), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 130/208 (62%), Gaps = 13/208 (6%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 77
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 16 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 73
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 135
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 74 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131
Query: 136 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 192
RV + L+YLT +EEGGET+ P E + DG + +C GL VKP +GD L+FYSL P
Sbjct: 132 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALMFYSLKP 188
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 189 DGSNDPASLHGSCPTLKGDKWSATKWIH 216
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 176 bits (445), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 129/206 (62%), Gaps = 9/206 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 77
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 45 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 102
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 135
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 103 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160
Query: 136 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RV + L+YLT +EEGGET+ P E + DG + K GL VKP +GD L+FYSL P+G
Sbjct: 161 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAK-RGLAVKPIKGDALMFYSLKPDG 219
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 220 SNDPASLHGSCPTLKGDKWSATKWIH 245
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 176 bits (445), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 130/208 (62%), Gaps = 13/208 (6%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 77
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 135
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 136 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 192
RV + L+YLT +EEGGET+ P E + DG + +C GL VKP +GD L+FYSL P
Sbjct: 141 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALMFYSLKP 197
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 198 DGSNDPASLHGSCPTLKGDKWSATKWIH 225
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 176 bits (445), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 130/208 (62%), Gaps = 13/208 (6%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 77
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 17 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 74
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 135
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 75 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132
Query: 136 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 192
RV + L+YLT +EEGGET+ P E + DG + +C GL VKP +GD L+FYSL P
Sbjct: 133 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALMFYSLKP 189
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 190 DGSNDPASLHGSCPTLKGDKWSATKWIH 217
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 134/214 (62%), Gaps = 19/214 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
Q++SW PRA + NF T E+C+ +IN+AK ++ S + + G VD+ + RTSSG F+
Sbjct: 116 QIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRE--RTSSGAFL 173
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D + IE +IA T +P +GE FN+L Y++GQKY HYD F Q
Sbjct: 174 KRGSDR--IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQ 231
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSY-------DYQKC--IGLKVKPRQGDGLL 186
R+A+ L+YL+D+EEGGET+FP NA G++ + C GL +KP+ G+ +L
Sbjct: 232 RIATMLMYLSDVEEGGETVFP-----NAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAIL 286
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F+S+ P+ T+DP+S+HG+CPV+KG+KW+ KW+
Sbjct: 287 FWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMH 320
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 133/213 (62%), Gaps = 11/213 (5%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VLSW PRA + NF + E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSR--VRTSSGTFL 134
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ + IE++IA T +P +GE +L Y+ GQKY HYD F + Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-----GLKVKPRQGDGLLFYSL 190
R+A+ L+YL+D+EEGGET+FP N MN Y + GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSM 251
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ T+DPTS+HG CPV++G KW +TKW+ E
Sbjct: 252 RPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGE 284
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 175 bits (443), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/214 (43%), Positives = 132/214 (61%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C +I++AK L+ S++A DN G +RTSSG
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVA-------DNLSGKSKVSEVRTSSG 96
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FI A+D + IE+KIA T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 97 AFIHKAKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIAR 154
Query: 133 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+D+E+GGET+FP + ++ + D C G+ VKPR+GD LL
Sbjct: 155 GGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALL 214
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F+SL PN D +S+HG CPV++GEKW ATKWIR
Sbjct: 215 FFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIR 248
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 138/226 (61%), Gaps = 9/226 (3%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETV 61
H +A +D + +V+SW PRA + NF T E+C+ +IN+A +++ ST+A + G++V
Sbjct: 61 HIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSV 120
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
+R S+G F+ +DE + IE++IA VT +P NGE ++ Y++GQ Y+ HY
Sbjct: 121 --VHDVRKSTGAFLDRGQDE--IVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHY 176
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 177
D F QR+A+ L+YL+++EEGGETMFP + + + C +GL +
Sbjct: 177 DYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCGKMGLSI 236
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
KP+ GD LLF+S+ PN T+D ++H +CPV+KG KW TKW+ E
Sbjct: 237 KPKMGDALLFWSMKPNATLDALTLHSACPVIKGNKWSCTKWMHPTE 282
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 134/209 (64%), Gaps = 11/209 (5%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
++LSW PRA + NF + E+C+ +IN+AK ++ ST+ K G + D+ +RTSSG+F+
Sbjct: 78 EILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSR--VRTSSGMFL 135
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D + IE++IA + +P +GE +L Y++GQKY +H+D F + Q
Sbjct: 136 RRGRDR--VIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQ 193
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGS---YDYQKCI--GLKVKPRQGDGLLFYSL 190
R A+ L+YL+D+EEGGET+FP N MN + +C GL +KP+ G+ LLF+S
Sbjct: 194 RTATLLMYLSDVEEGGETVFPAAN-MNISAVPWWNELSECAKQGLSLKPKMGNALLFWST 252
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
P+ T+DP+S+HGSCPV++G KW ATKW+
Sbjct: 253 RPDATLDPSSLHGSCPVIRGNKWSATKWM 281
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 137/212 (64%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+++SW PRA + NF + E+C+ +I++AK ++ ST+ G + D+ +RTSSG+F+
Sbjct: 78 EIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSR--VRTSSGMFL 135
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ + IE++IA T +P +GE +L Y++GQKY++HYD F + Q
Sbjct: 136 RRGRDK--IIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLL 191
R+A+ L+YL+D+EEGGET+FP + + + +C GL VKP+ GD LLF+S+
Sbjct: 194 RIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMR 253
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ T+DP+S+HG CPV+KG KW +TKW+ +E
Sbjct: 254 PDATLDPSSLHGGCPVIKGNKWSSTKWMHVEE 285
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 131/206 (63%), Gaps = 9/206 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 77
LSW PRA F + E+C+ II AK + + S + G++VD+ IRTS+G +++
Sbjct: 57 LSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSE--IRTSTGAWLAK 114
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 135
EDE + IE+++A+VTM+P N E +L Y GQKY HYD F DP P+ Q
Sbjct: 115 GEDE--IISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172
Query: 136 RVASFLVYLTDLEEGGETMFPF-ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RV + L+YLT +EEGGET+ P + ++ +G + K GL VKP +GD L+FYSL P+G
Sbjct: 173 RVVTVLMYLTTVEEGGETVLPHADQKVSGEGWSECAK-RGLAVKPVKGDALMFYSLKPDG 231
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 232 SNDPASLHGSCPTLKGDKWSATKWIH 257
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 130/214 (60%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 42 VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA-------DNESGKSKLSEVRTSSG 94
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+FI+ A+D + IE+KIA T LPR NGE +LRY+ GQKY+ HYD F + +
Sbjct: 95 MFITKAKDP--IVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFSDKVNIAR 152
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGM---NADGSY-DYQKCI--GLKVKPRQGDGLL 186
RVA+ L+YLTD+E+GGET+FP + A S+ D +C G+ VKPR+GD LL
Sbjct: 153 GGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGDALL 212
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F+SL P D +SIH CPV++GEKW ATKWI
Sbjct: 213 FFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWIH 246
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PRA + NF T E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSR--VRTSSGTFL 136
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ D+ + IE+KI+ T +P +GE +L Y++GQKY HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 194
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ-------KC--IGLKVKPRQGDGLL 186
R+A+ L+YLTD+EEGGET+FP A G++ + +C GL +KP++GD LL
Sbjct: 195 RIATVLMYLTDVEEGGETVFP-----AAKGNFSFVPWWNELFECGKKGLSIKPKRGDALL 249
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ ++DP+S+HG CPV+KG KW +TKW+R E
Sbjct: 250 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWMRVSE 286
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 173 bits (439), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + E+C +I +AK ++ ST+ VD+T G +RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ T+DP S+HG CPV+KG KW +TKW+ +E
Sbjct: 271 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVRE 307
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 173 bits (439), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + ++C+ +I +AK +++ ST+ VD++ G +RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F++ +D+ + IE++++ T LP +GE IL Y++GQKY HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP G + + + C GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALL 248
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ ++DP+S+HG CPV+KG KW +TKWIR E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + ++C+ +I +AK +++ ST+ VD++ G +RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F++ +D+ + IE++++ T LP +GE IL Y++GQKY HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP G + + + C GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALL 248
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ ++DP+S+HG CPV+KG KW +TKWIR E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 135/211 (63%), Gaps = 7/211 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+V+SW PRA+ + NF + E+C+ +IN+AK ++ ST+ K +++ +RTSSG F+
Sbjct: 81 EVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
DE +++IE++I+ T +P NGE +L Y++GQKY HYD F + QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLP 192
+A+ L+YL+D+++GGET+FP G + + + KC GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+ ++DP+S+HG CPVVKG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLAL-RKGETVDNTQGIRTSSGVFISA 77
LSW PRA + F + E+C +IN+AK L S +A GE++++ + RTSSG+FI
Sbjct: 21 LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQE--RTSSGMFIFK 78
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
EDE ++ IE +IA T LP NGE ILRY+ GQKY +H D F + + R
Sbjct: 79 TEDE--IVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRA 136
Query: 138 ASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+D+++GGET+FP E D S+ G VKP +GD LLF+SL P+
Sbjct: 137 ATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCAKKGYAVKPNKGDALLFFSLHPDA 196
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T DP S+H SCPV++GEKW ATKWI
Sbjct: 197 TPDPGSLHASCPVIEGEKWSATKWIH 222
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PRA + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D+ + +E K+A T +P NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 135/218 (61%), Gaps = 21/218 (9%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + E+C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 185
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRMATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSECAKRGLSVKPKMGDAL 266
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
LF+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 267 LFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 304
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 125/206 (60%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 75 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 132
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D+ + +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 133 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 190
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 191 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 250
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 251 TTDSNSLHGSCPVVEGEKWSATRWIH 276
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D+ ++ +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 85/213 (39%), Positives = 137/213 (64%), Gaps = 9/213 (4%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVF 74
+V+SW PRA + NF T E+C+ +I++AK ++ S + ++ G+++D+ +RTSSG F
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSR--VRTSSGTF 137
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
+ DE ++ IE +I+ T +P NGE +L Y++GQKY H+D F + +
Sbjct: 138 LKRGHDE--IVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVRKGG 195
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSL 190
QR+A+ L+YL+D++EGGET+FP G +D + + +C GL V P++ D LLF+S+
Sbjct: 196 QRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSVLPKKRDALLFWSM 255
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ ++DP+S+HG CPV+KG KW +TKW E
Sbjct: 256 KPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 88/210 (41%), Positives = 126/210 (60%), Gaps = 17/210 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PR + +F + ++ +I++A+ L+ S +A DN G +RTSSG
Sbjct: 54 ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLSDVRTSSG 106
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+ +D ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 107 TFLRKGQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIR 164
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSL 190
R A+ L+YLTD+ EGGET+FP ++ + +C G+ VKPR+GD LLF++L
Sbjct: 165 GGHRYATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQKGIAVKPRKGDALLFFNL 224
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+GT DP S+HG C V++GEKW ATKWIR
Sbjct: 225 KPDGTTDPVSLHGGCAVIRGEKWSATKWIR 254
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/228 (40%), Positives = 135/228 (59%), Gaps = 19/228 (8%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
+ A +V+ + +SW PRA + F + E+C +I++AK L+ S +A D
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVA-------D 76
Query: 63 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 116
N G +RTSSG+FI +D + IE+KIA T LP+ NGE +LRY+ GQK
Sbjct: 77 NVSGKSRLSEVRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQK 134
Query: 117 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD--GSYDYQKCI- 173
Y++HYD F + + R+A+ L+YL+D+ +GGET+FP ++ + D +C
Sbjct: 135 YDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECAR 194
Query: 174 -GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL P DP S+HG CPV++GEKW ATKWI
Sbjct: 195 KGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIH 242
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 135/211 (63%), Gaps = 7/211 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+V+SW PRA+ + NF T E+C+ +I++AK ++ ST+ K +++ +RTSSG F+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
DE +++IE++I+ T +P NGE +L Y++GQKY HYD F + QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLP 192
+A+ L+YL+D+++GGET+FP G + + + KC GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+ ++DP+S+HG CPVVKG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 84/213 (39%), Positives = 138/213 (64%), Gaps = 9/213 (4%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVF 74
+V+SW PRA + NF T E+C+ +I++AK ++ S + ++ G+++D+ +RTSSG F
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSR--VRTSSGTF 137
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
++ DE ++ IE +I+ T +P NGE +L Y++GQ+Y H+D F + +
Sbjct: 138 LNRGHDE--IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGG 195
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSL 190
QR+A+ L+YL+D++EGGET+FP G +D + + +C GL V P++ D LLF+S+
Sbjct: 196 QRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSM 255
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ ++DP+S+HG CPV+KG KW +TKW E
Sbjct: 256 KPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 92/215 (42%), Positives = 131/215 (60%), Gaps = 22/215 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C +I++AK L+ S++A DN G +RTSSG
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVA-------DNLSGKSKVSEVRTSSG 96
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FI A+D + IE+KIA T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 97 AFIHKAKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIAR 154
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMN-----ADGSYDYQKCI--GLKVKPRQGDGL 185
R+A+ L+YL+D+E+GGET+F + ++ + D C G+ VKPR+GD L
Sbjct: 155 GGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDAL 214
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LF+SL PN D +S+HG CPV++GEKW ATKWIR
Sbjct: 215 LFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIR 249
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 172 bits (436), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 135/218 (61%), Gaps = 21/218 (9%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + ++C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 185
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSDCAKRGLSVKPKMGDAL 266
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
LF+S+ P+ T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 267 LFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/223 (39%), Positives = 135/223 (60%), Gaps = 19/223 (8%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG- 66
DD +++SW PR + NF E+C+ +IN+AK +++ ST+ VD+T G
Sbjct: 62 DDDDNKRWVEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTV-------VDDTTGK 114
Query: 67 -----IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
RTSSG FI D+ L IE++IA T +P +GE NIL Y++GQKY+ H
Sbjct: 115 SVNSSARTSSGTFIDRGYDK--ILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHT 172
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 177
D F+ + +R+A+ L+YL+D+EEGGET+FP G + + + C GL +
Sbjct: 173 DYFEDEVNTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGLSI 232
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
KP+ G+ +LF+ + P+ T+DP S+HG+CPV+KG+KW TKW+R
Sbjct: 233 KPKMGNAILFWGMKPDATVDPLSVHGACPVIKGDKWSCTKWMR 275
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 125/206 (60%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D+ + +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 136/218 (62%), Gaps = 16/218 (7%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVD---------NTQG 66
+VLSW PRA+ + NF E+C+ +IN+AK ++ ST+ G++ D N
Sbjct: 82 EVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSR 141
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTSSG+F++ +D+ T+ IE++IA T +P +GE +L Y++GQKY H+D F
Sbjct: 142 VRTSSGMFLNRGQDK--TIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLD 199
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC--IGLKVKPRQG 182
+ QR+A+ L+YL+D+E+GGET+FP N + + +C G+ V+PR G
Sbjct: 200 EFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMG 259
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
D LLF+S+ P+ +DP+S+H CPV++G+KW ATKWI
Sbjct: 260 DALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWIH 297
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/215 (42%), Positives = 133/215 (61%), Gaps = 13/215 (6%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P + + P+A F NF T E+C+ +I +AK L PST+ G+ GIRTS+G+F
Sbjct: 228 PLVLSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGKK-STKSGIRTSAGMF 286
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK 133
++ + ++ T+ ++EE++A LP NGE ILRY+ GQKY+ HYD F D P +
Sbjct: 287 LT--KGQTPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINPSPNR 344
Query: 134 S-QRVASFLVYLTDLEEGGETMFP-------FENGMNADGSYDYQKCIGLKVKPRQGDGL 185
QR+A+ L+YL D EEGGET+FP F +G DG++ GL VK ++GD +
Sbjct: 345 GGQRMATMLIYLKDTEEGGETIFPNAKKPEGFHDG-EKDGAFSDCAKRGLPVKSKRGDAV 403
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LF+SL + +D S+HG+CPV++GEKW A KWIR
Sbjct: 404 LFWSLTSDYKLDEGSLHGACPVLRGEKWTAVKWIR 438
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 133/217 (61%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + E+C+ +I +AK + ST+ VD+T G +RTS
Sbjct: 108 EVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV-------VDSTTGKSKDSRVRTS 160
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + IE +IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 161 SGMFLRRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 218
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 219 KNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALL 278
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 279 FWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGE 315
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+V+SW PRA+ + NF T E+C+ +I++AK ++ ST+ K +++ +RTSSG F+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
DE +++IE++I+ T +P NGE +L Y++GQKY HYD F + QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLP 192
+A+ L+YL+D+++GGET+FP G + + + KC GL V P+ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMRP 257
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+ ++DP+S+HG CPVVKG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + E+C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + +IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP N + + + +C GL VKP+ GD LL
Sbjct: 209 KNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALL 268
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 269 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 305
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 171 bits (434), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 134/218 (61%), Gaps = 21/218 (9%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + ++C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 185
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSDCAKRGLSVKPKMGDAL 266
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
LF+S+ P T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 267 LFWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 132/208 (63%), Gaps = 9/208 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
+++SW PRA + NF + E+C+ +I +AK ++ ST+ K G + D+ +RTSSG+F+
Sbjct: 79 EIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSR--VRTSSGMFL 136
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ + IE++IA + +P +GE +L Y++GQKY +HYD F + Q
Sbjct: 137 RRGRDK--IIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQ 194
Query: 136 RVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLL 191
R A+ L+YL+D+EEGGET+FP N N + +C GL VKP+ G+ LLF+S
Sbjct: 195 RTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTR 254
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWI 219
P+ T+DP S+HGSCPV++G KW ATKW+
Sbjct: 255 PDATLDPASLHGSCPVIRGNKWSATKWM 282
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 130/206 (63%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA + F + E+C +I++A+ L S +A + G+++++ +RTSSG+FI+
Sbjct: 52 LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESE--VRTSSGMFIAK 109
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
A+DE + IE +IA T LP NGE+ IL Y+ GQKY H+D F + RV
Sbjct: 110 AQDE--IVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRV 167
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G + D + C G VKP +GD LLF+SL P+
Sbjct: 168 ATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDA 227
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPV++GEKW ATKWI
Sbjct: 228 TTDSDSLHGSCPVIEGEKWSATKWIH 253
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 138/225 (61%), Gaps = 9/225 (4%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVD 62
G D+ +V+SW PRA + NF T +C+ +IN+AK ++ ST+ G++ D
Sbjct: 68 GSGDDEGKAEQWAEVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKD 127
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+ +RTSSG F+ D+ + IE++IA + +P +GE IL Y++GQ+Y H+D
Sbjct: 128 SK--VRTSSGTFLPRGRDK--IVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFD 183
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVK 178
F + QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VK
Sbjct: 184 YFMDEYNTKNGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVK 243
Query: 179 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ GD LLF+S+ P+G+ DP+S+HG CPV++G KW +TKW+R E
Sbjct: 244 PKMGDALLFWSMNPDGSPDPSSLHGGCPVIRGNKWSSTKWMRVNE 288
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 132/208 (63%), Gaps = 9/208 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PRA + NF + E+C+ +I +AK + ST+ G++ D+ +RTSSG+F+
Sbjct: 105 EVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSR--VRTSSGMFL 162
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D+ + IE +IA T +P +GE +L Y++GQKY H+D F + Q
Sbjct: 163 QRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQ 220
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLL 191
R+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LLF+S+
Sbjct: 221 RMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMK 280
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWI 219
P+ T+DP S+HG CPV+KG KW +TKW+
Sbjct: 281 PDATLDPLSLHGGCPVIKGNKWSSTKWL 308
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 171 bits (432), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/213 (41%), Positives = 131/213 (61%), Gaps = 9/213 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPST-LALRKGETVDNTQGIRTSSGVFI 75
+++SW PR + NF T E+C+ +IN+AK N+R ST + G ++++ +RTSSG F+
Sbjct: 86 EIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESR--VRTSSGTFL 143
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ D+ + IE +IA T +P NGE +L Y++G+KY H+D F
Sbjct: 144 ARGRDK--IVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGGD 201
Query: 136 RVASFLVYLTDLEEGGETMFPFENG--MNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 191
R+A+ L+YL+D+EEGGET+FP G + G + C GL +KP+ + LLF+S+
Sbjct: 202 RIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCGKKGLSIKPKMRNALLFWSIK 261
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
P+ T DP S+HGSCPV+KG KW +TKWIR E
Sbjct: 262 PDATYDPLSLHGSCPVIKGNKWSSTKWIRIGEH 294
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 95/232 (40%), Positives = 134/232 (57%), Gaps = 25/232 (10%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
+ A +V+ + +SW PRA + F + E+C +I++AK L+ S +A D
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVA-------D 76
Query: 63 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 116
N G +RTSSG+FI +D + IE+KIA T LP+ NGE +LRY+ GQK
Sbjct: 77 NVSGKSRLSEVRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQK 134
Query: 117 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG--------MNADGSYD 168
Y++HYD F + + R+A+ L+YL+D+ +GGET+FP N D S
Sbjct: 135 YDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSEC 194
Query: 169 YQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+K G+ VKPR+GD LLF+SL P DP S+HG CPV++GEKW ATKWI
Sbjct: 195 ARK--GIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIH 244
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 133/225 (59%), Gaps = 19/225 (8%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 80
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FI +D + +E+KI+ T+LP+ NGE +LRY+ GQKY
Sbjct: 81 LSGESKLSEVRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKY 138
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GL 175
+ HYD F + + RVA+ L+YLTD+ +GGET+FP +++ D +C G+
Sbjct: 139 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGI 198
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 199 AVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 243
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 125/211 (59%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + F T E+C +I+MAK L S +A DN G +RTSSG
Sbjct: 58 LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVA-------DNESGKSIPSEVRTSSG 110
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+F+ A+D+ + IE +IA T LP NGEA IL Y+ GQKY H+D F +
Sbjct: 111 MFLQKAQDD--VVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQL 168
Query: 133 KSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYS 189
R+A+ L+YL+++EEGGET+FP E + + C G VKP++GD LLF+S
Sbjct: 169 GGHRIATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGGYSVKPKKGDALLFFS 228
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L P+ + D S+HGSCPV++GEKW ATKWI
Sbjct: 229 LHPDASTDSLSLHGSCPVIEGEKWSATKWIH 259
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 133/212 (62%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
++LSW PRA + NF + E+C+ +IN+AK L S++ K G++ ++ +RTSSG+F+
Sbjct: 70 EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESR--VRTSSGMFL 127
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D+ + IE +IA T +P NGE +L Y +G+KY HYD F + Q
Sbjct: 128 KRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 185
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLL 191
RVA+ L+YL+D+EEGGET+FP + + D +C GL +KP+ GD LLF+S+
Sbjct: 186 RVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMR 245
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ T+D +S+HG CPV+ G KW +TKW+ +E
Sbjct: 246 PDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 277
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 135/218 (61%), Gaps = 21/218 (9%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + ++C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ ++ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRNK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 185
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSDCAKRGLSVKPKMGDAL 266
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
LF+S+ P+ T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 267 LFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 129/207 (62%), Gaps = 9/207 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSSGVFIS 76
LSW PRA + F + E+C +I +AK L S + + GE+ D+ +RTSSG+F++
Sbjct: 35 LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLT 92
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
+D+ + +E K+A T LP NGEA IL Y+ GQKY+ H+D F ++ R
Sbjct: 93 KRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHR 150
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPN 193
+A+ L+YL+++ +GGET+FP G D + KC G VKPR+GD LLF++L N
Sbjct: 151 IATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLN 210
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIR 220
GT DP S+HGSCPV++GEKW AT+WI
Sbjct: 211 GTTDPNSLHGSCPVIEGEKWSATRWIH 237
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 170 bits (430), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 127/208 (61%), Gaps = 13/208 (6%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
LSW PRA NF + E+C I+ A+ ++ S + G++VD+ IRTS+G + +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 135
ED + IE+++A+VT +P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 136 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 192
RV + L YLT +EEGGET+ P E + DG + +C GL VKP +GD L FYSL P
Sbjct: 141 RVVTXLXYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALXFYSLKP 197
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 198 DGSNDPASLHGSCPTLKGDKWSATKWIH 225
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA + F + E+C ++N+A+ L S +A + G+++++ +RTSSG+FI
Sbjct: 49 LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESE--VRTSSGMFIGK 106
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
++DE +D IE +IA T LP+ NGE+ IL Y+ GQKY H+D F + RV
Sbjct: 107 SQDE--IVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRV 164
Query: 138 ASFLVYLTDLEEGGETMFPFENGMN---ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
+ L+YL+++ +GGET+FP G D S+ G VKP++GD LLF+SL P+
Sbjct: 165 VTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAKNGYAVKPQKGDALLFFSLHPDA 224
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPV++GEKW ATKWI
Sbjct: 225 TTDTNSLHGSCPVIEGEKWSATKWIH 250
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 127/211 (60%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 38 LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 95
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE + IEE+IA T LP NGE+ IL YK G+KY HYD F + R+
Sbjct: 96 RQDE--VVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK------CI--GLKVKPRQGDGLLFYS 189
A+ L+YL+++E+GGET+FP NA+G K C G VKP +GD LLF+S
Sbjct: 154 ATVLMYLSNVEKGGETIFP-----NAEGKLTQHKDETASECAKNGYAVKPMKGDALLFFS 208
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L P+ T DP S+HGSCPV++G+KW ATKWI
Sbjct: 209 LHPDATTDPDSLHGSCPVIEGQKWSATKWIH 239
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 169 bits (429), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 32/220 (14%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T E+C +I++AK L+ S +A DN G +RTSSG
Sbjct: 40 ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVA-------DNESGKSQVSEVRTSSG 92
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS A+D + IEEK+A T LP NGE +LRY+ GQKY +H+D F + +
Sbjct: 93 AFISKAKD--AIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIAR 150
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY----------DYQKCI--GLKVKPR 180
R A+ L+YL+++E+GG+T+FP NA+ S D +C G+ VKPR
Sbjct: 151 GGHRYATVLMYLSNVEKGGDTVFP-----NAELSERQKAAIAANDDLSECAKRGISVKPR 205
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+GD LLF+SL P T D S+HG CPV++GEKW ATKWI
Sbjct: 206 KGDALLFFSLTPTATPDQLSLHGGCPVIEGEKWSATKWIH 245
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+++SW PRA NF T ++C +I +A ++ ST+ + G + D+ +RTSSG+F+
Sbjct: 3 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR--VRTSSGMFL 60
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ +D + IE+KIAK+T +P+ +GE +L Y+ GQKY++H+D F Q
Sbjct: 61 NRGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ 118
Query: 136 RVASFLVYLTDLEEGGETMFP--FENGMNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 191
R+A+ L+YLTD+EEGGET+FP +N + +C G+ V+P++GD LLF+S+
Sbjct: 119 RIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMS 178
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ +D +S+HG CPV+KG+KW ATKW+R E
Sbjct: 179 PDAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 210
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+++SW PRA NF T ++C +I +A ++ ST+ + G + D+ +RTSSG+F+
Sbjct: 15 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR--VRTSSGMFL 72
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ +D + IE+KIAK+T +P+ +GE +L Y+ GQKY++H+D F Q
Sbjct: 73 NRGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ 130
Query: 136 RVASFLVYLTDLEEGGETMFP--FENGMNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 191
R+A+ L+YLTD+EEGGET+FP +N + +C G+ V+P++GD LLF+S+
Sbjct: 131 RIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMS 190
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ +D +S+HG CPV+KG+KW ATKW+R E
Sbjct: 191 PDAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 222
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 128/206 (62%), Gaps = 9/206 (4%)
Query: 20 SWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSSGVFISA 77
SW PRA + F + E+C +I +AK L S + + GE+ D+ +RTSSG+F++
Sbjct: 1 SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLTK 58
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D+ + +E K+A T LP NGEA IL Y+ GQKY+ H+D F ++ R+
Sbjct: 59 RQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 116
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++ +GGET+FP G D + KC G VKPR+GD LLF++L NG
Sbjct: 117 ATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNG 176
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T DP S+HGSCPV++GEKW AT+WI
Sbjct: 177 TTDPNSLHGSCPVIEGEKWSATRWIH 202
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 88/205 (42%), Positives = 129/205 (62%), Gaps = 7/205 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA + NF + +C +I++AK L S +A + G++V + IRTSSG+F+
Sbjct: 9 LSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSE--IRTSSGMFLMK 66
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D+ + IE++IA T LP+ NGEA +LRY+ G+KY H+D F + R+
Sbjct: 67 GQDD--IISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRI 124
Query: 138 ASFLVYLTDLEEGGETMFPF--ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
A+ L+YL+D+ +GGET+FP + G D S+ G+ VKPR+GD LLF+SL P+
Sbjct: 125 ATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFSLHPSAV 184
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIR 220
D +S+H CPV++GEKW ATKWI
Sbjct: 185 PDESSLHTGCPVIEGEKWSATKWIH 209
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 131/225 (58%), Gaps = 7/225 (3%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
H D+ + +SW PRA + NF T +C +IN+AK +++ S + +
Sbjct: 42 HESGDDEGKAEQWVEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKS-MVVDSSSGKS 100
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+RTSSG F+ D+ + IE++IA + +P +GE IL Y++GQKY H+D
Sbjct: 101 KDSRVRTSSGTFLPRGRDK--IIRDIEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFD 158
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVK 178
F QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VK
Sbjct: 159 YFMDDYNTENGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVK 218
Query: 179 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ GD LLF+S+ P+ ++DP+S+HG CPV++G KW +TKW+R E
Sbjct: 219 PKMGDALLFWSMKPDASLDPSSLHGGCPVIRGNKWSSTKWMRVNE 263
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/213 (39%), Positives = 125/213 (58%), Gaps = 17/213 (7%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRT 69
+VLSW PRA + NF T + ++ K ++ K E VDN G +RT
Sbjct: 1 MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHME-------KSEVVDNETGKSAPSKVRT 53
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
SSG+F++ ED+ ++ IE +IAK T +P+ NGE IL Y+ ++Y H+D F
Sbjct: 54 SSGMFLNRGEDD--VIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFN 111
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC--IGLKVKPRQGDGLLF 187
QR+A+ L+YL+D+E+GGET+FP + G+ + +C G KP++GD L F
Sbjct: 112 TQNGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFF 171
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
YSL P+G +D S+H CPV+KG+KW ATKW+R
Sbjct: 172 YSLTPDGRMDEKSLHAGCPVMKGDKWSATKWLR 204
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 131/208 (62%), Gaps = 15/208 (7%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
+SW PRA + NF + E+C+ + +A+ L ST+ K G+++D+T +RTSSG F++
Sbjct: 41 VSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDST--VRTSSGTFLAR 98
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS--- 134
EDE + IE++I+ VTM+P NGEA IL+Y GQKY H D F +Y +
Sbjct: 99 GEDE--VVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH-DKYNSRTENGG 155
Query: 135 QRVASFLVYLTDLEEGGETMFPF-ENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLL 191
QRVA+ L+YL+ EEGGET+FP+ E + +G + +C GL VK +G LLFYSL
Sbjct: 156 QRVATILMYLSTPEEGGETVFPYAEKKVEGEG---WSECARKGLAVKAVKGSALLFYSLK 212
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWI 219
PNG D S HGSCP + GEKW AT+WI
Sbjct: 213 PNGEEDQASTHGSCPTLAGEKWSATRWI 240
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 131/216 (60%), Gaps = 28/216 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LS PRA + F + +C I+++AK ++ S +A DN G RTSSG
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVA-------DNDSGKSVASQARTSSG 90
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F++ EDE + IE+++A T LP N E+ +LRY+ GQKY++H+D F +
Sbjct: 91 TFLAKREDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKL 148
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDG 184
QRVA+ L+YLTD+++GGET+FP NA+GS+ + +C GL VKP++GD
Sbjct: 149 GGQRVATVLMYLTDVKKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDA 203
Query: 185 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LLF++L N T D S+HGSCPV++GEKW ATKWI
Sbjct: 204 LLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIH 239
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 127/215 (59%), Gaps = 23/215 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PR + +F + ++ ++++A+ L+ S +A DN G RTSSG
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVA-------DNLSGKSELSDARTSSG 113
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FI ++D + IEEKIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 114 TFIRKSQDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLR 171
Query: 133 KSQRVASFLVYLTDLEEGGETMFPF-----ENGMNADGSYDYQKCI--GLKVKPRQGDGL 185
R+A+ L+YLTD+ EGGET+FP E+G N + S +C G+ VKPR+GD L
Sbjct: 172 GGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDS-TLSECAKKGVAVKPRKGDAL 230
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LF++L P+ + D S+H CPV+KGEKW ATKWIR
Sbjct: 231 LFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 132/231 (57%), Gaps = 25/231 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 29 AGSASAIIDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVA-------DN 81
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 82 LSGDSKLSDVRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKY 139
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP------FENGMNADGSYDYQK 171
+ HYD F + + RVA+ L+YLT++ GGET+FP F ++ D +
Sbjct: 140 DPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSE 199
Query: 172 CI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
C G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 200 CAKKGIAVKPRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWIH 250
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 127/215 (59%), Gaps = 23/215 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PR + +F + ++ ++++A+ L+ S +A DN G RTSSG
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVA-------DNLSGKSELSDARTSSG 113
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FI ++D + IEEKIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 114 TFIRKSQDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLR 171
Query: 133 KSQRVASFLVYLTDLEEGGETMFPF-----ENGMNADGSYDYQKCI--GLKVKPRQGDGL 185
R+A+ L+YLTD+ EGGET+FP E+G N + S +C G+ VKPR+GD L
Sbjct: 172 GGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDS-TLSECAKKGVAVKPRKGDAL 230
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LF++L P+ + D S+H CPV+KGEKW ATKWIR
Sbjct: 231 LFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 92/211 (43%), Positives = 123/211 (58%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + F + E+C +I +AK L S +A DN G +RTSSG
Sbjct: 57 LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVA-------DNESGKSIMSEVRTSSG 109
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+F+ A+DE + IE +IA T LP NGE+ IL Y+ G+KY H+D F +
Sbjct: 110 MFLLKAQDE--IVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLL 167
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYS 189
R+A+ L+YL +EEGGET+FP G + D S+ G V P++GD LLF+S
Sbjct: 168 GGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDALLFFS 227
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L P+ T DP+S+HGSCPV+ GEKW ATKWI
Sbjct: 228 LHPDATTDPSSLHGSCPVIAGEKWSATKWIH 258
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 97/229 (42%), Positives = 133/229 (58%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 24 AGSASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA-------DN 76
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FIS +D + IE+KI+ T LP+ NGE + RY+ GQKY
Sbjct: 77 LSGESQLSDVRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKY 134
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 173
+ HYD F + + R+A+ L+YLTD+ +GGET+FP A+ S D +C
Sbjct: 135 DPHYDYFTDKVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECA 194
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL N T D +S+H CPV++GEKW ATKWI
Sbjct: 195 KKGIAVKPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 243
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 130/216 (60%), Gaps = 28/216 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LS PRA + F + +C ++++AK ++ S +A DN G RTSSG
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVA-------DNDSGKSVASQARTSSG 90
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F++ EDE + IE+++A T LP N E+ +LRY+ GQKY++H+D F +
Sbjct: 91 TFLAKREDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKL 148
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDG 184
QRVA+ L+YLTD+ +GGET+FP NA+GS+ + +C GL VKP++GD
Sbjct: 149 GGQRVATVLMYLTDVNKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDA 203
Query: 185 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LLF++L N T D S+HGSCPV++GEKW ATKWI
Sbjct: 204 LLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIH 239
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 136/209 (65%), Gaps = 9/209 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
+VLSW PRA + NF ++C+ +IN+AK ++ S + K G ++D+ +RTSSG F+
Sbjct: 92 EVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSN--VRTSSGWFL 149
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ +D+ + IE++IA + +P +GE ++L Y++ QKY++HYD F Q
Sbjct: 150 NRGQDK--IIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQ 207
Query: 136 RVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 191
R A+ L+YL+D+E+GGET+FP N + + +C GL V+P+ GD LLF+S+
Sbjct: 208 RGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVK 267
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+ ++DP+S+HGSCPV++G KW ATKW+R
Sbjct: 268 PDASLDPSSLHGSCPVIQGNKWSATKWMR 296
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/225 (42%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 7 GDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 64
G SV P +V LSW PRA + F + E+C +I +AK L S +A DN
Sbjct: 45 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVA-------DNE 97
Query: 65 QG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 118
G +RTSSG+F++ A+DE + IE +IA T LP NGE+ IL Y+ GQKY
Sbjct: 98 SGKSIMSEVRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYE 155
Query: 119 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGL 175
H+D F + R+A+ L+YL+D+E+GGET+FP D S+ G
Sbjct: 156 PHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGY 215
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
VKPR+GD LLF+SL + + D S+HGSCPV++GEKW ATKWI
Sbjct: 216 AVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIH 260
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 5 QAGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
+ G ++T P +V LS PRA + F + E+C+ +IN AK L S +A G++V
Sbjct: 47 KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVT 106
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+ + RTS+G+F+ A+DE + IE +IA T LP NGE ILRY+ GQKY H+D
Sbjct: 107 SKE--RTSTGMFLHKAQDE--IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFD 162
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFEN-GMNADGSYDYQKC--IGLKVKP 179
F R+A+ L+YL+++E+GGET+FP ++ + D +C +G V+P
Sbjct: 163 FFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRP 222
Query: 180 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+ GD LLF+S+ PN T D TS HGSCPV++GEKW ATKWI
Sbjct: 223 KLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIH 263
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 125/211 (59%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + NF T E+C +I ++K L S +A DN G +RTSSG
Sbjct: 54 LSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVA-------DNESGKSIQSEVRTSSG 106
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+F++ +DE + IE +IA T LP NGE+ +L Y G+KY H+D F +
Sbjct: 107 MFLNKQQDE--IVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKANQRL 164
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENG-MNADGSYDYQKCI--GLKVKPRQGDGLLFYS 189
RVA+ L+YL+++E+GGET+FP G ++ + +C G VKPR+GD LLF+S
Sbjct: 165 GGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECAHKGYAVKPRKGDALLFFS 224
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L + T D S+HGSCPV++GEKW ATKWI
Sbjct: 225 LHLDATTDSKSLHGSCPVIEGEKWSATKWIH 255
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 166 bits (420), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 98/229 (42%), Positives = 133/229 (58%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 80
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FI +D + +E+KI+ T+LP+ NGE +LRY+ GQKY
Sbjct: 81 LSGESKLSEVRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKY 138
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGS---YDYQKCI 173
+ HYD F + + RVA+ L+YLTD+ +GGET+FP E GS D +C
Sbjct: 139 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECA 198
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 199 QKGIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 247
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 166 bits (419), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 94/229 (41%), Positives = 133/229 (58%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S T P +V +SW PRA + F T +C ++++A+ L+ + E DN
Sbjct: 27 AGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELK-------RSEVADN 79
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 80 DSGKSKLSTVRTSSGMFISKNKDP--IVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKY 137
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY----DYQKCI 173
SHYD F + R+A+ L+YL+++ +GGET+FP + +Y D +C
Sbjct: 138 ESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECA 197
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKP++GD LLF+SL PN D S+HG CPV++GEKW ATKWI
Sbjct: 198 KKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 166 bits (419), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 86/216 (39%), Positives = 132/216 (61%), Gaps = 16/216 (7%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PR + NF + ++C +IN+A+ L ST+ VD T G +RTS
Sbjct: 79 EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTV-------VDATTGKGIESKVRTS 131
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
+G+F++ + T+ IE +IA +M+P NGE +LRY+ Q Y +H+D F +
Sbjct: 132 TGMFLNGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNL 191
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
+ QRVA+ L+YLT+ EGGET+FP + + S + IG+ VKP++GD +LF+S+
Sbjct: 192 KRGGQRVATMLMYLTEGVEGGETIFP--QAGDKECSCGGEMKIGVCVKPKRGDAVLFWSI 249
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+G +DPTS+HG C V+ GEKW +TKW+R Q +D
Sbjct: 250 KLDGQVDPTSLHGGCKVLSGEKWSSTKWMR-QRAFD 284
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 166 bits (419), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 128/210 (60%), Gaps = 11/210 (5%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VLSW PRA + +F T E+C +I +A+ +L ST+ G++ D+ +RTSSG F+
Sbjct: 4 EVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSR--LRTSSGTFL 61
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D + IE++IA T +P GE +L+YK +KY HYD F Q
Sbjct: 62 MRGQDP--VIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQ 119
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGLLFYSL 190
R+A+ L+YL+++EEGGET+FP +N D+ K C GL V+PR GD LLF+S+
Sbjct: 120 RIATVLMYLSNVEEGGETVFPAAQ-VNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSM 178
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+ T+D TS+HG CPV+KG KW ATKW+
Sbjct: 179 KPDATLDSTSLHGGCPVIKGTKWSATKWLH 208
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 130/210 (61%), Gaps = 11/210 (5%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VLSW PRA + +F T +C +I +AK +L ST+ G++ D+ +RTSSG F+
Sbjct: 3 EVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSR--VRTSSGTFL 60
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D + IE++IA T +P GE +L+Y+ +KY HYD F Q
Sbjct: 61 VRGQDH--IIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQ 118
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGLLFYSL 190
R+A+ L+YL+D+E+GGET+FP + +NA D+ + C GL V+PR GD LLF+S+
Sbjct: 119 RIATVLMYLSDVEKGGETVFP-ASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSM 177
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+ +DPTS+HG+CPV++G KW ATKW+
Sbjct: 178 KPDAKLDPTSLHGACPVIQGTKWSATKWLH 207
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/225 (42%), Positives = 131/225 (58%), Gaps = 20/225 (8%)
Query: 7 GDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 64
G SV P +V LSW PRA + F + E+C +I +AK L S +A DN
Sbjct: 46 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVA-------DND 98
Query: 65 QG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 118
G IRTSSG+F++ A+DE + IE +IA T LP NGE+ IL Y+ GQKY
Sbjct: 99 SGKSIMSDIRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPVENGESMQILHYENGQKYE 156
Query: 119 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GL 175
H+D F + R+A+ L+YL+D+E+GGET+FP E + + +C G
Sbjct: 157 PHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGY 216
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
VKP++GD LLF+SL + + D S+HGSCPV++GEKW ATKWI
Sbjct: 217 AVKPQKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIH 261
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 25 AGSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA-------DN 77
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 78 LSGESQLSDVRTSSGMFISKNKDP--IISGIEDKISSWTFLPKENGEDIQVLRYEHGQKY 135
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 173
+ HYD F + + R+A+ L+YLT++ +GGET+FP + S D +C
Sbjct: 136 DPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSECA 195
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKP +GD LLF+SL N T D +S+H CPV++GEKW ATKWI
Sbjct: 196 KKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 244
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 98/229 (42%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 566 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 618
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FI +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 619 LSGESKLSEVRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKY 676
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGS---YDYQKCI 173
+ HYD F + + RVA+ L+YLTD+ +GGET+FP E GS + +C
Sbjct: 677 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECA 736
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 737 QKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATKWIH 785
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 90/212 (42%), Positives = 127/212 (59%), Gaps = 21/212 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PR + +F + ++ +I++A+ L+ S +A DN G +RTSSG
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLSEVRTSSG 106
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+ +D ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 107 TFLRKGQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVR 164
Query: 133 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
R A+ L+YLTD+ EGGET+FP ++ +A S QK G+ V+PR+GD LLF+
Sbjct: 165 GGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK--GIAVRPRKGDALLFF 222
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+L P+GT D S+HG CPV+KGEKW ATKWIR
Sbjct: 223 NLNPDGTTDSVSLHGGCPVIKGEKWSATKWIR 254
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 164 bits (414), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 122/211 (57%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA F T +C+ +I++AK L S +A DN G +RTSSG
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVA-------DNESGKSVMSEVRTSSG 100
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F +
Sbjct: 101 MFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQAL 158
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYS 189
R+A+ L+YL+D+ +GGET+FP G D + C G VKP +GD LLF+S
Sbjct: 159 GGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFS 218
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 LHPDATTDSDSLHGSCPVIEGQKWSATKWIH 249
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 164 bits (414), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 122/211 (57%), Gaps = 18/211 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA F T +C+ +I++AK L S +A DN G +RTSSG
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVA-------DNESGKSVMSEVRTSSG 100
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F +
Sbjct: 101 MFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQAL 158
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYS 189
R+A+ L+YL+D+ +GGET+FP G D + C G VKP +GD LLF+S
Sbjct: 159 GGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFS 218
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
L P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 LHPDATTDSDSLHGSCPVIEGQKWSATKWIH 249
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 164 bits (414), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 128/219 (58%), Gaps = 31/219 (14%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 43 VSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVA-------DNLSGESKLSEVRTSSG 95
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 96 MFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIAR 153
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGS---------YDYQKC--IGLKVKPRQ 181
RVA+ L+YLT++ +GGET+FP NA+ S D +C G+ VKPR+
Sbjct: 154 GGHRVATVLMYLTNVTKGGETVFP-----NAEESPRHKLSETDEDLSECGKKGVAVKPRR 208
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 209 GDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 247
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 164 bits (414), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 125/214 (58%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 39 VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA-------DNESGKSKLSEVRTSSG 91
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+FI+ +D + IEEKI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 92 MFIAKGKDP--IIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIAR 149
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+D+ +GGET+FP + D +C G+ VKPR+GD LL
Sbjct: 150 GGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDALL 209
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F+SL P DP S+H CPV++GEKW ATKWI
Sbjct: 210 FFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIH 243
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 163 bits (413), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 128/225 (56%), Gaps = 20/225 (8%)
Query: 7 GDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 64
G SV P +V LSW PRA + F + E+C +I +AK L S +A DN
Sbjct: 45 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVA-------DNE 97
Query: 65 QG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 118
G +RTSSG+F++ A+DE + IE +IA T LP NGE+ IL Y+ GQKY
Sbjct: 98 SGKSIMSEVRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYE 155
Query: 119 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGL 175
H+D F + R+A+ L+YL+D+E+GGET+F D S+ G
Sbjct: 156 PHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGY 215
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
VKPR+GD LLF+SL + + D S+HGSCPV++GEKW ATKWI
Sbjct: 216 AVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIH 260
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 163 bits (412), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 135/218 (61%), Gaps = 17/218 (7%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P + + P+A F NF TP +C+ ++ +AK L PST+ KG + IRTS+G+F
Sbjct: 169 PLVLSNHEPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKG-SGSMVSKIRTSAGMF 227
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-EYGPQK 133
+ +D T+ IEE+IA + LP NGE ILRY+ GQKY+ H+D F Q P++
Sbjct: 228 LGRGQDP--TVRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRR 285
Query: 134 S-QRVASFLVYLTDLEEGGETMFPFENGM-----NAD--GSYD-YQKCI--GLKVKPRQG 182
QR+A+ L+YL D EGGET+FP NG+ +AD G+++ + C G+ VK +G
Sbjct: 286 GGQRMATMLIYLEDTTEGGETIFP--NGVRPEDWDADEPGNHNSWSDCAKKGIPVKSHRG 343
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
D +LF+SL + T+D S+HG+CPV+ GEKW A KWIR
Sbjct: 344 DAVLFWSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIR 381
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 163 bits (412), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 136/220 (61%), Gaps = 19/220 (8%)
Query: 14 IPFQVLSWM-PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
+P VLS + P+A F NF + E+C ++ +AK L PST+ G +V +T IRTS+G
Sbjct: 276 MPPLVLSAVKPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAGGTSVPST--IRTSAG 333
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGP 131
+F+ A D+ TL+ IE +IA + P NGE ILRY +GQKY+ H+D F D P
Sbjct: 334 MFLRKAADK--TLENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSP 391
Query: 132 QKS-QRVASFLVYLTDLEEGGETMFPFENGMNAD--------GSYDYQKCI--GLKVKPR 180
++ QR+A+ L+YL + +EGGET+FP G A+ +++ +C GL VK
Sbjct: 392 KRGGQRMATMLIYLENTKEGGETIFP--RGTRAETFDLTEEGNPHEWSECTKHGLPVKSV 449
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+GD LLF+SL + +D S+HG+CPVVKG+KW A KWIR
Sbjct: 450 KGDALLFWSLTDDYKLDMGSLHGACPVVKGQKWTAVKWIR 489
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 126/216 (58%), Gaps = 23/216 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 43 VSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVA-------DNLSGESKLSEVRTSSG 95
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 96 MFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIAR 153
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMN------ADGSYDYQKC--IGLKVKPRQGDG 184
RVA+ L+YLT++ +GGET+FP ++ D +C G+ VKPR+GD
Sbjct: 154 GGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRRGDA 213
Query: 185 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 214 LLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 249
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 90/206 (43%), Positives = 123/206 (59%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA F +C +I +AK L S +A K G++V + +RTSSG+F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE + IEE+I+ T LP NGEA IL Y+ G+KY HYD F + R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPV++G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPVIEGQKWSATKWIH 239
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 125/218 (57%), Gaps = 19/218 (8%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-NTQGIRTSSGVF 74
+ +SW PRA + NF TPE+C ++N+AK + L++ D T G SG F
Sbjct: 2 IEQISWEPRAFVYHNFLTPEECAHLVNLAK----ATDGGLKRATVADARTGGTFPGSGAF 57
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-K 133
+ D + IEE+I+ M+P +GE ILRY G+KY+ H+D FD + +
Sbjct: 58 LLRNHDP--IVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFY 115
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENG------MNADG---SYDYQKCI--GLKVKPRQG 182
QRVA+ L+YL+D+E GGET+FP M+ G S D KC L VKPR+G
Sbjct: 116 GQRVATVLMYLSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRRG 175
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
D LLF++ NG DPTS+H CPV++GEKW ATKW+R
Sbjct: 176 DALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWMR 213
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 162 bits (411), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 122/214 (57%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PR + +F + ++ ++++A+ L+ S +A DNT G +RTS G
Sbjct: 55 ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVA-------DNTSGKSTLSEVRTSYG 107
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+KIA T LP+ NGE +LRYK G+K +D F +
Sbjct: 108 TFISKGKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVR 165
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI------GLKVKPRQGDGLL 186
RVA+ L+YLTD+ EGGET+FP G +D + G+ VKPR+GD LL
Sbjct: 166 GGHRVATVLLYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSECAQKGIAVKPRKGDALL 225
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L P+ DP S+HG C V+KGEKW ATKWIR
Sbjct: 226 FFNLRPDAATDPLSLHGGCTVIKGEKWTATKWIR 259
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 97/229 (42%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 80
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FI +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 81 LSGESKLSEVRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKY 138
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGS---YDYQKCI 173
+ HYD F + + RVA+ L+YLTD+ +GGET+FP E GS + +C
Sbjct: 139 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECA 198
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW AT+WI
Sbjct: 199 QKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIH 247
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 4 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 54
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 55 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 113
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 96 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 114 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 172
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 173 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/205 (42%), Positives = 125/205 (60%), Gaps = 8/205 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PRA + F + +C +I++AK L S +A G++V + +RTSSG+F+
Sbjct: 56 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 113
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
A+DE + +E +IA T+LP NGE+ IL Y+ GQKY H+D F + R+
Sbjct: 114 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 171
Query: 138 ASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP F+ D S+ G VK ++GD LLF+SL +
Sbjct: 172 ATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDA 231
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWI 219
T D S+HGSCPV+ GEKW ATKWI
Sbjct: 232 TTDERSLHGSCPVIAGEKWSATKWI 256
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 125/206 (60%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE + IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 138 ASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D ++ G VKP +GD LLF+SL P+
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 214
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPV++G+KW ATKWI
Sbjct: 215 TTDSDSLHGSCPVIEGQKWSATKWIH 240
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW+PRA + F T +C +I++AK L+ S +A DN
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVA-------DN 77
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FIS +D + IE++I+ T LP+ NGE +LRY+ GQKY
Sbjct: 78 LSGDSQLSDVRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKY 135
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 173
+ HYD F + Q R+A+ L+YLT++ +GGET+FP + S D +C
Sbjct: 136 DPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECA 195
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL N D S+H CPV++GEKW ATKWI
Sbjct: 196 KKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIH 244
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW+PRA + F T +C +I++AK L+ S +A DN
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVA-------DN 77
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+FIS +D + IE++I+ T LP+ NGE +LRY+ GQKY
Sbjct: 78 LSGDSQLSDVRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKY 135
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 173
+ HYD F + Q R+A+ L+YLT++ +GGET+FP + S D +C
Sbjct: 136 DPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECA 195
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL N D S+H CPV++GEKW ATKWI
Sbjct: 196 KKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIH 244
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 125/212 (58%), Gaps = 18/212 (8%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSS 71
++SW PR ++ F + ++C ++ + K L+ S +A DN G +RTSS
Sbjct: 50 IISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVA-------DNESGKSVMSEVRTSS 102
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
G+F+ +D + IEE+IA T+LP+ N E ILRY+ GQKY+ H+D F +
Sbjct: 103 GMFLDKQQDP--VVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQL 160
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFY 188
Q R A+ L YL+ +E+GGET+FP G + D S+ GL VK +GD +LF+
Sbjct: 161 QGGHRYATVLTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCAKKGLAVKAVKGDSVLFF 220
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+L P+GT DP S+HGSCPV++GEKW A KWI
Sbjct: 221 NLQPDGTPDPLSLHGSCPVIEGEKWSAPKWIH 252
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 124/206 (60%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 43 LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 100
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F + R+
Sbjct: 101 KQDE--VVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 158
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 159 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDA 218
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 TTDSESLHGSCPVIEGQKWSATKWIH 244
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 122/206 (59%), Gaps = 8/206 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA F +C +I +AK L S +A K G++V + +RTSSG+F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE + IEE+I+ T LP NGEA IL Y+ G+KY HYD F + R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 194
A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HGSCP ++G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPAIEGQKWSATKWIH 239
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 4 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 54
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 55 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 113
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 96 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 114 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 172
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 173 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 4 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 54
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 30 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 89
Query: 55 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 113
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 90 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 145
Query: 114 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 172
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 146 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 205
Query: 173 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 206 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 255
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 123/212 (58%), Gaps = 19/212 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA F T +C+ +I++AK L S +A DN G +RTSSG
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVA-------DNESGKSVMSEVRTSSG 100
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F +
Sbjct: 101 MFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQAL 158
Query: 133 KSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFY 188
R+A+ L+YL+D+ +GGET+FP E G D + C G VKP +GD LLF+
Sbjct: 159 GGHRIATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFF 218
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
SL P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 SLHPDATTDSDSLHGSCPVIEGQKWSATKWIH 250
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 4 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 54
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 55 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 113
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 96 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 114 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 172
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 173 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 127/212 (59%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 73
+V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG D +RTSSG+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD----VRTSSGM 113
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+D F +
Sbjct: 114 FVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRG 173
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
QRVA+ L+YLTD GGET FP + + S GL VKP +GD +LF+S+ +
Sbjct: 174 GQRVATMLMYLTDGVVGGETHFP--QAGDGECSCGGNVVKGLCVKPNKGDAVLFWSMGLD 231
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
G DP SIH CPV+KGEKW ATKW+R + +
Sbjct: 232 GNTDPNSIHSGCPVLKGEKWSATKWMRQKMTF 263
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 134/224 (59%), Gaps = 9/224 (4%)
Query: 5 QAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETV 61
+AG + + +V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG
Sbjct: 46 EAGLLRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKS 105
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
D +RTSSG+F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+
Sbjct: 106 D----VRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHH 161
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 181
D F + QRVA+ L+YLTD EGGET F + G + + S GL VKP +
Sbjct: 162 DYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHF-LQAG-DGECSCGGNVVKGLCVKPNK 219
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
GD +LF+S+ +G DP SIH CPV+KGEKW ATKW+R + +
Sbjct: 220 GDAVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWMRQKMTF 263
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 123/206 (59%), Gaps = 9/206 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL--RPSTLALRKGETVDNTQGIRTSSGVFIS 76
+SW PR + F + +C+ +I +AK R + + + GE+V RTSSG+F+
Sbjct: 40 VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESV--MSKTRTSSGMFLI 97
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
+DE + IEE+IA TM P NGE+ +LRY G+KY H+D ++ + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 137 VASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPN 193
+A+ L+YL++++ GGET+FP E ++ + C G VKP +G +LF+SL PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWI 219
T DP S+HGSCPV++GEKW ATKWI
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWI 241
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 125/207 (60%), Gaps = 8/207 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
+ +SW PR + F + +C ++ +AK ++ S +A + G++V + +RTSSG+F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D + IEE+IA T LP+ N E +LRY+ GQKY H+D F + +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R A+ L+YL+ + EGGET+FP G + D ++ GL VKP +GD +LF+SL
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLFFSLHA 221
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWI 219
+GT DP S+HGSCPV++GEKW A KWI
Sbjct: 222 DGTPDPLSLHGSCPVIRGEKWSAPKWI 248
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 94/229 (41%), Positives = 131/229 (57%), Gaps = 23/229 (10%)
Query: 6 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
AG S P +V +SW+PRA + F T +C +I++AK L+ S +A DN
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVA-------DN 77
Query: 64 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
G +RTSSG+ IS +D + IE++I+ T LP+ NGE +LRY+ GQKY
Sbjct: 78 LSGDSQLSDVRTSSGMLISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKY 135
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 173
+ HYD F + Q R+A+ L+YLT++ +GGET+FP + S D +C
Sbjct: 136 DPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECA 195
Query: 174 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G+ VKPR+GD LLF+SL N D S+H CPV++GEKW ATKWI
Sbjct: 196 KKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIH 244
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 159 bits (402), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 123/206 (59%), Gaps = 9/206 (4%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL--RPSTLALRKGETVDNTQGIRTSSGVFIS 76
+SW PR + F + +C+ +I +AK R + + + GE+V RTSSG+F+
Sbjct: 40 VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESV--MSKTRTSSGMFLI 97
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
+DE + IEE+IA TM P NGE+ +LRY G+KY H+D ++ + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 137 VASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPN 193
+A+ L+YL++++ GGET+FP E ++ + C G VKP +G +LF+SL PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWI 219
T DP S+HGSCPV++GEKW ATKWI
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWI 241
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 159 bits (402), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 124/216 (57%), Gaps = 22/216 (10%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRT 69
+ +SW P A + F T E+C + +A +L ST+ VD + G IRT
Sbjct: 56 IERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTV-------VDASNGGSVPSDIRT 108
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
SSG+F+ ED+ + IE +IA T +P +GE F +LRY+ GQ+Y H+D F E+
Sbjct: 109 SSGMFLLRGEDD--VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQ-DEF 165
Query: 130 GPQK---SQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIG--LKVKPRQGD 183
++ QRVA+ L+YLTD+EEGGET+FP E G N G D C L VKPR+GD
Sbjct: 166 NQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGD 225
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
L F SL NGT D S H CPVVKG K+ ATKW+
Sbjct: 226 ALFFRSLHHNGTSDAMSSHAGCPVVKGVKFSATKWM 261
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 159 bits (402), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 89/223 (39%), Positives = 129/223 (57%), Gaps = 35/223 (15%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + NF + E+C I+ +A+ +R ST+ +D+ G IRTS
Sbjct: 1 VSWYPRAFHLHNFMSHEECDRILEIARPRVRRSTV-------IDSVTGQSKVDPIRTSEQ 53
Query: 73 VFISAAEDESGTLDLI---EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-----AF 124
F++ GT D++ EE++A VT LP +GE IL+Y +GQKY++H+D +
Sbjct: 54 TFLN-----RGTWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSA 108
Query: 125 DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN------ADGSYDYQKCI--GLK 176
++ + RVA+ L+YL+D+EEGGET FP M A+G + C +
Sbjct: 109 SGKQLAAEGGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQ-KWSDCAEGNVA 167
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
VKPR+GDGLLF+S+ IDP S+H CPV++GEKW ATKWI
Sbjct: 168 VKPRKGDGLLFWSVNNENAIDPHSMHAGCPVIRGEKWTATKWI 210
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 125/208 (60%), Gaps = 18/208 (8%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+P + NF T +C +I +A+ L+ S +A DN G IRTSSG+F+
Sbjct: 27 IPGLFLYKNFLTDAECDHLIFLARDKLQKSMVA-------DNESGKSVMSEIRTSSGMFL 79
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ A+DE + +E++IA T LP NGEA +L Y++GQKY H+D F +
Sbjct: 80 NKAQDE--IVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGH 137
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLP 192
R+A+ L+YL+D+ +GGET+FP ++ D + +C G VKP +GD LLF+SL P
Sbjct: 138 RIATVLMYLSDVVKGGETVFPNAETKDSQPKDDSWSECAKGGYSVKPNKGDALLFFSLRP 197
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+ T D +S+HGSCPV++GEKW ATKWI
Sbjct: 198 DATTDQSSLHGSCPVIEGEKWSATKWIH 225
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 123/202 (60%), Gaps = 11/202 (5%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK--GETVDNTQGIRTSSGVFISAAED 80
PRA + F T E+C I+ ++K +L S + K G T T IRTS+G FIS A D
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGST---TSDIRTSTGTFISRAHD 57
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASF 140
T+ IEE+I + +P +GEA +LRY+ GQ+Y +H+D F + G +++ R+A+
Sbjct: 58 P--TITAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFFHK--GGKRNNRIATV 113
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDP 198
L+YL+D+EEGGET+FP + Y +C G VK R+GD LLF+S+ P G +DP
Sbjct: 114 LLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDP 173
Query: 199 TSIHGSCPVVKGEKWVATKWIR 220
S H CPV+KG KW ATKW+
Sbjct: 174 GSSHAGCPVIKGVKWTATKWMH 195
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PR + F NF + E+C + ++A+ L+ ST+ + G+ V + +RTSSG+F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
S+ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RVA+ L+YLTD EGGET FP + + S + GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFP--QAGDGECSCGGKMVKGLCVKPNKGDAVLFWSMGLDGE 237
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQE 223
D SIHG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PR + F NF + E+C + ++A+ L+ ST+ + G+ V + +RTSSG+F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
S+ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RVA+ L+YLTD EGGET FP + + S + GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFP--QAGDGECSCGGKMVKGLCVKPNKGDAVLFWSMGLDGE 237
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQE 223
D SIHG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/198 (43%), Positives = 117/198 (59%), Gaps = 18/198 (9%)
Query: 31 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGT 84
F + E+C +I + K L S +A DN G IRTSSG+F+ +DE T
Sbjct: 3 FLSHEECDHLIALGKDKLEKSMVA-------DNESGKSVMSEIRTSSGMFLERRQDE--T 53
Query: 85 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYL 144
+ IE++IA T LP NGE IL Y+ GQKY++HYD F + R+A+ L+YL
Sbjct: 54 ITRIEKRIAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYL 113
Query: 145 TDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+D+++GGET+FP G D ++ G VKPR+GD LLF+S PN T DP S+
Sbjct: 114 SDVKKGGETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSL 173
Query: 202 HGSCPVVKGEKWVATKWI 219
H SCPV++GEKW AT+WI
Sbjct: 174 HASCPVIEGEKWSATRWI 191
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 128/214 (59%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C ++++AK +L+ S +A DN G +RTSSG
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVA-------DNDSGESKFSEVRTSSG 92
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 93 TFISKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVR 150
Query: 133 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKPR+GD LL
Sbjct: 151 GGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALL 210
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L P+ DP S+HG CPV++GEKW ATKWI
Sbjct: 211 FFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 128/214 (59%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C ++++AK +L+ S +A DN G +RTSSG
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVA-------DNDSGESKFSEVRTSSG 92
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 93 TFISKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVR 150
Query: 133 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKPR+GD LL
Sbjct: 151 GGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALL 210
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L P+ DP S+HG CPV++GEKW ATKWI
Sbjct: 211 FFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 130/232 (56%), Gaps = 24/232 (10%)
Query: 4 GQAGDDSVTNIP------FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 57
G+ G D V P + +SW PR + F + +C ++ +AK ++ S +A
Sbjct: 26 GEDGGDVVAPAPPFNSSRVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVA--- 82
Query: 58 GETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRY 111
DN G +RTSSG+F++ +D + IEE+IA T LP+ N E ILRY
Sbjct: 83 ----DNQSGKSVMSEVRTSSGMFLNKRQDP--VVSRIEERIAAWTFLPQENAENMQILRY 136
Query: 112 KIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQ 170
+ GQKY H+D F + + R A+ L+YL+ +++GGET+FP G + D +
Sbjct: 137 EHGQKYEPHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFS 196
Query: 171 KCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+C GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWI
Sbjct: 197 ECAHQGLAVKPVKGDAVLFFSLHVDGVPDPLSLHGSCPVIQGEKWSAPKWIH 248
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 88/212 (41%), Positives = 123/212 (58%), Gaps = 15/212 (7%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
+SW PRA + + E+C+ I+ +AK ++ ST+ GE T IRTS F+
Sbjct: 83 ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEI--KTDPIRTSKQTFL-- 138
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQ 132
A + + +EE++++ TMLP NGE IL Y +G+KY++H+D + Q+
Sbjct: 139 ARGKYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSAD 198
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGLLF 187
QRVA+ L+YL D EEGGET FP + + Y QK C G+ KP++GDGLLF
Sbjct: 199 GGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGLLF 258
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+S+ P G ID S+H CPVVKG KW ATKWI
Sbjct: 259 FSITPEGDIDQKSMHAGCPVVKGTKWTATKWI 290
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/220 (40%), Positives = 131/220 (59%), Gaps = 18/220 (8%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P + S P+A NF +P++C ++ +AK L PST+ G +V + IRTS+G+F
Sbjct: 82 PLVLSSKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGDGGSSVASE--IRTSAGMF 139
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK 133
+ ++D+ T+ IEE+IA+++ +P NGE ILRY GQKY+ H+D F D P++
Sbjct: 140 LRKSQDD--TVREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHDKVNPAPKR 197
Query: 134 S-QRVASFLVYLTDLEEGGETMFP-------FE-----NGMNADGSYDYQKCIGLKVKPR 180
QRVA+ L+YL D EEGGET FP FE N A + G+ VK
Sbjct: 198 GGQRVATVLIYLVDTEEGGETTFPNGRLPENFEEDEPDNPFAAHIKHTDCAKNGIPVKSV 257
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+GD +LF+S+ +G +D S+HG+CPV+ G+KW A KW+R
Sbjct: 258 RGDAILFFSMTKDGELDHGSLHGACPVIAGQKWTAVKWLR 297
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 122/212 (57%), Gaps = 19/212 (8%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C ++ +A+ L+K VD G +RTSSG
Sbjct: 41 VSWRPRAFLYKGFLTEAECDHLVALAEEG------GLQKSMVVDRQTGKSVMSEVRTSSG 94
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-- 130
F++ +D+ + IE +IA T+LP+ NGE+ +LRY+ GQKY H D G
Sbjct: 95 TFLAKKQDQ--VVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHH 152
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ-KCI--GLKVKPRQGDGLLF 187
+ RVA+ L+YL+D++ GGET+FP + D Q +C G VKP +GD +LF
Sbjct: 153 SRGGHRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLF 212
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+SL PNGT D S+HG CPV++GEKW ATKWI
Sbjct: 213 FSLHPNGTTDRDSLHGGCPVIEGEKWSATKWI 244
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 123/209 (58%), Gaps = 11/209 (5%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PRA + F + +C +I++AK L S +A G++V + +RTSSG+F+
Sbjct: 35 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 92
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
A+DE + +E +IA T+LP NGE+ IL Y+ GQKY H+D F + R+
Sbjct: 93 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 150
Query: 138 ASFLVYLTDLEEGGETMFPF------ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
A+ L+YL+++E+GGET+FP D S+ G VK ++GD LLF+SL
Sbjct: 151 ATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKGDALLFFSLN 210
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+ T D S+HGSCPV+ GEKW ATKWI
Sbjct: 211 LDATTDERSLHGSCPVIAGEKWSATKWIH 239
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 156 bits (395), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 130/220 (59%), Gaps = 18/220 (8%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P + + P+A NF + E+C ++ +AK L PST+ G++V + IRTS+G+F
Sbjct: 41 PLVLSATQPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSD--IRTSAGMF 98
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK 133
+ +D+ + IEE+IA+++ P NGE ILRY +GQKY+ H+D F D P++
Sbjct: 99 LRKGQDK--IVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPAPKR 156
Query: 134 S-QRVASFLVYLTDLEEGGETMFP---FENGMNAD-------GSYDYQKCI--GLKVKPR 180
QR+A+ L+YL D ++GGET FP AD ++ C G+ VK
Sbjct: 157 GGQRLATMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGIPVKSV 216
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+GD +LF+S+ +G +D S+HG+CPV++G+KW A KWIR
Sbjct: 217 RGDAILFFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIR 256
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 156 bits (395), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 117/210 (55%), Gaps = 8/210 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PR + F + E+C I +AK L S +A GE+V++ +
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFI 118
Query: 78 AEDESGTLDLI----EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
A +S +D I E K+A T LP NGE+ IL Y+ GQKY H+D F Q
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 178
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSL 190
R+A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L
Sbjct: 179 GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNL 238
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
PN T D S+HGSCPVV+GEKW AT+WI
Sbjct: 239 HPNATTDSNSLHGSCPVVEGEKWSATRWIH 268
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 84/213 (39%), Positives = 123/213 (57%), Gaps = 18/213 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG---------I 67
+VL+W PR + F + E+C +I +A L ST+ VD + G +
Sbjct: 61 EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTV-------VDTSTGKARHGIESKV 113
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
RTS+G+F+S + + IE +IA +M+P NGE +LRY+ Q Y H+D F Q
Sbjct: 114 RTSTGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQ 173
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ L+YL+D+EEGGET+FP + + +K GL VKPR+GD +LF
Sbjct: 174 FNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILF 231
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+S +G +D S+HG C V++GEKW ATKW+R
Sbjct: 232 WSAALDGNVDSNSLHGGCSVLRGEKWSATKWLR 264
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 151
Query: 133 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+++ +GGET+FP F ++ D C G+ VKP++G+ LL
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 211
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 212 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C ++++AK +L+ S +A DN G +RTSSG
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVA-------DNDSGESKFSEVRTSSG 92
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FI +D + IE+KI+ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 93 TFIPKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVR 150
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFEN----GMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKPR+GD LL
Sbjct: 151 GGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALL 210
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L P+ DP S+HG CPV++GEKW ATKWI
Sbjct: 211 FFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 84/213 (39%), Positives = 123/213 (57%), Gaps = 18/213 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG---------I 67
+VL+W PR F + E+C +I +A L ST+ VD + G +
Sbjct: 60 EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTV-------VDTSTGKARHGIESKV 112
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
RTS+G+F+S + ++ IE +IA +M+P NGE +LRY+ Q Y H+D F Q
Sbjct: 113 RTSTGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQ 172
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ L+YL+D+EEGGET+FP + + +K GL VKPR+GD +LF
Sbjct: 173 FNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILF 230
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+S +G +D S+HG C V++GEKW ATKW+R
Sbjct: 231 WSAALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 124/216 (57%), Gaps = 25/216 (11%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 39 VSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVA-------DNLPGDSKLSEVRTSSG 91
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 92 MFISKKKDP--IVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVR 149
Query: 133 KSQRVASFLVYLTDLEEGGETMFPF------ENGMNADGSYDYQKCI--GLKVKPRQGDG 184
R+A+ L+YLT++ GGET+FP G+ + D +C G+ VKPR+GD
Sbjct: 150 GGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNS--DLSECAKKGIAVKPRRGDA 207
Query: 185 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LLF+SL D S+H CPV++GEKW ATKWI
Sbjct: 208 LLFFSLHTTAIPDTDSLHAGCPVIEGEKWSATKWIH 243
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 155 bits (393), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 39 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 91
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 92 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 149
Query: 133 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+++ +GGET+FP F ++ D C G+ VKP++G+ LL
Sbjct: 150 GGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 209
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 210 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 243
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 155 bits (393), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 83/213 (38%), Positives = 129/213 (60%), Gaps = 5/213 (2%)
Query: 11 VTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRT 69
+ N+ +V+SW PR + +F +PE+C+ + +A+ L+ ST+ ++ G+ V + +RT
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
SSG+F++ E + + IE++IA + +P NGE +LRY+ Q Y H+D F
Sbjct: 130 SSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFN 189
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
+ QRVA+ L+YLTD EGGET FP + D + + G+ VKP +GD +LF+S
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAG--DGDCTCGGKIMKGISVKPTKGDAVLFWS 247
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 151
Query: 133 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+++ +GGET+FP F ++ D C G+ VKP++G+ LL
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 211
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 212 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 126/201 (62%), Gaps = 9/201 (4%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQG 66
DDS +++SW PRA + NF T E+CK +I +AK ++ ST+ K G++ D+
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 127
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQKY HYD F
Sbjct: 128 VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 185
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQG 182
+ QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VKP+ G
Sbjct: 186 EYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMG 245
Query: 183 DGLLFYSLLPNGTIDPTSIHG 203
D LLF+S+ P+ T+DP+S+HG
Sbjct: 246 DALLFWSMTPDATLDPSSLHG 266
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 126/201 (62%), Gaps = 9/201 (4%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQG 66
DDS +++SW PRA + NF T E+CK +I +AK ++ ST+ K G++ D+
Sbjct: 69 DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 126
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQKY HYD F
Sbjct: 127 VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 184
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQG 182
+ QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VKP+ G
Sbjct: 185 EYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMG 244
Query: 183 DGLLFYSLLPNGTIDPTSIHG 203
D LLF+S+ P+ T+DP+S+HG
Sbjct: 245 DALLFWSMTPDATLDPSSLHG 265
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 83/213 (38%), Positives = 128/213 (60%), Gaps = 5/213 (2%)
Query: 11 VTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRT 69
+ N+ +V+SW PR + +F +PE+C+ + +A+ L+ ST+ ++ G+ V + +RT
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
SSG+F++ E + IE++IA + +P NGE +LRY+ Q Y H+D F
Sbjct: 130 SSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFN 189
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
+ QRVA+ L+YLTD EGGET FP + D + + G+ VKP +GD +LF+S
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAG--DGDCTCGGKIMKGISVKPTKGDAVLFWS 247
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIAR 151
Query: 133 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 186
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKP++G+ LL
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 211
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 212 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 120/206 (58%), Gaps = 22/206 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
+S PR + +F + ++ +I++A+ L+ S +A DN G T S
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLS------- 99
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
E ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F + R A
Sbjct: 100 --EDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYA 157
Query: 139 SFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
+ L+YLTD+ EGGET+FP ++ +A S QK G+ V+PR+GD LLF++L P+G
Sbjct: 158 TVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK--GIAVRPRKGDALLFFNLNPDG 215
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIR 220
T D S+HG CPV+KGEKW ATKWIR
Sbjct: 216 TTDSVSLHGGCPVIKGEKWSATKWIR 241
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 129/211 (61%), Gaps = 13/211 (6%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
++LS +PRA + NF + E+C+ +IN+AK R + GE +++ RTSSG+F+
Sbjct: 113 EILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSS--RTSSGMFL 170
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D+ + IE +IA +T +P NGE +++ Y +GQK HYD
Sbjct: 171 DRGKDK--IVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGP 228
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSY-DYQKCIG--LKVKPRQGDGLLFYSLLP 192
RVA+ L+YL+D+EEGGET+FP +A ++ KC G L VKP+ GD LLF+S+ P
Sbjct: 229 RVATVLMYLSDVEEGGETVFP-----DAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKP 283
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+GT+D +S+HG PV++G KW +TKW+ +E
Sbjct: 284 DGTLDTSSLHGGSPVIRGNKWASTKWLHLRE 314
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/200 (36%), Positives = 111/200 (55%), Gaps = 34/200 (17%)
Query: 31 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGT 84
F + E+C+ +IN+AK + S + VD G RTSSG F+ +D+
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLV-------VDGLTGKGRESSARTSSGRFLERGKDK--I 422
Query: 85 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYL 144
+ IE++IA +T +PR+ A + + + G + GP RVA+ L+YL
Sbjct: 423 VQNIEQRIADITSIPRM---ARDFMLFTAG--------GVVTKNGGP----RVATVLMYL 467
Query: 145 TDLEEGGETMFPFEN-GMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
+D+EEGGET+FP +N+ Y + GL VKP+ GD LLF S+ P+GT+D +S+HG
Sbjct: 468 SDVEEGGETVFPNAKPNINSVSKYPEK---GLSVKPKMGDALLFRSMKPDGTLDTSSLHG 524
Query: 204 SCPVVKGEKWVATKWIRDQE 223
PV++G KW +TKW+ E
Sbjct: 525 GSPVIRGNKWASTKWLHLTE 544
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/223 (39%), Positives = 125/223 (56%), Gaps = 17/223 (7%)
Query: 5 QAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 64
++G V + VLSW PR + T E+C +I +A+ L S ++ D T
Sbjct: 39 RSGRTDVPDSRMVVLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVS-------DAT 91
Query: 65 QG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 118
G IRTSSG+F + E++ + IE ++A TMLP NGE +LRY+ QKY+
Sbjct: 92 TGEGGVSDIRTSSGMFYTRGEND--VVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYD 149
Query: 119 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC--IGLK 176
H+D F + R+A+ L+YL EEGGET+FP ++ +C GL
Sbjct: 150 PHHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVFPKIPVPAGQTRANFSECGMKGLA 209
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
VKP +GD +LF+S+ P+G +P S+HGSCPV++G KW ATKWI
Sbjct: 210 VKPVKGDAVLFWSIRPDGRFEPGSLHGSCPVIRGVKWSATKWI 252
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 128/223 (57%), Gaps = 26/223 (11%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKL-NLRPSTLA-LRKGETVDNTQGIRTSSGVFIS 76
+SW PRA + F + +C +I++AK + ST+ GE+V T +RTSSG+F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESV--TSKVRTSSGMFLD 102
Query: 77 AAEDESGTLDLIEEKIAKVTMLP-----------------RINGEAFNILRYKIGQKYNS 119
+DE + IEE+IA TMLP NGE+ ILRY G+KY
Sbjct: 103 KKQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEP 160
Query: 120 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLK 176
H+D ++ ++ RVA+ L+YL++++ GGET+FP E ++ + C G
Sbjct: 161 HFDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFA 220
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
VKP +G +LF+SL PN T+D S+HGSCPV++GEKW ATKWI
Sbjct: 221 VKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWI 263
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 80/210 (38%), Positives = 129/210 (61%), Gaps = 23/210 (10%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD + G +RTS
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDASTGGSKDSRVRTS 163
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F +
Sbjct: 164 SGMFLGRGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNT 221
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 222 KNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALL 281
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVAT 216
F+S+ P+G++D TS+HG P++ W+ T
Sbjct: 282 FWSMRPDGSLDATSLHGEIPIL----WLLT 307
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 153 bits (387), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 130/220 (59%), Gaps = 9/220 (4%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLN-LRPSTLALRKGETV 61
H G+++ +V+SW PRA + NF T E+C+ +IN A N L+ + GE +
Sbjct: 70 HVSEGENNRVKRWVEVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGI 129
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
+ + RTS+ + +D+ + IE++IA VT +P +GE +++RY +GQ Y H
Sbjct: 130 ETSY--RTSTEYVVERGKDK--IVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHV 185
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 177
D F+ + QR+A+ L+YL+++E GGET+FP N + + + +C GL +
Sbjct: 186 DYFEEEFSLVNGGQRIATMLMYLSNVEGGGETVFPIANANFSSVPWWNELSECGQTGLSI 245
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATK 217
KP+ GD LLF+S+ P+ T+DP ++H +CPV+KG KW TK
Sbjct: 246 KPKMGDALLFWSMKPDATLDPLTLHRACPVIKGNKWSCTK 285
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 152 bits (384), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 122/210 (58%), Gaps = 12/210 (5%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ LSW P A + F T +C+ I +A L+PST+ + D + IRTSSG+F+
Sbjct: 26 IERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTV-VDASTGGDASSEIRTSSGMFL 84
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEYG 130
AED+ ++ IE +IA T +P +GE F +LRY+ Q+Y +HYD F +E G
Sbjct: 85 GRAEDD--VIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKFNVKREKG 142
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
QR+ + L+YL+D+EEGGET+FP FE+G A L V+PR+GD L F S
Sbjct: 143 ---GQRMGTVLMYLSDVEEGGETVFPKFEDGTPAGSEASECARNKLAVRPRKGDALFFRS 199
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
L +G D S H CPV++G K+ ATKW+
Sbjct: 200 LRHDGVPDTFSEHAGCPVIRGVKFSATKWM 229
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 152 bits (384), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 122/224 (54%), Gaps = 21/224 (9%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 65
AG+ + VLSW PR + T E+C +++ ++ L S ++ D T
Sbjct: 57 AGEHRAQDSRMVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVS-------DATT 109
Query: 66 G------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 119
G IRTSSG+F E E + IE ++A TMLP NGE +LRY+ QKY+
Sbjct: 110 GAGAVSDIRTSSGMFYERGETE--LVKRIENRLAMWTMLPVENGEGIQVLRYEKTQKYDP 167
Query: 120 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG----MNADGSYDYQKCIGL 175
H+D F R+A+ L+YL EEGGET+FP G + S ++ GL
Sbjct: 168 HHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFPKVVGWVVQLTTTASAPCRQ--GL 225
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
VKP +GD +LF+S+ P+G DP S+HGSCPV+KG KW ATKWI
Sbjct: 226 AVKPAKGDAVLFWSIRPDGRFDPGSLHGSCPVIKGVKWSATKWI 269
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 123/206 (59%), Gaps = 11/206 (5%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+ LSW PR + F + ++C ++ +AK + + +A + T RTSSG+F+
Sbjct: 49 KALSWQPRIFVYKGFLSDDECDHLVTLAK---KGTMVAHNRSSYYRQT---RTSSGMFLR 102
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
+D + IEE+IA T+LPR N E I RY+ GQKY+ H+D FD + + + R
Sbjct: 103 KRQDP--VVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHTRGGPR 160
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPN 193
A+ L+YL+ +++GGET+FP G + D + +C GL VKP +GD +LF+SL +
Sbjct: 161 YATVLMYLSTVDKGGETVFPKAKGWESQPKDDTFSECAHKGLAVKPVKGDAVLFFSLHVD 220
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWI 219
G DP ++HGSCPV++GEKW A WI
Sbjct: 221 GGPDPLTLHGSCPVIQGEKWSAPNWI 246
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/189 (40%), Positives = 121/189 (64%), Gaps = 9/189 (4%)
Query: 40 IINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
+IN+AK ++ + S + + G++V + +RTSSG+F+ +D+ + IE++IA +
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSR--VRTSSGMFLKRGKDK--VIQTIEKRIADFAFI 56
Query: 99 PRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE 158
P NGE +L Y++GQKY HYD F + QR+A+ L+YL+D+EEGGET+FP
Sbjct: 57 PVENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAA 116
Query: 159 NGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 214
+ + D C GL VKP++GD LLF+S+ P+ T+DP+S+HG CPV++G KW
Sbjct: 117 KANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWS 176
Query: 215 ATKWIRDQE 223
+TKW+ +E
Sbjct: 177 STKWMHLEE 185
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 123/208 (59%), Gaps = 30/208 (14%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 151
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+A+ L+YL+++ +GGET+FP D Q C+ KP++G+ LLF++L
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFP-----------DAQVCL----KPKKGNALLFFNLQQ 196
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+ DP S+HG CPV++GEKW ATKWI
Sbjct: 197 DAIPDPFSLHGGCPVIEGEKWSATKWIH 224
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 123/212 (58%), Gaps = 9/212 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 73
+V+SW PR + F NF + E+C + +A+ L ST+ A KG D +RTSSG+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSD----VRTSSGM 116
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F+++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F +
Sbjct: 117 FVNSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRG 176
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
QRVA+ L+YLTD EGGET FP + + S + GL VKP +GD +LF+S+ +
Sbjct: 177 GQRVATMLMYLTDGVEGGETHFP--QAGDGECSCGGRIVRGLCVKPNKGDAVLFWSMGLD 234
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
G D SIH C V+KGEKW ATKW+R + +
Sbjct: 235 GNTDSNSIHSGCAVLKGEKWSATKWMRQKMTF 266
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 128/230 (55%), Gaps = 23/230 (10%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVD 62
A D + + +V+SW PR + F NF + E+C + +A+ L ST+ A KG D
Sbjct: 50 AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD 109
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+RTSSG+F+++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D
Sbjct: 110 ----VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHD 165
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-------GL 175
F + QRVA+ L+YLTD EGGET FP DG +CI GL
Sbjct: 166 YFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP----QAGDG-----ECICGGRLVRGL 216
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
VKP +GD +LF+S+ +G D S+H C VVKGEKW ATKW+R + +
Sbjct: 217 CVKPNKGDAVLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKWMRQKMTF 266
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/207 (42%), Positives = 124/207 (59%), Gaps = 11/207 (5%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LS P+A + F +C I AK L ST+ K G++V + IRTS G+F
Sbjct: 8 LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSN--IRTSDGMFFDR 65
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS--- 134
ED+ ++ IE +IA+ T +P NGE +LRY++GQKY H DAF ++ ++S
Sbjct: 66 HEDD--IIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAF-SDKFNTEESKGG 122
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 192
QR+A+ L+YL+D+EEGGET+FP G + +C G+ VK R+GD LLF+SL
Sbjct: 123 QRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECAQRGVAVKARKGDALLFWSLDI 182
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWI 219
+ +D S+HG CPV+KG KW ATKW+
Sbjct: 183 DSNVDELSLHGGCPVIKGTKWSATKWM 209
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 86/216 (39%), Positives = 125/216 (57%), Gaps = 16/216 (7%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+V++W PRAL F + +C II +A +L ST+ +G ++ + IRTSSG+FI
Sbjct: 42 VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLDE--IRTSSGMFI 99
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D + +EE++A +T LP + E +LRY++GQKY++H+D D E Q
Sbjct: 100 LKGHD--AVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRA 157
Query: 136 -------RVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCIGLKVKPRQGDG 184
R A+ L+YL+D+EEGGET FP + G+ A Y G+ VKPR+GD
Sbjct: 158 KGVLGGLRTATLLMYLSDVEEGGETAFPHGRWLDEGVQAAPPYTECASKGVVVKPRKGDA 217
Query: 185 LLFYSLLPNG-TIDPTSIHGSCPVVKGEKWVATKWI 219
+LF+SL NG D S+H CPVV+G K+ ATKW+
Sbjct: 218 ILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWV 253
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 149 bits (376), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 127/226 (56%), Gaps = 16/226 (7%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 65
A S + + LS P + F + E+C I +A ++ S+++L+ + ++
Sbjct: 251 ASSKSQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSS 310
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
RTS F+SA +DE L I+ ++A +T +PR + E +LRY G+KY+SH+D FD
Sbjct: 311 EWRTSQSAFLSARDDE--VLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFD 368
Query: 126 PQEYGPQKS----------QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC-IG 174
P Y KS R A+ YLTD+ +GGET+FP G A S ++ C IG
Sbjct: 369 PSAYRSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGAPAPRS--HKDCSIG 426
Query: 175 LKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGE-KWVATKWI 219
LKVKP++G ++FYSL +G +DP S+HG+CPV + KW A KWI
Sbjct: 427 LKVKPQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472
>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 162
Score = 149 bits (376), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 66/103 (64%), Positives = 86/103 (83%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+P G GDDS+++IPFQVLSW PR LYFP FAT + C++II++A+ L+PS LALRKGET
Sbjct: 60 LPSGDTGDDSISSIPFQVLSWRPRVLYFPKFATADHCETIISIARSQLKPSRLALRKGET 119
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRING 103
+D+T+ IRTSSG+FISA ED++G LD I+EKIA+ TM+PR NG
Sbjct: 120 LDSTREIRTSSGMFISADEDKTGILDFIDEKIARATMIPRANG 162
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 124/226 (54%), Gaps = 19/226 (8%)
Query: 10 SVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKG----ETVDNTQ 65
S +++SW PRAL F +C +I++A+ L PS + R G ++V Q
Sbjct: 9 STVRRRIELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQ 68
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
G+ +SSG F++ +D + +E++I T LP + E +L+Y++GQKY++HYD
Sbjct: 69 GL-SSSGTFLTKRQDS--VVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHG 125
Query: 126 PQE-------YGPQKSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCIG 174
E G Q R A+ L+YL+D+EEGGET FP + G A Y G
Sbjct: 126 SNEQAQLAIRRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEGAQAQPPYSECGSRG 185
Query: 175 LKVKPRQGDGLLFYSLLPNG-TIDPTSIHGSCPVVKGEKWVATKWI 219
+ VKPR+GD +LFYSL +G + D S+H CPV KG K+ AT WI
Sbjct: 186 VAVKPRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWI 231
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/210 (37%), Positives = 125/210 (59%), Gaps = 5/210 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+++SW PR + +F + E+C + +AK LR ST+ ++ G+ +++ +RTSSG+F+
Sbjct: 82 EIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESK--VRTSSGMFL 139
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
S+ E + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 140 SSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RVA+ L+YL+D EGGET FP + S + GL VKP +G+ +LF+S+ +G
Sbjct: 200 RVATMLMYLSDNVEGGETYFPMAG--SGKCSCGGKVVDGLSVKPIKGNAVLFWSMGLDGQ 257
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
DP+SIHG C V+ G KW ATKW+R + +
Sbjct: 258 SDPSSIHGGCEVLSGVKWSATKWMRQRATF 287
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/211 (36%), Positives = 124/211 (58%), Gaps = 9/211 (4%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ + L +PRA + NF + ++C +IN+AK + S ++ + RTSSG
Sbjct: 65 LDYTSLHAVPRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGR 124
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F++ +++ + IE++IA+ T +P NGE +IL Y++GQK+ H+D P + +
Sbjct: 125 FLAKGQNQ--LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKS 182
Query: 134 -SQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKCIGLKVKPRQGDGLL 186
QR A+ ++YL+ ++EGG T+FP + +Y K GL VKP+ GD LL
Sbjct: 183 LGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALL 242
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATK 217
F+S+ P+GT+DPTS+H S PVVKG+KWV K
Sbjct: 243 FWSVKPDGTLDPTSLHASSPVVKGDKWVGVK 273
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 8/66 (12%)
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCI------GLKVKPRQGDGLLFYSLLPNGTID 197
+ ++EEGGET+FP N S + K + GL +KP+ GD L F+S+ P+GT+D
Sbjct: 9 ILNIEEGGETVFPAAN--KCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLD 66
Query: 198 PTSIHG 203
TS+H
Sbjct: 67 YTSLHA 72
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 81/194 (41%), Positives = 117/194 (60%), Gaps = 21/194 (10%)
Query: 31 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 90
T E+C+ +IN+AK ++ ST+ G++VDN+ RTSSG FI+ D+ L IE+
Sbjct: 12 ITTKEECEHLINIAKPSMHKSTVDDETGKSVDNSA--RTSSGTFINRGHDK--ILRNIEQ 67
Query: 91 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEG 150
+IA T +P NGE+ NIL Y++GQKY H D F + + E+G
Sbjct: 68 RIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTDE-------------INTKNGGEQG 114
Query: 151 GETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 206
GET+FPF G + + + C GL +KP+ GD LLF+S+ P+GT+DP S+HG+CP
Sbjct: 115 GETVFPFAEGNFSSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACP 174
Query: 207 VVKGEKWVATKWIR 220
V+KG+KW TKW+R
Sbjct: 175 VIKGDKWSCTKWMR 188
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/180 (43%), Positives = 111/180 (61%), Gaps = 14/180 (7%)
Query: 55 LRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNI 108
+ K VD+T G +RTSSG+F+ D+ + IE++IA T +P +GE +
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQV 58
Query: 109 LRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD 168
L Y++GQKY H+D F + QR+A+ L+YL+D+EEGGET+FP N +NA
Sbjct: 59 LHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPW 117
Query: 169 YQK---CI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
Y + C GL VKP+ GD LLF+S+ P+ T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 118 YNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 177
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PRA + NF + ++C +IN+AK + S ++ + RTSSG F++ +++
Sbjct: 83 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 141
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-SQRVASFL 141
+ IE++IA+ T +P NGE +IL Y++GQK+ H+D P + + QR A+ +
Sbjct: 142 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 200
Query: 142 VYLTDLEEGGETMFPFENGMNADGSY------DYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
+YL+ ++EGG T+FP + +Y K GL VKP+ GD LLF+S+ P+GT
Sbjct: 201 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 260
Query: 196 IDPTSIHGSCPVVKGEKWVATK 217
+DPTS+H S PVVKG+KWV K
Sbjct: 261 LDPTSLHASSPVVKGDKWVGVK 282
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 8/76 (10%)
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCI------GLKVKPRQGDGLLFYSLLPNGTID 197
+ ++EEGGET+FP N S + K + GL +KP+ GD L F+S+ P+GT+D
Sbjct: 9 ILNIEEGGETVFPAAN--QCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLD 66
Query: 198 PTSIHGSCPVVKGEKW 213
TS+HGS PV++G++W
Sbjct: 67 YTSLHGSYPVIRGDEW 82
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 90/215 (41%), Positives = 123/215 (57%), Gaps = 18/215 (8%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE-TVDNTQGIRTSSGVFI 75
Q + PRA YF NF T + ++ +A L+ ST+ GE VDN IRTS G+FI
Sbjct: 2 QQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGVVDN---IRTSYGMFI 58
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGPQKS 134
+D + IE++I+ T LP + E +LRY GQ Y +HYD+ D E GP+
Sbjct: 59 RRLQDP--VVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPK-- 114
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKCI--GLKVKPRQGDGLL 186
R+A+FL+YL+D+EEGGET FP N + AD S + C + KP+ GD +L
Sbjct: 115 WRLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVL 173
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
FYS PN T+DP ++H CPV+KG KW A W+ D
Sbjct: 174 FYSFYPNMTMDPAAMHTGCPVIKGVKWAAPVWMHD 208
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 81/209 (38%), Positives = 121/209 (57%), Gaps = 16/209 (7%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLS PRA + NF + E+C+ +IN+AK ++ S + VD G +RTS
Sbjct: 86 EVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLV-------VDGVTGQGILNSVRTS 138
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG F+ +D+ + +E +IA +T +P NGE I+ Y++GQK+ HYD
Sbjct: 139 SGTFLERGKDK--IVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRIT 196
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
RVA+ L+YL+D+EEGGET+FP N + Y GL VKP+ GD LLF+S+
Sbjct: 197 NNGGPRVATVLMYLSDVEEGGETVFP-NAKPNFNSVSKYHPGKGLVVKPKMGDALLFWSV 255
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
P+G++D S+HG PV++G KW + K +
Sbjct: 256 KPDGSLDTASLHGGSPVIRGSKWASNKLL 284
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 110/177 (62%), Gaps = 12/177 (6%)
Query: 57 KGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 110
K VD+T G +RTSSG+F+ D+ + +IE++IA T +P +GE +L
Sbjct: 3 KSTVVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLH 60
Query: 111 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--D 168
Y++GQKY H+D F + QR+A+ L+YL+D+EEGGET+FP N + + +
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNE 120
Query: 169 YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+C GL VKP+ GD LLF+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 123/201 (61%), Gaps = 11/201 (5%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL--ALRKGETVDNTQGIRTSSGVFISAAED 80
PRA + NF + ++C+ ++ ++K L S + A G ++ +RTS+G FIS D
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQTGGSSLSE---VRTSTGTFISRKYD 321
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASF 140
+ + +EE+I + +P+ + EAF ILRY+ GQ+Y +H+D F + ++ R+A+
Sbjct: 322 D--IIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYFFHKSG--MRNNRIATV 377
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDP 198
L+YL+D+EEGGET+FP + + Y +C G +K R+GD LLF+S+ P G +D
Sbjct: 378 LLYLSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKPGGELDA 437
Query: 199 TSIHGSCPVVKGEKWVATKWI 219
S H CPV+KGEKW ATKW+
Sbjct: 438 GSSHAGCPVIKGEKWTATKWM 458
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/177 (40%), Positives = 110/177 (62%), Gaps = 12/177 (6%)
Query: 57 KGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 110
K VD+T G +RTSSG+F+ D+ + +IE++I T +P +GE +L
Sbjct: 3 KSTVVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRITDYTFIPVDHGEGLQVLH 60
Query: 111 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--D 168
Y++GQKY H+D F + QR+A+ L++L+D+EEGGET+FP N ++ + +
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDANVNDSSLPWYNE 120
Query: 169 YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+C GL VKP+ GD LLF+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 78/211 (36%), Positives = 121/211 (57%), Gaps = 7/211 (3%)
Query: 12 TNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTS 70
+++ + +SW PRA + + ++C IIN A+ N+ + + L + + V N +R +
Sbjct: 49 SSLMVERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNK--LRNN 106
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
+I + D+ +D IE +IA+ T LP +GE F+I++Y GQ Y H D D +
Sbjct: 107 KEAYIDGSADD--VIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHP 164
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFY 188
++R+A+ ++YL+D+ EGGET+FP G Y KC G+ VKP +GD LL Y
Sbjct: 165 RLGNERIATMIIYLSDVVEGGETVFPNSTMQPHVGDAAYSKCAQQGIAVKPVKGDALLLY 224
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+LL NG D S+H CPV++G KW ATK I
Sbjct: 225 NLLENGRNDGESLHQGCPVIRGVKWTATKRI 255
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 108/166 (65%), Gaps = 7/166 (4%)
Query: 58 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 117
GE+ D+ +RTSSG+F++ +D+ + +E K+A T LP NGEA IL Y+ GQKY
Sbjct: 9 GESEDSE--VRTSSGMFLTKRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKY 64
Query: 118 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--G 174
+ H+D F ++ R+A+ L+YL+++ +GGET+FP G D + KC G
Sbjct: 65 DPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQG 124
Query: 175 LKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
VKPR+GD LLF++L NGT DP S+HGSCPV++GEKW AT+WI
Sbjct: 125 YAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIH 170
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 8/202 (3%)
Query: 26 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 85
L F + ++C +I +A LR S++ K +++ RTS G F+ D +
Sbjct: 1 LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSRN-RTSWGAFLR--RDHDNIV 57
Query: 86 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLT 145
IE++I+ +T +P+ GE+ ++RYK GQK+ H D + E R+ + L+YLT
Sbjct: 58 SGIEDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLT 117
Query: 146 DLEEGGETMFP--FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
++E GGET+FP N +N D S + +C G+ ++PR+GDGLLF+ P+G IDP S
Sbjct: 118 NVENGGETVFPRALANVIN-DYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSF 176
Query: 202 HGSCPVVKGEKWVATKWIRDQE 223
HG CPVVKGEKW+ATK++ + E
Sbjct: 177 HGGCPVVKGEKWLATKFLHEHE 198
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 117/208 (56%), Gaps = 9/208 (4%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGET-VDNTQGIRTSSGVF 74
QVL R + NF T E+C II +A+ + R + G++ +DN +RTS G F
Sbjct: 64 QVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGVVETDSGKSKIDN---VRTSKGTF 120
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
++ D + IE +IAK T++P NGE +L+Y+ GQ+Y HYD F +
Sbjct: 121 LNRGHDS--VIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANGG 178
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIG--LKVKPRQGDGLLFYSLLP 192
R + L+YL D+EEGGET FP N D ++ +C L KP++G+ +LF+S+ P
Sbjct: 179 NRYLTVLMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKP 238
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G ++ S+H +CPV+KG KW A KW+
Sbjct: 239 TGELERRSLHTACPVIKGVKWSAPKWVH 266
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 121/201 (60%), Gaps = 19/201 (9%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + E+C +I +AK ++ ST+ VD+T G +RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 187 FYSLLPNGTIDPTSIHGSCPV 207
F+S+ P+ T+DP S+H + V
Sbjct: 271 FWSMKPDATLDPLSLHDTLRV 291
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/196 (39%), Positives = 119/196 (60%), Gaps = 19/196 (9%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + E+C +I +AK ++ ST+ VD+T G +RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 186
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 187 FYSLLPNGTIDPTSIH 202
F+S+ P+ T+DP S+H
Sbjct: 271 FWSMKPDATLDPLSLH 286
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 121/205 (59%), Gaps = 5/205 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VLSW PR + NF + E+C + +A L+ ST+ G+ + + +RTSSG+F+
Sbjct: 76 EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSD--VRTSSGMFL 133
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
S E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+A+ L+YL D EGGET FP + + + S + GL VKP +G+ +LF+S+ +G
Sbjct: 194 RIATMLMYLGDNVEGGETHFP--SAGSDECSCGGKLTKGLCVKPVKGNAVLFWSMGLDGQ 251
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIR 220
DP S+HG CPV+ GEKW ATKW+R
Sbjct: 252 SDPDSVHGGCPVLAGEKWSATKWMR 276
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 114/202 (56%), Gaps = 14/202 (6%)
Query: 24 RALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
R + F TPE+C I A+ L R + G +V IRTS G+F ED
Sbjct: 44 RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSV--VSDIRTSDGMFFERGED-- 99
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
L+ +E+++A TM P GEA +LRY+ QKY+SH + F +E R A+ L
Sbjct: 100 AILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLT 159
Query: 143 YLTDLEEGGETMF---PFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTID 197
YLTD EEGGET+F P G+N + +C L VKPR+GD +LF+S+ NG ++
Sbjct: 160 YLTDTEEGGETVFPKIPAPGGVNV----GFSECAKYNLAVKPRKGDAILFHSMKTNGQLE 215
Query: 198 PTSIHGSCPVVKGEKWVATKWI 219
S+HG+CPV+KGEK+ TKWI
Sbjct: 216 ERSLHGACPVIKGEKFSMTKWI 237
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 115/201 (57%), Gaps = 12/201 (5%)
Query: 24 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESG 83
R + F TPE+C I A+ L S + + G IRTS G+F ED
Sbjct: 44 RIYLWKGFLTPEECDYIRMKAEKRLERSGV-VDTGSGGSVVSDIRTSDGMFFERGED--A 100
Query: 84 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVY 143
++ +E+++A TM P GE+ +LRY+ QKY+SH+D F ++ R A+ L+Y
Sbjct: 101 IIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRWATVLLY 160
Query: 144 LTDLEEGGETMF---PFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDP 198
LT+ EEGGET+F P NG+N + +C L VKP +GD LLF+S+ P G ++
Sbjct: 161 LTETEEGGETVFPKIPAPNGINV----GFSECAKYNLAVKPHKGDALLFHSMKPTGELEE 216
Query: 199 TSIHGSCPVVKGEKWVATKWI 219
S+HG+CPV++GEK+ TKWI
Sbjct: 217 RSMHGACPVIRGEKFSMTKWI 237
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 119/207 (57%), Gaps = 5/207 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PR + NF + ++C + +A L ST+ + G+ V + RTSSG+F+
Sbjct: 83 EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSD--FRTSSGMFL 140
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
S E + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 141 SHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 200
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+A+ L+YL++ EGGET FP + + S + GL VKP +GD +LF+S+ +G
Sbjct: 201 RIATMLMYLSENIEGGETYFP--KAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQ 258
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQ 222
DP SIHG C V+ GEKW ATKW+R +
Sbjct: 259 SDPKSIHGGCEVLSGEKWSATKWMRQK 285
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/216 (35%), Positives = 114/216 (52%), Gaps = 14/216 (6%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ + LS +P L F + ++C I A+ ++ S + L + RTS
Sbjct: 5 VTLETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQSA 64
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
FI A +D L I+ + A + +PR + E +LRY + +KY+SH D FDP Y K
Sbjct: 65 FIRAHDD--AILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTKDK 122
Query: 134 S----------QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
R+A+ YL+D+E+GGET+FP NG D + GLKVKP +G
Sbjct: 123 RTLALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMKDCK--TGLKVKPEKGK 180
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
++FYS+ P+G +D S+HG+CPV KG KW A KW+
Sbjct: 181 VIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 122/208 (58%), Gaps = 17/208 (8%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE-TVDNTQGIRTSSGVFISAAEDE 81
PRA F NF T + ++ +A L+ ST+ KGE VDN IRTS G+FI D
Sbjct: 55 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSKGEGVVDN---IRTSFGMFIRRLSDP 111
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY-GPQKSQRVASF 140
+ IE++I+ T LP + E +LRY GQ Y +HYD+ ++ GP+ R+A+F
Sbjct: 112 --IIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVGPK--WRLATF 167
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQ-----KCIG--LKVKPRQGDGLLFYSLLPN 193
L+YL+D+EEGGET FP +N + D + + +C + KP+ GD +LFYS LPN
Sbjct: 168 LMYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFLPN 226
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRD 221
T+DP ++H CPV+KG KW A W+ D
Sbjct: 227 NTMDPAAMHTGCPVIKGIKWAAPVWMHD 254
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/211 (37%), Positives = 120/211 (56%), Gaps = 7/211 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG-IRTSSGVFI 75
++L+W PR + +F + E+C + MA+ L+ ST+ +T Q +RTSSG+F+
Sbjct: 82 EILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVV--DAQTGKGIQSDVRTSSGMFL 139
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
S + + IE++I+ + +P NGE +LRYK Q Y H+D F + Q
Sbjct: 140 SPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDYFSDSFNLKRGGQ 199
Query: 136 RVASFLVYLTDLEEGGETMFPFE-NGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+ L+YL+D EGGET FP +G G + GL V P +G+ +LF+S+ +G
Sbjct: 200 RVATMLIYLSDNVEGGETYFPMAGSGFCRCGGKSVR---GLSVAPVKGNAVLFWSMGLDG 256
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
DP SIHG C V+ GEKW ATKW+R + +
Sbjct: 257 QSDPNSIHGGCEVLAGEKWSATKWMRQRSTH 287
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 98/157 (62%), Gaps = 5/157 (3%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTSSG+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F
Sbjct: 15 VRTSSGMFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHD 72
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGD 183
+ R+A+ L+YL+D+ +GGET+FP G D + C G VKP +GD
Sbjct: 73 KNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGD 132
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
LLF+SL P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 133 ALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIH 169
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 139 bits (351), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 119/205 (58%), Gaps = 5/205 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VL+W PR + NF + E+C + +A L S + + G+ + + +RTSSG+F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+A+ L+YL+D EGGET FP + + S + GL VKP +G+ +LF+S+ +G
Sbjct: 200 RIATMLMYLSDNIEGGETYFPLAG--SGECSCGGKLVKGLSVKPIKGNAVLFWSMGLDGQ 257
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIR 220
DP S+HG C V+ GEKW ATKW+R
Sbjct: 258 SDPNSVHGGCEVISGEKWSATKWMR 282
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 16/208 (7%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PRA YF NF T + ++ +A L+ ST+ KGE V + IRTS G+FI D
Sbjct: 26 PRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKGEGVVDD--IRTSYGMFIRRLSDP- 82
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY-GPQKSQRVASFL 141
+ IE++I+ T LP + E ILRY GQ Y +HYD+ ++ GP+ R+A+FL
Sbjct: 83 -VVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVGPK--WRLATFL 139
Query: 142 VYLTDLEEGGETMFPFENGMNADGSY------DYQKCI--GLKVKPRQGDGLLFYSLLPN 193
+YL+D+EEGGET FP N + AD S + C + KP+ GD +LFYS PN
Sbjct: 140 MYLSDVEEGGETAFP-HNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLFYSFYPN 198
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRD 221
T+DP S+H CPV+KG KW A W+ D
Sbjct: 199 NTMDPASMHTGCPVIKGVKWAAPVWMHD 226
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 114/204 (55%), Gaps = 26/204 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
+SW PRA F NF T E+ I+ +AK ++ ST+ G +V++ IRTS G F+
Sbjct: 35 VSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVEDQ--IRTSYGTFLKRL 92
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+D + +E+++A T L + E ILRY IGQKY +HYD+ D S RV
Sbjct: 93 QDP--IVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYDSLD------NDSPRVC 144
Query: 139 SFLVYLTDL--EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+ L+YL+D+ + GGET FP G+ Y P++GD LLFYSL P+GT
Sbjct: 145 TVLLYLSDVPADGGGETAFP---GVRRQALY-----------PKKGDALLFYSLKPDGTS 190
Query: 197 DPTSIHGSCPVVKGEKWVATKWIR 220
D S+H CP++ G KW ATKWI
Sbjct: 191 DAYSLHTGCPIISGVKWTATKWIH 214
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 114/219 (52%), Gaps = 16/219 (7%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
N+ + LS P L F E+C I+ A ++ S ++L+ + RTS
Sbjct: 264 NVTIETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKGRPASDWRTSQS 323
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F++A D L IE + A +T +P + E +LRY + +KY++H+D FDP Y
Sbjct: 324 TFVAAMGDP--ILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHHDFFDPSSYRSD 381
Query: 133 ----------KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 182
K R A+ YLTD+ GGET FP G A D+ C GLKVKP++G
Sbjct: 382 PGTLQLIENGKKNRYATVFWYLTDVARGGETCFPRHGG--APPPRDFSMCTGLKVKPQKG 439
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGE--KWVATKWI 219
++FYSL +G +DP S+HG+CPV+ E KW A KW+
Sbjct: 440 KVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/217 (36%), Positives = 121/217 (55%), Gaps = 27/217 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + T QC++++ + +R ST+ VD+ G IRTS
Sbjct: 6 LSWYPRAFALRDALTEAQCEAVLRATRARVRRSTV-------VDSVTGESKVDPIRTSKQ 58
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-----AFDPQ 127
F++ E+ + I + ++ VTMLP + E +L Y++G+KY++H D + +
Sbjct: 59 TFLNRDEE---VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGR 115
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN---ADGSYDYQKCIGLKV--KPRQG 182
E +RVA+ L+YL + E GGET FP ++ A+G+ + KC +V KPR+G
Sbjct: 116 ELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGT-SWSKCAEHRVAMKPRRG 174
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
DGL+F+S+ PNG ID ++H CPVV G KW AT W+
Sbjct: 175 DGLIFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWV 211
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 77/218 (35%), Positives = 121/218 (55%), Gaps = 10/218 (4%)
Query: 7 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
G +++ +I + LS P NF E+CK I A +++PS ++L +
Sbjct: 184 GLETLGSIDMKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPDTN 243
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS+ F+ + D L I+ ++ + T +P+ + E +L+Y GQ+Y +H+D D
Sbjct: 244 WRTSTTYFMPSTRDP--LLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDE 301
Query: 127 QEY----GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-GLKVKPRQ 181
+ G +K++ + F YL+D+EEGGET+FP G G D+ C GLKVKP +
Sbjct: 302 RTMRNMDGGRKNRMITVFW-YLSDVEEGGETIFPRYGGRT--GRVDFSDCTTGLKVKPVE 358
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
G +FYSL P+G D S+HG+CPV+ G+KW A KW+
Sbjct: 359 GKVAMFYSLKPDGQFDDFSLHGACPVITGQKWAANKWV 396
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 125/224 (55%), Gaps = 32/224 (14%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+SW PRA + N T +C ++ +A+ +R ST+ VD+T G IRTS
Sbjct: 4 ISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTV-------VDSTTGESKVDPIRTSEQ 56
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFN-----ILRYKIGQKYNSHYDAFD-- 125
F++ + +IE+++ + TMLP NGE +L+Y GQKY++H+D +
Sbjct: 57 CFLNRGH--FPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELD 114
Query: 126 ---PQEYGPQKSQRVASFLVYLTDLEE--GGETMFPFENGMN--ADGSYDYQKCI--GLK 176
++ + RVA+ L+YL+D+++ GGET FP ++ AD + +C +
Sbjct: 115 TASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVA 174
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
VKP++GDGLLF+S+ P G ID S+H CPV+ G+ W ATKWI
Sbjct: 175 VKPKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWIH 217
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 82/212 (38%), Positives = 114/212 (53%), Gaps = 23/212 (10%)
Query: 21 WM----PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
WM A F NF T E+C I+ +AK +L S + VD G IRTS
Sbjct: 33 WMQVLDAEARIFINFLTEEECDHIVALAKPHLERSGV-------VDTATGGSEISDIRTS 85
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
G+F+ D+ T+ IEE+IA+ T+LP NGE +L Y G+KY+ ++ FD
Sbjct: 86 KGMFLERGHDD--TVAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYDDYF--FDKVNGE 141
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFY 188
R A+ L+YL +EEGGET+FP D + +C L KP +G +LF+
Sbjct: 142 SNGGNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFH 201
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
S+ P+G ++ S+H +CPVVKGEKW A KWI
Sbjct: 202 SIKPSGDLERRSLHTACPVVKGEKWSAPKWIH 233
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 114/211 (54%), Gaps = 5/211 (2%)
Query: 11 VTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTS 70
+ +I +VLSW PR + T E+C ++ A L S + ++ IRTS
Sbjct: 11 LLHILLKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTS 70
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
G+F EDE + +E ++++ +++P +GE +LRY+ G++Y H+D F
Sbjct: 71 YGMFFDRGEDE--VVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSV 128
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLF 187
R+A+ L+YL + E GGET+FP + Y GL VKPR+GD +LF
Sbjct: 129 QNGGNRLATILMYLAEPEFGGETVFPNVKAPPEQTLEAGYSECATQGLAVKPRKGDAVLF 188
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 218
+SL GT+D S+HGSCP +KG K+ ATKW
Sbjct: 189 FSLRTEGTLDKGSLHGSCPTLKGFKFAATKW 219
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 68/141 (48%), Positives = 94/141 (66%), Gaps = 13/141 (9%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 147
IE+++A T LP N E+ +LRY+ GQKY++H+D F + QRVA+ L+YLTD+
Sbjct: 24 IEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDV 83
Query: 148 EEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDGLLFYSLLPNGTIDPT 199
+GGET+FP NA+GS+ + +C GL VKP++GD LLF++L N T D
Sbjct: 84 NKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTG 138
Query: 200 SIHGSCPVVKGEKWVATKWIR 220
S+HGSCPV++GEKW ATKWI
Sbjct: 139 SLHGSCPVIEGEKWSATKWIH 159
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 119/205 (58%), Gaps = 5/205 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VL+W PR + NF + E+C + +A L ST+ + G+ + + +RTSSG+F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
++ E + + IE++I+ + +P NGE +LRY+ Q Y +D F + Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGGQ 199
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
+A+ L+YL+D EGGET FP + + S + GL VKP +G+ +LF+S+ +G
Sbjct: 200 GIATMLMYLSDNIEGGETYFPLAG--SGECSCGGKLVKGLSVKPIKGNAVLFWSMGLDGQ 257
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIR 220
DP S+HG C V+ GEKW ATKW+R
Sbjct: 258 SDPNSVHGGCEVISGEKWSATKWLR 282
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 119/212 (56%), Gaps = 17/212 (8%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
V+SW PRA NF T +C I ++A++++R ST+ G +V RTS G FI+
Sbjct: 3 VISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
++ + +E+++A +T P + E +LRY +GQ Y+ H D+ + S R+
Sbjct: 61 Y--QTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLE------NDSPRM 112
Query: 138 ASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLL 191
A+ L+YL++ E GGET FP + + A + C+ + KPR+GD LLF+S+
Sbjct: 113 ATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWSVK 172
Query: 192 PNG-TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+G T DP S H CPV++G KW AT W+ Q
Sbjct: 173 PDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQ 204
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 114/203 (56%), Gaps = 11/203 (5%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL--ALRKGETVDNTQGIRTSSGVFISAA-- 78
PRA F + +C ++ A+ N+ S + A G + N IRTS+G F+
Sbjct: 166 PRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSN---IRTSTGSFVPTVFP 222
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+ + IE +IA T +P +GE +LRY+IGQ+Y SH+D F + G K+ R+A
Sbjct: 223 LGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFFHE--GGMKNNRIA 280
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTI 196
+ L+YL+D+++GGET+FP + + C G+ V P++GD +LF+++ G +
Sbjct: 281 TVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNMKVGGDL 340
Query: 197 DPTSIHGSCPVVKGEKWVATKWI 219
D S H CPVV GEKW ATKW+
Sbjct: 341 DGGSTHAGCPVVLGEKWTATKWL 363
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 119/216 (55%), Gaps = 16/216 (7%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ +S PRA + NF T E+ ++ I A+ +R S + + + + T RTSSG ++
Sbjct: 78 IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEV-VNEADGTSKTSDERTSSGGWV 136
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
S + E + IE ++A TMLPR GE ++RY+ GQ+Y +H D F + Q
Sbjct: 137 SGEDSE--VMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQ 194
Query: 136 RVASFLVYLTDLEEGGETMFPF---------ENGMNADGSYDYQKCIG----LKVKPRQG 182
R A+ L+YL+D+EEGGET+FP E G+ + G L VKPR+G
Sbjct: 195 RAATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPRRG 254
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 218
D LLF+++ NG +D + H CPVV+G KW AT+W
Sbjct: 255 DALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRW 290
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 108/203 (53%), Gaps = 13/203 (6%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 81
PRA + F T +C I A L S + GE V + IRTS G+F ED+
Sbjct: 83 PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSA--IRTSDGMFFDRGEDD 140
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ--EYGPQKSQRVAS 139
+D +E +I+ T LP NGE +LRY GQKY++H DAF + QRVA+
Sbjct: 141 --VVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGGQRVAT 198
Query: 140 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTID 197
L+YL D+++GGET+FP G Y C G+ VKPR+GD LLF+S+ T
Sbjct: 199 VLMYLNDVDDGGETVFPETTAKPHVGDERYSACARRGVAVKPRRGDALLFWSMDETFT-- 256
Query: 198 PTSIHGSCPV-VKGEKWVATKWI 219
S+HG CPV G KW TKWI
Sbjct: 257 -RSLHGGCPVGAGGVKWSMTKWI 278
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 74/222 (33%), Positives = 127/222 (57%), Gaps = 36/222 (16%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVD------------ 62
+V+SW PRA + NF T E+C+ +I++AK ++ S + ++ G+++D
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVF 139
Query: 63 ----------NTQ-------GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEA 105
N++ +RTSSG F++ DE ++ IE +I+ T +P NGE
Sbjct: 140 TFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDE--IVEEIENRISDFTFIPPENGEG 197
Query: 106 FNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADG 165
+L Y++GQ+Y H+D F + + QR+A+ L+YL+D++EGGET+FP G +D
Sbjct: 198 LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDV 257
Query: 166 SY--DYQKC--IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
+ + +C GL V P++ D LLF+S+ P+ ++DP+S+HG
Sbjct: 258 PWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHG 299
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 117/205 (57%), Gaps = 11/205 (5%)
Query: 24 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESG 83
R +F T E+ I+ +++ L S + G + ++ IRTS GVF+ ED
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQ--IRTSFGVFLERGEDP-- 56
Query: 84 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVY 143
+ +EE+I+ +T++P NGE +LRY+ QKY++H+D F ++ R A+ L+Y
Sbjct: 57 VVKGVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMY 116
Query: 144 LTDLEEGGETMFPFENGMNADGSYD--YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPT 199
L D EEGGET+FP + A G + + +C L KP++G +LF+S+ P G ++
Sbjct: 117 LVDTEEGGETVFP---NIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173
Query: 200 SIHGSCPVVKGEKWVATKWIRDQEQ 224
S+H +CPV+KG KW A KWI + Q
Sbjct: 174 SLHTACPVIKGIKWSAAKWIHVKPQ 198
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 115/222 (51%), Gaps = 24/222 (10%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRT 69
+ +SW PRA + NF T +C ++ + T + + VD+ G IRT
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIG-------TQRVSRSLVVDSQTGQSKLDDIRT 545
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
S G ED + IEE+IA+ T LP +GE ILRY GQKY++H+D FD +
Sbjct: 546 SYGAAFGRGEDP--VIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVH 603
Query: 130 GPQ---KSQRVASFLVYLTDLEEGGETMFPFEN--GMNADGSYDYQKC---IGLKVKPRQ 181
R A+ L+YL+++E GGET P + M+ + C +GL ++PR+
Sbjct: 604 HRSYLVDGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENPSPCAAKMGLSIRPRK 663
Query: 182 GDGLLFYSLLPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQ 222
GD LLFY + G D ++H SCP +KG KW ATKWI +
Sbjct: 664 GDALLFYDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSK 705
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 121/229 (52%), Gaps = 40/229 (17%)
Query: 6 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPS--TLALRKGETV 61
AGD V +VL M PR + F N +PE+C+++I A+ + S A GE V
Sbjct: 83 AGDRRV-----EVLMAMANPRVVLFGNLLSPEECQAVIEAARTRMARSLTVQAASGGEEV 137
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
+ RTS G+F E+E+ + +EE+IA++ P NGE +L Y+ G +Y HY
Sbjct: 138 NKD---RTSDGMFFQRGENEA--VARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHY 192
Query: 122 DAFDPQEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLK 176
D FDP E G + QRVA+ ++YL D GG T FP + L+
Sbjct: 193 DYFDPAEPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFP---------------DVPLE 237
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 223
+ PRQG+ + F G P+S +HG PV++GEKW+ATKW+R++E
Sbjct: 238 IGPRQGNAVFFSY----GRAHPSSRTLHGGAPVIEGEKWIATKWLRERE 282
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/133 (48%), Positives = 87/133 (65%), Gaps = 4/133 (3%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 147
IE++IA T +P NGE +L Y +GQK+ H+D D R A+FL+YL+D+
Sbjct: 14 IEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGPRKATFLMYLSDV 73
Query: 148 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV 207
EEGGET+FP A GS K G+ VKP+ GD LLF+S+ P+G++DP S+HG+ PV
Sbjct: 74 EEGGETVFP---NATAKGSAPSAKS-GISVKPKMGDALLFWSMKPDGSLDPKSLHGASPV 129
Query: 208 VKGEKWVATKWIR 220
+KG+KW ATKWI
Sbjct: 130 IKGDKWSATKWIH 142
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 113/212 (53%), Gaps = 18/212 (8%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ +SW PR + NF + +C+ I A ++ S++ G +V +T IRTS G FI
Sbjct: 2 IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTNGSSVLDT--IRTSYGTFI 59
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
D ++ + ++A T P N E +LRY GQKY +H D+ S
Sbjct: 60 RRRHDP--VVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLI------DDSP 111
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYD-----YQKCI--GLKVKPRQGDGLLFY 188
R+A+ L+YL D E GGET FP ++G D S + +C + +P++GD L+F+
Sbjct: 112 RMATVLLYLHDTEYGGETAFP-DSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFW 170
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
S+ P+GT DP S+H CPVV G KW AT W+
Sbjct: 171 SIKPDGTHDPLSLHTGCPVVTGVKWTATSWVH 202
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 115/222 (51%), Gaps = 28/222 (12%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ +SW PR + NF T +C+ I A ++ S++ + G +V T IRTS G FI
Sbjct: 2 IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQNGSSV--TDNIRTSYGTFI 59
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAF------------NILRYKIGQKYNSHYDA 123
D ++ I ++A T P N E +LRY IGQKY +H D+
Sbjct: 60 RRRHDP--VIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDS 117
Query: 124 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG-MNADGSYD---YQKCIGLKV-- 177
S R+A+ L+YL D EEGGET FP + + D + + +C V
Sbjct: 118 L------IDDSPRMATVLLYLHDTEEGGETAFPDSSSWLTPDLATRMGPFSECAQGHVAF 171
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+P++GD L+F+S+ P+GT DP S+H CPVVKG KW AT W+
Sbjct: 172 RPKKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWV 213
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 119/213 (55%), Gaps = 16/213 (7%)
Query: 16 FQVLSW-MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG-IRTSSGV 73
Q++S PRA + F T E+C +I+ +K + S + ET + IRTS+G
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRMSKSGVV--DAETGGTAKSDIRTSTGS 234
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F+ ++ + +E+++A +MLP + EA +LRY++ Q+Y +HYD F + G
Sbjct: 235 FVGIGAND--LMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYFFHK--GGMA 290
Query: 134 SQRVASFLVYLTDLEEGGETMFP-----FENGMNADGSYDYQKC--IGLKVKPRQGDGLL 186
+ R+ + L+YL + E GGET+FP E G ++ +C G R+GD L+
Sbjct: 291 NNRIVTILMYLHEPEFGGETVFPNTEVPLERAEKGWGK-NFSECGNRGRAAVVRKGDALI 349
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
F+S+ P G +DP S H CPVV+GEKW ATKWI
Sbjct: 350 FWSMKPGGELDPGSSHAGCPVVRGEKWTATKWI 382
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 79/218 (36%), Positives = 117/218 (53%), Gaps = 29/218 (13%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+ +SWMPRA + F TP +C +I +A L S + + +D+ IRTS I
Sbjct: 60 ETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDD---IRTSFSASIM 116
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-SQ 135
E + + IEE+IA+ T +LRY GQKY++H+D FD E S
Sbjct: 117 YGE--TSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGSN 163
Query: 136 RVASFLVYLTDLE--EGGETMFPFENGMN-----ADGSYDYQKC---IGLKVKPRQGDGL 185
R+A+ L+YL+D++ GGET P ++ DG Y +C +G+ ++PR+GD L
Sbjct: 164 RMATVLMYLSDVDPAAGGETALPLAEPLDPHKQSVDGQ-GYSQCAARMGISIRPRKGDVL 222
Query: 186 LFYSLLPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQ 222
LF+ + P G I D ++H SCP G KW ATKWI ++
Sbjct: 223 LFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNK 260
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 62/140 (44%), Positives = 88/140 (62%), Gaps = 4/140 (2%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 147
IE +IA T +P NGE +L Y +G+KY HYD F + QRVA+ L+YL+D+
Sbjct: 14 IERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQRVATVLMYLSDV 73
Query: 148 EEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
EEGGET+FP + + D +C GL +KP+ GD LLF+S+ P+ T+D +S+HG
Sbjct: 74 EEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDASSLHG 133
Query: 204 SCPVVKGEKWVATKWIRDQE 223
CPV+ G KW +TKW+ +E
Sbjct: 134 GCPVIVGNKWSSTKWMHLEE 153
>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
Length = 1036
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 69/121 (57%), Positives = 82/121 (67%), Gaps = 4/121 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
Q LSW PR Y PNFAT +QC+++I+MAK L+PSTLALRK ET R+ +
Sbjct: 798 QGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRK-ETKHFQMQYRS---LHQH 853
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
EDESG L IEEKIA T P+ E+FNILRY++GQKY+SHYDAF EYGP SQR
Sbjct: 854 TDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQR 913
Query: 137 V 137
V
Sbjct: 914 V 914
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 116/207 (56%), Gaps = 29/207 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 80
PR + F N +PE+C +II+ A+ + R T+A R G E V++ RTS+G+F +
Sbjct: 122 PRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD---RTSNGMFFQ--RE 176
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 135
E+ + +E +IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 177 ENPVVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQ 236
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RVA+ ++YL D E+GG T FP + L+V PR+G+ + F P+ +
Sbjct: 237 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 281
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQ 222
++HG PVV G+KW+ATKW+R++
Sbjct: 282 T--RTLHGGAPVVAGDKWIATKWLRER 306
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 125/219 (57%), Gaps = 28/219 (12%)
Query: 5 QAGDDSVT-NIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETV 61
GD VT + Q++S + P + N + E+C+S+I ++K +++ S + + V
Sbjct: 22 HIGDTIVTEDREIQIISRVEEPLIVVLENVLSDEECESLIELSKDSMKRSKIGASR--EV 79
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
DN IRTSSG F+ +E+ T+ +IE++++ + +P +GE +IL+Y GQ+Y +HY
Sbjct: 80 DN---IRTSSGTFL----EENETVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHY 132
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 181
D F + + R+++ ++YL D+EEGGET FP + L + P++
Sbjct: 133 DYFAEHSRAAE-NNRISTLVMYLNDVEEGGETFFP---------------KLNLSIAPKK 176
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G + F + +++ ++HG PV+KGEKWVAT+W++
Sbjct: 177 GSAVYFEYFYNDKSLNELTLHGGAPVIKGEKWVATQWMK 215
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 116/217 (53%), Gaps = 16/217 (7%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ +SW+PRA + F + +C +I +A L S + K + VD IRTS I
Sbjct: 38 IETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDP---IRTSYSASI 94
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGPQKS 134
+E+ + IE +IA+ T LPR + E +LRY GQKY++H+D FD + G
Sbjct: 95 --GYNETDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWFDETETGGTGGG 152
Query: 135 QRVASFLVYLTDLE--EGGETMFPFE-------NGMNADGSYDYQKCIGLKVKPRQGDGL 185
R+A+ L+YL+D+E GGET P G+ G + +G+ V+P++GD L
Sbjct: 153 NRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGRGYSECASKMGISVRPKKGDVL 212
Query: 186 LFYSLLPNG-TIDPTSIHGSCPVVKGEKWVATKWIRD 221
LF+ + P G D ++H SCP G KW ATKWI +
Sbjct: 213 LFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIHN 249
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 123/223 (55%), Gaps = 16/223 (7%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 65
A D + + F+ +SW PRA NFA+ E+ +I +A+ LR ST+ +GE+V
Sbjct: 22 AIDTAAAHPWFEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSRGESV--VD 79
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
RTS G+FI DE + +E+++A T + E +LRY Q+Y +H+D+ D
Sbjct: 80 NYRTSYGMFIRRHHDE--VVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLD 137
Query: 126 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN-----ADGSYDYQKCIGLKVKPR 180
S R A+ L+YL+D+E GGET FP ++ A G + + +KP+
Sbjct: 138 ------DDSPRTATVLIYLSDVESGGETTFPNSEWIDPALPKALGPFSECAQGHVAMKPK 191
Query: 181 QGDGLLFYSLLPNG-TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+GD ++F+SL P+G + D ++H +CPV+ G K+VA WI +
Sbjct: 192 RGDAIVFHSLNPDGRSHDQHALHTACPVIVGVKYVAIFWIHTK 234
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 92/146 (63%), Gaps = 5/146 (3%)
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
A +DE T IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R
Sbjct: 191 ATQDEVVTR--IEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 137 VASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
+A+ L+YL+++E+GGET+FP G D ++ G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWI 219
T D S+HGSCPV++G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 68/164 (41%), Positives = 99/164 (60%), Gaps = 9/164 (5%)
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
T IRTS GVF+ EDE + +EE+IA T++P NGE +LRY+ QKY++H+D
Sbjct: 35 TSNIRTSYGVFLDRGEDE--IVKRVEERIAAWTLMPVGNGEGLQVLRYQKEQKYDAHWDY 92
Query: 124 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD--YQKCI--GLKVKP 179
F ++ R A+ L+YL D EEGGET+FP + A G + + +C L KP
Sbjct: 93 FFHKDGITNGGNRYATVLMYLVDTEEGGETVFP---NVAAPGGENVGFSECARYHLAAKP 149
Query: 180 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
++G +LF+S+ P G ++ S+H +CPV++G KW A KWI E
Sbjct: 150 KKGTAILFHSIKPTGELERKSLHTACPVIRGIKWSAAKWIHHAE 193
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/146 (45%), Positives = 92/146 (63%), Gaps = 5/146 (3%)
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
A +DE + IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 137 VASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
+A+ L+YL+++E+GGET+FP G D ++ G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWI 219
T D S+HGSCPV++G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 111/206 (53%), Gaps = 23/206 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + F NF +C +++ M++ NL PS + + + + RTS G A E+
Sbjct: 103 PNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFE-LKPSRTSGGTHF--ARGET 159
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRV 137
+ IE +IA + +P +GE IL Y + +Y HYD FDP++ G Q+ QRV
Sbjct: 160 PLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQRV 219
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
+ ++YL+D+E GG T+FP +GL+V+P++G L F + +G +D
Sbjct: 220 GTLIMYLSDVESGGATVFP---------------RVGLEVQPQKGAALFFSYVGEHGKLD 264
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQE 223
S+HG PV+ GEKW+ATKW+R E
Sbjct: 265 LQSLHGGSPVLAGEKWIATKWLRAAE 290
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 116/207 (56%), Gaps = 29/207 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 80
PR + F N +PE+C +II+ A+ + R T+A R G E V++ RTS+G+F +
Sbjct: 111 PRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD---RTSNGMFFQ--RE 165
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 135
E+ + +E +IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 166 ENPMVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQ 225
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RVA+ ++YL D E+GG T FP + L+V PR+G+ + F P+ +
Sbjct: 226 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 270
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQ 222
++HG PVV G+KW+ATKW+R++
Sbjct: 271 T--RTLHGGAPVVAGDKWIATKWLRER 295
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 119/228 (52%), Gaps = 38/228 (16%)
Query: 6 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 62
AGD V QVL M PR + F N +PE+C+ +I A++ L R T+ R G V
Sbjct: 78 AGDRRV-----QVLQTMRHPRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVL 132
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
N RTS G+F E++ + +E++IA + P GE ILRY G +Y HYD
Sbjct: 133 NVD--RTSEGMFFERGEND--IVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYD 188
Query: 123 AFDPQEYG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 177
FDP E G + QRVA+ ++YL + +GG T FP +GL+V
Sbjct: 189 YFDPGEPGTPTILKRGGQRVATLVMYLQEPGQGGATTFP---------------DVGLEV 233
Query: 178 KPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 223
P +G G+ F P DP + +HG PV+ GEKWVATKW+R++E
Sbjct: 234 APVRGTGVFFSYEEP----DPATRTLHGGAPVLAGEKWVATKWLRERE 277
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 119/228 (52%), Gaps = 38/228 (16%)
Query: 6 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 62
AGD V QVL M PR + F N + E+C+ +I A++ L R T+ R G V
Sbjct: 79 AGDRQV-----QVLQTMRHPRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVL 133
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
N RTS G+F E+E + +E+++A + P GE ILRY G +Y HYD
Sbjct: 134 NVD--RTSDGMFFERGENE--IVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYD 189
Query: 123 AFDPQEYG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 177
FDP E G + QRVA+ ++YL + E+GG T FP +GL+V
Sbjct: 190 YFDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFP---------------DVGLEV 234
Query: 178 KPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 223
P +G G+ F P DP + +HG PV+ GEKWVATKW+R++E
Sbjct: 235 APVRGTGVFFSYDRP----DPVTRTLHGGAPVLAGEKWVATKWLRERE 278
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 109/204 (53%), Gaps = 19/204 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P L+ F + E+C +I M++ L+PST+ K G RTS G+ E+E
Sbjct: 109 PFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATG-RTSKGMSFYLQENE- 166
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-QRVASFL 141
+ +E++IA++ P NGE +L Y IG++Y SH+D F + P+K QRV +FL
Sbjct: 167 -FIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGTFL 225
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL D+ GGET+FP G+ + P++G + F G +D S+
Sbjct: 226 IYLNDVPAGGETVFP---------------KAGVSIVPKKGSAVYFQYGNSKGEVDRMSL 270
Query: 202 HGSCPVVKGEKWVATKWIRDQEQY 225
H S PV +GEKWVATKWIR + Y
Sbjct: 271 HSSIPVSEGEKWVATKWIRQENIY 294
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 118/228 (51%), Gaps = 38/228 (16%)
Query: 6 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 62
AGD V QVL M PR + F N + E+C+ +I A++ L R T+ R G V
Sbjct: 79 AGDRRV-----QVLQTMRHPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVL 133
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
N RTS G+F E+E + +E++IA + P GE ILRY G +Y HYD
Sbjct: 134 NVD--RTSDGMFFERGENE--IVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYD 189
Query: 123 AFDPQEYG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 177
FDP E G + QRVA+ ++YL + E GG T FP +GL+V
Sbjct: 190 YFDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEV 234
Query: 178 KPRQGDGLLFYSLLPNGTIDPT--SIHGSCPVVKGEKWVATKWIRDQE 223
P +G G+ F P DP ++HG PV+ GEKWVATKW+R++E
Sbjct: 235 APARGCGVFFSYDRP----DPVTRTLHGGAPVLAGEKWVATKWLRERE 278
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/237 (33%), Positives = 118/237 (49%), Gaps = 28/237 (11%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
HG D ++ + +S PRA F F T +C +I A + S E D
Sbjct: 55 HGVDAADGGSSGWIEKISDSPRAYVFREFLTDAECDRVIERAYPTMEAS-------EVTD 107
Query: 63 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 116
+ G R+S G ++S +DE + IE + + MLP GE +LRY+ GQK
Sbjct: 108 DDSGEARPDDARSSIGGWVSGDDDE--VIRNIELRASTWAMLPMNRGETMQVLRYEKGQK 165
Query: 117 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPF----------ENGMNADGS 166
Y++H D F + QRVA+ L+YL+D+EEGGET+FP ++G+ D +
Sbjct: 166 YDAHDDFFHDEHNVKNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNA 225
Query: 167 YDYQKCIG---LKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+ L VKPR+GD LLF++ +G +D + H CPV +G KW T+W R
Sbjct: 226 CELASQNDPRVLAVKPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHR 282
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 121/217 (55%), Gaps = 27/217 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKL-NLRPSTLAL-RKGETVDNTQGIRTSSGVFIS 76
+SW PRA + F + +C +I++AK + ST+ GE+V T +RTSSG+F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESV--TSKVRTSSGMFLD 102
Query: 77 AAEDESGTLDLIEEKIAKVTMLP-----------------RINGEAFNILRYKIGQKYNS 119
+DE + IEE+IA TMLP NGE+ ILRY G+KY
Sbjct: 103 KKQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEP 160
Query: 120 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKV 177
H+D ++ ++ RVA+ L+YL++++ G +++ P + ++ + C G V
Sbjct: 161 HFDYISGRQGSTREGDRVATVLMYLSNVKMG-DSLLP-QARLSQPKDETWSDCAEQGFAV 218
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 214
KP +G +LF+SL PN T+D S+HGSCPV++GEK V
Sbjct: 219 KPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKVV 255
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 118/212 (55%), Gaps = 35/212 (16%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-RTSSGVFISA 77
+S PR P+F +P +C+ +I+++K LRP + + G+ R+ G+F+
Sbjct: 29 MSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCN---------EISSGVHRSGWGLFMKE 79
Query: 78 AEDESGTLDLIEEKIAKVTMLPRI--NGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQK- 133
E++ D++++ ++ ML + N E ++RY G++ ++HYD F+P G K
Sbjct: 80 GEEDH---DVVKKIFQRMKMLVNLTENCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKI 136
Query: 134 ---SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRV + L+YL+++EEGGET FP +G+KVKP +GD +LFY+
Sbjct: 137 GLYGQRVCTILMYLSEVEEGGETSFPE---------------VGVKVKPVKGDAVLFYNC 181
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
PNG +DP S+H PV+KG KWVA K I +
Sbjct: 182 KPNGEVDPLSLHQGDPVIKGTKWVAIKLINQK 213
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 114/209 (54%), Gaps = 27/209 (12%)
Query: 16 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
Q++S + P + N + E+C+++I M+K ++ S + + + T IRTSSG
Sbjct: 33 IQIISRLEEPLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR-----KTNDIRTSSGA 87
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F+ +ES IE +IA + +P +GE IL+Y +GQ+Y +HYD F +
Sbjct: 88 FL----EESEITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAAS 142
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
+ R+++ ++YL +EEGGET FP + L V P++G + F +
Sbjct: 143 NNRMSTLVMYLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQD 187
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+I+ ++HG PV+KGEKWVAT+W+R +
Sbjct: 188 ESINKLTLHGGAPVIKGEKWVATQWMRRR 216
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 114/209 (54%), Gaps = 27/209 (12%)
Query: 16 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
Q++S + P + N + E+C+++I M+K ++ S + + + T IRTSSG
Sbjct: 33 IQIISRLEEPLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR-----KTNDIRTSSGA 87
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F+ +ES IE +IA + +P +GE IL+Y +GQ+Y +HYD F +
Sbjct: 88 FL----EESEITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAAS 142
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
+ R+++ ++YL +EEGGET FP + L V P++G + F +
Sbjct: 143 NNRMSTLVMYLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQD 187
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+I+ ++HG PV+KGEKWVAT+W+R +
Sbjct: 188 ESINKLTLHGGAPVIKGEKWVATQWMRRR 216
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 90/149 (60%), Gaps = 5/149 (3%)
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
I +DE + IEE+I+ T LP NGEA IL Y+ G+KY HYD F +
Sbjct: 5 ILTCQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGG 62
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLL 191
R+A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL
Sbjct: 63 HRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLH 122
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+ T D S+HGSCP ++G+KW ATKWI
Sbjct: 123 PDSTTDSDSLHGSCPAIEGQKWSATKWIH 151
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 64/136 (47%), Positives = 85/136 (62%), Gaps = 3/136 (2%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 147
IEE+I+ T LP NGEA IL Y+ G+KY HYD F + R+A+ L+YL+++
Sbjct: 10 IEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMYLSNV 69
Query: 148 EEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
E+GGET+FP G D + C G VKP +GD LLF+SL P+ T D S+HGS
Sbjct: 70 EKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGS 129
Query: 205 CPVVKGEKWVATKWIR 220
CP ++G+KW ATKWI
Sbjct: 130 CPAIEGQKWSATKWIH 145
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 119/223 (53%), Gaps = 29/223 (13%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
V+SW PRA NF T ++ I ++A++++R ST+ G +V RTS G FI+
Sbjct: 3 VISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+ + +E+++A +T +P E +LRY GQ Y+ H D+ + S R+
Sbjct: 61 YA--TPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLE------NDSPRL 112
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIG---------LKVKPRQGDGLLFY 188
A+ L+YL+D E GGET FP A D K G + KPR+GD LLF+
Sbjct: 113 ATVLLYLSDPELGGETAFPL-----AWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFW 167
Query: 189 SLLPNG-TIDPTSIHGSCPVVKGEKWVATKWIRDQ----EQYD 226
S+ P+G T DP S H CPV++G KW AT W+ + E++D
Sbjct: 168 SVKPDGKTEDPLSEHEGCPVIRGVKWTATVWVHTKPFRPEEWD 210
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 126 bits (317), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 107/204 (52%), Gaps = 19/204 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P L+ + E+C +I++++ L+PS L + +G + RTS + E+E
Sbjct: 96 PFVLHLDQVLSSEECDELISLSRSRLQPS-LVVDRGSGEERAGSGRTSKSMAFRLKENE- 153
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-QRVASFL 141
++ IE +IA++T P NGE IL Y +G++Y H+D F P K QRV +FL
Sbjct: 154 -LVERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKGGQRVGTFL 212
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL D+E+GGET+F GL P++G + F+ G +D S+
Sbjct: 213 IYLNDVEDGGETVFS---------------KAGLSFVPKKGAAIYFHYGNAQGQLDRLSV 257
Query: 202 HGSCPVVKGEKWVATKWIRDQEQY 225
H S PV KGEKW ATKWIR+ Y
Sbjct: 258 HSSVPVRKGEKWAATKWIRESNIY 281
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 114/209 (54%), Gaps = 15/209 (7%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
++W PR + NF T + K +I +A ++ ST+ G++V++ RTS G F+
Sbjct: 4 VAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDN--YRTSYGTFLKRY 61
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+DE ++ IE ++A T +P + E ILRY +GQ+Y H D +E G RVA
Sbjct: 62 QDE--IVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDEEAG----VRVA 115
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGS----YDYQKCIGLKV--KPRQGDGLLFYSLLP 192
+ L+YL + + GGET FP +N + ++ C V P++GD LLF+S+ P
Sbjct: 116 TVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWSINP 175
Query: 193 NG-TIDPTSIHGSCPVVKGEKWVATKWIR 220
+G T D + H CPV+ G KW ATKWI
Sbjct: 176 DGNTEDTHASHTGCPVLSGVKWTATKWIH 204
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 125 bits (315), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 111/220 (50%), Gaps = 24/220 (10%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ LSW PRA + NF + E+ K ++++ + + ST+ G IRTS G FI
Sbjct: 68 IEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVV---GGQTGRVSDIRTSFGTFI 124
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
DE L+ IE++ A + +P + E +LRY+ GQKY+ H D + G +
Sbjct: 125 PKKYDE--VLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISENGG----K 178
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMN-------------ADGSYDYQKCIGLKVKPRQG 182
R+A+ L++L + EGGET F N + +D Y K G VKP+ G
Sbjct: 179 RIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDCGYRSGK--GFAVKPKVG 236
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
D +LF+S G D S+H SCP + G KW AT WI ++
Sbjct: 237 DAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHER 276
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P ++GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 83/225 (36%), Positives = 119/225 (52%), Gaps = 32/225 (14%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNT 64
AGD VT + P F N + +C+++I A+ L R T+ +R G N
Sbjct: 76 AGDKWVTVREHRS---APELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGGEELNH 132
Query: 65 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 124
RTS G+F + E+E + IE +IA++ P NGE +LRY+ G +Y HYD F
Sbjct: 133 D--RTSHGMFYTRGENE--VIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYF 188
Query: 125 DPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 179
DP E G + QRVAS ++YL + EGG T+FP IGLKV+P
Sbjct: 189 DPGEPGTAAILRRGGQRVASLIMYLREPGEGGATVFP---------------DIGLKVRP 233
Query: 180 RQGDGLLF-YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+QG + F Y+L ++ ++HG PV GEKW+ATKW+R++E
Sbjct: 234 QQGSAVFFSYALAHPASL---TLHGGEPVKSGEKWIATKWLRERE 275
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 118/228 (51%), Gaps = 38/228 (16%)
Query: 6 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 62
AGD V QVL M PR + F N + E+C+ +I A++ L R T+ R G V
Sbjct: 79 AGDRRV-----QVLQTMRHPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVL 133
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
N RTS G+F E+E + +E+++A + P GE ILRY G +Y HYD
Sbjct: 134 NVD--RTSDGMFFERGENE--IVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYD 189
Query: 123 AFDPQEYG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 177
FDP E G + QRVA+ ++YL + E GG T FP +GL+V
Sbjct: 190 YFDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEV 234
Query: 178 KPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 223
P +G G+ F P DP + +HG PV+ GEKWVATKW+R++E
Sbjct: 235 APVRGCGVFFSYDRP----DPVTRTLHGGAPVLAGEKWVATKWLRERE 278
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 112/212 (52%), Gaps = 37/212 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 76
PR + F N + E+C +II A RP +R+ TVDN G RTS+G+F
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAA----RPR---MRRSLTVDNQSGGEAVNDDRTSNGMFFQ 164
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 131
E+E + L+E++IA++ P NGE +L Y+ G +Y HYD F P E G
Sbjct: 165 RGENE--LISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILK 222
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ QRV + ++YL + GG T FP +GL+V PR+G+ + F
Sbjct: 223 RGGQRVGTLVMYLNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNR 267
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ ++HG PV++GEKW+ATKW+R++E
Sbjct: 268 PDPATK--TLHGGAPVLEGEKWIATKWLRERE 297
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 113/221 (51%), Gaps = 25/221 (11%)
Query: 7 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
G D I F++ S P+ F T ++C +++ +++ L S + + +N
Sbjct: 91 GGDRQVPILFRLAS--PQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDTGDENLID 147
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS G AE + IE +IA VT +P +GE IL YK G +Y H+D F+P
Sbjct: 148 ARTSMGAMFQVAEH--ALIARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNP 205
Query: 127 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 181
Q G + QR+A+ ++YL E GG T FP +GL+V P +
Sbjct: 206 QRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVGLEVAPVK 250
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
G+ + F LLP+GT+D ++H PV GEKW+ATKW+R++
Sbjct: 251 GNAVYFSYLLPDGTLDERTLHAGLPVASGEKWIATKWLRER 291
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K N++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDKLIELSKNNMKRSKVGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 115/215 (53%), Gaps = 26/215 (12%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 67
DD V + VL P + F + ++C+ +I A L+ S L + V + I
Sbjct: 17 DDGV--VEATVLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLV---NKVVSD---I 68
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
RTS G+F E+ES + IE +IA++ +P + E +L Y GQ+Y +H+D F P
Sbjct: 69 RTSRGMFFE--EEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPG 126
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
++ R+++ +VYL D+EEGGET+FP +G+ +KP++G L F
Sbjct: 127 SPAA-RNNRISTLIVYLNDVEEGGETVFPL---------------LGIAMKPKRGAALYF 170
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
N ++ ++H S PVV+GEKWVAT+W+R Q
Sbjct: 171 EYFYRNQALNDLTLHSSVPVVRGEKWVATQWMRRQ 205
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 96/156 (61%), Gaps = 4/156 (2%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTSSG+F+S+ E +S IE++I+ + +P NGE +LRY+ Q Y H+D F
Sbjct: 11 VRTSSGMFLSSEERKSPMA--IEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ QRVA+ L+YL+D EGGET FP + + S + GL VKP +GD +L
Sbjct: 69 TFNLKRGGQRVATMLMYLSDNVEGGETYFPMAG--SGECSCGGKIVKGLSVKPIKGDAVL 126
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
F+S+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 127 FWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQR 162
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKNNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------QLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F N +PE+C++II+ A+ + S L ++ + RTS G+F E+
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQ--RGET 158
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQRV 137
+ +EE+IA++ P NGE +L Y+ G +Y HYD FDP + G + QRV
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ ++YL + +GG T FP + L+V PRQG+ + F P+ +
Sbjct: 219 ATLVIYLNNPRKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQE 223
++HG V++GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 78/223 (34%), Positives = 123/223 (55%), Gaps = 17/223 (7%)
Query: 7 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQ 65
GD V I + LSW PRA N + ++I+ +A+ + R + + G++V N
Sbjct: 2 GDARV--IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNP- 58
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
IRTS F+S + + + E+++ VT LP + E +L Y G+KY++H D +
Sbjct: 59 -IRTSKQTFLSRNDP---VVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGE 114
Query: 126 P-QEYGPQKSQ----RVASFLVYLTDLEEGGETMFPFENGMNAD--GSYDYQKCIGLKV- 177
+ G Q S+ RVA+ L+YL + EEGGET FP ++ + + + KC +V
Sbjct: 115 EGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVA 174
Query: 178 -KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
KP +GDGL+F+S+ P+GTID ++H CP +G KW AT W+
Sbjct: 175 MKPTRGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWV 217
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKNNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 113/221 (51%), Gaps = 25/221 (11%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 65
AGD + F + S PR F P++C+++I +++ L S + + +N
Sbjct: 112 AGDGRDVPVLFAIES--PRIALFQRLLMPDECEALIALSRGRLARSPV-VNPDTGDENLI 168
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
RTS G E ++ +E +IA VT +P +GE IL YK G +Y HYD F+
Sbjct: 169 DARTSMGAMFQVGEHP--LIERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFN 226
Query: 126 PQEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
PQ G + QR+A+ ++YL D+ GG T FP +GL+V P
Sbjct: 227 PQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP---------------KLGLRVNPV 271
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
QG+ + F L +G++D ++H PV +GEKW+ATKW+R+
Sbjct: 272 QGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKWIATKWLRE 312
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 113/221 (51%), Gaps = 25/221 (11%)
Query: 7 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
G D I F++ S P+ F + ++C +++ +++ L S + + +N
Sbjct: 90 GGDRRVPILFRLAS--PQVQLFQQLLSDDECDALVALSRGRLARSPV-VNPDTGDENLID 146
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS G AE + IE +IA VT +P +GE IL YK G +Y H+D F+P
Sbjct: 147 ARTSMGAMFQVAE--HALIARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNP 204
Query: 127 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 181
Q G + QR+A+ ++YL E GG T FP +GL+V P +
Sbjct: 205 QRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVGLEVAPVK 249
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
G+ + F LLP+GT+D ++H PV GEKW+ATKW+R++
Sbjct: 250 GNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKWIATKWLRER 290
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ E S
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 110/215 (51%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQG 145
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEA 203
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 112/212 (52%), Gaps = 37/212 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 76
PR + F N + E+C +II A RP +R+ TVDN G RTS+G+F
Sbjct: 119 PRVVVFGNLLSDEECDAIIAAA----RPR---MRRSLTVDNQSGGEAVNDDRTSNGMFFQ 171
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 131
E++ + L+E++IA++ P NGE +L Y+ G +Y HYD F P E G
Sbjct: 172 RGEND--LISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILK 229
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ QRV + ++YL + GG T FP +GL++ PR+G+ + F
Sbjct: 230 RGGQRVGTLVMYLNEPARGGATTFP---------------DVGLQIVPRRGNAVFFSYNR 274
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ ++HG PV++GEKW+ATKW+R++E
Sbjct: 275 PDPATK--TLHGGAPVLEGEKWIATKWLRERE 304
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 2/156 (1%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTSSG+F+S E + IE++I+ + +P NGE +LRY+ Q Y H+D F
Sbjct: 6 FRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSD 65
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ QR+A+ L+YL++ EGGET FP + + S + GL VKP +GD +L
Sbjct: 66 TFNLKRGGQRIATMLMYLSENIEGGETYFP--KAGSGECSCGGKTVPGLSVKPAKGDAVL 123
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
F+S+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 124 FWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQK 159
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 110/215 (51%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQG 145
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEA 203
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 110/215 (51%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQG 145
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEA 203
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C+ +I ++K ++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECEELIELSKNKMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + + + IRTSSG F+ ED
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFL---EDSE 90
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
TL IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 91 LTLK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ E S
Sbjct: 39 PLIVVLGNVISDEECGELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K ++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 87 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 144
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 145 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEA 202
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 203 RQLDVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 247
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 282
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 145
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEA 203
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F N +PE+C++II+ A+ + S L ++ + RTS G+F E+
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQ--RGET 158
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQRV 137
+ +EE+IA++ P NGE +L Y+ G +Y HYD FDP + G + QRV
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ ++YL + +GG T FP + L+V PRQG+ + F P+ +
Sbjct: 219 ATLVIYLNNPLKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQE 223
++HG V++GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S + + N IRTSSG F+ E S
Sbjct: 77 PLIVVLANVLSDEECDELIELSKNKMERSKIGSSR-----NVNDIRTSSGAFLEENEFTS 131
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y + Q+Y +HYD F + + R+++ ++
Sbjct: 132 K----IEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYF-AEHSRSAANNRISTLVM 186
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 187 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 231
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 232 GGAPVTKGEKWIATQWMR 249
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K ++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKNKMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + N + E+C +II AK L S L ++ + RTSSG+F + ++
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARS-LTVQTATGGEELNADRTSSGMFFT--RGQT 165
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQRV 137
+ +E +IA++ P NGE +L Y+ G +Y HYD FDP+E G + QRV
Sbjct: 166 PEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRV 225
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ ++YL + GG T FP +GL+V P +G + F P+ T
Sbjct: 226 ATLVMYLNEPARGGGTTFP---------------DVGLEVAPVKGSAVFFSYDRPHPTTR 270
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQE 223
S+HG PV++GEKWVATKW+R++E
Sbjct: 271 --SLHGGAPVLEGEKWVATKWLRERE 294
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 145
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEA 203
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ ED
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFL---EDSE 122
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
TL IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 123 LTLK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 79 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 136
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 137 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEA 194
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 195 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 239
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 240 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 274
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ ED
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFL---EDSE 122
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
TL IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 123 LTLK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 108/212 (50%), Gaps = 31/212 (14%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-RTSSGVFISA 77
LS PR P F T E+C +I+ +K LRP + + G+ R+ G+F+
Sbjct: 28 LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQK--- 133
E+E I K+ + + E I+RY G++ ++HYD F+P G K
Sbjct: 79 GEEEHPVTKNIFNKMKNFVNISD-SCEVMQIIRYNPGEETSAHYDYFNPLTTNGSMKIGL 137
Query: 134 -SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
QR+ + L+YL D+EEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFPE---------------VGIKVKPIRGDAVLFYNCKP 182
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NG +DP S+H PV KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVTKGTKWVAIKLINQKSK 214
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 91 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 148
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 149 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEA 206
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 207 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 251
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 252 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 286
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 114/197 (57%), Gaps = 31/197 (15%)
Query: 16 FQVLSWMPRALYFPNF--------ATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQG 66
+V++ PRA + NF T E+C+ +I++AK ++ S +R T +
Sbjct: 55 LEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARS--KVRNAITGLGEESS 112
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTSSG F+ D+ + IE++I++ T +P NGEA ++ Y++GQK+ H+D F
Sbjct: 113 SRTSSGTFLRKGHDK--IVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
QR+A+ L+YL+D+++GGET+FP G+ + G+ V+P++GD LL
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFPEAKGIKSKK--------GVSVRPKKGDALL 212
Query: 187 FYSLLPNGTIDPTSIHG 203
F+S+ P+G+ DP+S HG
Sbjct: 213 FWSMRPDGSQDPSSKHG 229
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMERSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGE--ENLISARTSQG 145
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEA 203
Query: 133 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL + GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDKTLHAGLPVERGEKWIATKWLRER 283
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 114/197 (57%), Gaps = 31/197 (15%)
Query: 16 FQVLSWMPRALYFPNF--------ATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQG 66
+V++ PRA + NF T E+C +I++AK ++ S +R T +
Sbjct: 88 LEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARS--KVRNALTGLGEESS 145
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTSSG FI + D+ + IE++I++ T +P+ NGE ++ Y++GQK+ H+D F
Sbjct: 146 SRTSSGTFIRSGHDK--IVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
QR+A+ L+YL+D+++GGET+FP G+ + G+ V+P++GD LL
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFPEAKGIKSKK--------GVSVRPKKGDALL 245
Query: 187 FYSLLPNGTIDPTSIHG 203
F+S+ P+G+ DP+S HG
Sbjct: 246 FWSMRPDGSRDPSSKHG 262
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G V KGEKW+AT+W+R
Sbjct: 194 GGASVTKGEKWIATQWVR 211
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMERSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/219 (33%), Positives = 115/219 (52%), Gaps = 27/219 (12%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ LSW PRA + NF T ++C+ +I + + L ST+ KG+ D RTS G FI
Sbjct: 91 WTTLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGD-VHSARTSFGTFI 149
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ + TL +E+++A+ + +P + E +LRY+ GQ+Y + +
Sbjct: 150 T--RRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGN-------------GEK 194
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGS-----------YDYQKCIGLKVKPRQGDG 184
R+A+ L++L + E GGET FP + A S + + G V PR+GD
Sbjct: 195 RIATVLMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEGRGFSVIPRKGDA 254
Query: 185 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+LF+S NGT D + H SCP ++G K+ ATKWI ++E
Sbjct: 255 ILFFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKE 293
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K S LA K + + IRTSSG F+ ED
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL---EDNE 90
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
T+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 91 LTVK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 110/212 (51%), Gaps = 31/212 (14%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-RTSSGVFISA 77
LS PR P F T E+C+ +I+ +K LRP + + G+ R+ G+F+
Sbjct: 28 LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQK--- 133
E++ I K+ + + E ++RY G++ +SH+D F+P G K
Sbjct: 79 GEEDHQITKNIFNKMKSFVNISE-SCEVMQVIRYNQGEETSSHFDYFNPLTTNGSMKIGL 137
Query: 134 -SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
QRV + L+YL D+EEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFPE---------------VGIKVKPIKGDAVLFYNCKP 182
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NG +DP S+H PV+KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVLKGNKWVAIKLINQKSK 214
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 112/221 (50%), Gaps = 25/221 (11%)
Query: 7 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
G D I F++ S P+ F T ++C +++ +++ L S + + +N
Sbjct: 91 GGDRQVPILFRLAS--PQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDTGDENLID 147
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS G AE + IE +IA VT +P +GE IL YK G +Y H+D F+P
Sbjct: 148 ARTSMGAMFQVAEHP--LITRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNP 205
Query: 127 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 181
Q G + QR+A+ ++YL E GG T FP +GL+V P +
Sbjct: 206 QRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVGLEVAPVK 250
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
G+ + F LLP+G +D ++H PV GEKW+ATKW+R++
Sbjct: 251 GNAVYFSYLLPDGALDERTLHAGLPVAFGEKWIATKWLRER 291
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMERSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 19/225 (8%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 67
DD + + LSW PR NF + E+C+ +I + + L ST+ +
Sbjct: 28 DDVERSKVVETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVV--NSDESGAVSTA 85
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
RTS G F++ E TL +E+++AK + +P + E +LRY+ GQ+Y +H+D +
Sbjct: 86 RTSFGTFVTRRLTE--TLQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISE 143
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFP-----------FENGMNADGSYDYQKCIGLK 176
G +R+A+ L++L + GGET FP F + + G
Sbjct: 144 NGG----KRIATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDKLSECGWNDGNGFS 199
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
V P++G+ +LF+S NGT DP + H SCP + G K+ ATKWI +
Sbjct: 200 VIPKKGEAVLFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHE 244
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ E S
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G V KGEKW+AT+W+R
Sbjct: 194 GGASVTKGEKWIATQWVR 211
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 106/212 (50%), Gaps = 35/212 (16%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 76
PRA+ F N + ++C +I ++K L LR G VD+ G RTSSG F
Sbjct: 100 PRAILFGNVLSHDECDQLIALSKTKL------LRSG-VVDHQTGNTKLHEHRTSSGTFFH 152
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK--- 133
+ + +I++++A + +P +GE IL Y++G +Y HYD F P G K
Sbjct: 153 --RGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLA 210
Query: 134 --SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
QR A+ ++YL D++ GGET+FP GL + P +G + F
Sbjct: 211 RGGQRTATLIIYLNDVDGGGETIFPRN---------------GLSIVPAKGSAIYFSYTN 255
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+D S HG PV++GEKW+ATKW+R E
Sbjct: 256 AENQLDSLSFHGGSPVIEGEKWIATKWVRQNE 287
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 111/206 (53%), Gaps = 27/206 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDE 81
PR + F + + ++C+ +I +AK L R T+A + G N RTSSG+F E+E
Sbjct: 99 PRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNED--RTSSGMFFQRGENE 156
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQR 136
+ IE +IA++ P NGE +L Y+ G +Y HYD FDP E G + QR
Sbjct: 157 --LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQR 214
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
V + ++YL + E+GG T FP + L+V P++G G+ F P+ +
Sbjct: 215 VGTLVMYLGEPEKGGGTTFP---------------DVHLEVAPKRGHGVFFSYERPHPST 259
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQ 222
++HG PV+ GEKW+ATKW+R++
Sbjct: 260 --RTLHGGAPVLAGEKWIATKWLRER 283
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +I+ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 25/216 (11%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSS 71
IP PR + F +F + E+C +I + + L+ S + GE +N RTS
Sbjct: 87 EIPILFAIETPRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGE--ENLISARTSQ 144
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
G E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 145 GAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGE 202
Query: 132 QKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ QRVA+ ++YL + GG T FP +GL+V P +G+ +
Sbjct: 203 ARQLEVGGQRVATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGNAVF 247
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
F P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 FVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDGLIELSKNKIKRSKIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKWVAT+W+R
Sbjct: 194 GGAPVTKGEKWVATQWVR 211
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 113/208 (54%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 80
PR + F N +PE+C ++I A+ L R T+A + G E +++ RTS G+F
Sbjct: 115 PRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDD---RTSDGMFFQ--RG 169
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 135
+S + IEE+IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 170 QSPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQ 229
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RV + ++YL E+GG T FP + L+V P++G+ + F P+ +
Sbjct: 230 RVGTLVMYLNTPEKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPS 274
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQE 223
++HG PV+ GEKW+ATKW+R++E
Sbjct: 275 T--RTLHGGAPVIAGEKWIATKWLRERE 300
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAVNNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K S LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVISDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K S LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ +
Sbjct: 90 ELTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVX 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGXAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECGELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +I+ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVI 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 81
P + NF T E+C +I +A+ + +T+ GE V + RTS + AE
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQD--RTSMNAAFARAEHP 148
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QR 136
+ +E +IA P NGE +LRY+ G +Y +H+D FD Q G +K+ QR
Sbjct: 149 --LIARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQR 206
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
V +FLVYL D++ GG T FP + +++P++G L F + LPNG
Sbjct: 207 VGTFLVYLCDVDAGGATRFP---------------ALNFEIRPKKGMALFFANTLPNGEG 251
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQ 222
+P ++H PVV G K++A+KW+R++
Sbjct: 252 NPLTLHAGVPVVSGVKYLASKWLREK 277
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K S LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAVNNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 109
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 110 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/231 (32%), Positives = 121/231 (52%), Gaps = 23/231 (9%)
Query: 10 SVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-- 67
S + + LS PR NF + E+C+ II A L PST+ L++G+ + + +
Sbjct: 14 SSRTVEVKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTV-LKQGDQSNGEEKVKD 72
Query: 68 --RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
RTS ++ + + + I +++ ++ +P E +L+Y Q Y+ HYD FD
Sbjct: 73 EVRTSETAWL--MDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFD 130
Query: 126 PQEYGPQKSQ---RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---C-----IG 174
P+ Y + S R+ + YLT +E+GGET+FPF N +A+ + Q C
Sbjct: 131 PKMYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPFGN-TSAEEHHKIQSWGPCENAVESS 189
Query: 175 LKVKPRQGDGLLFYSLLPNG----TIDPTSIHGSCPVVKGEKWVATKWIRD 221
+KVKP +G ++FY + P+G +D TS+HG C + GEKW A WIR+
Sbjct: 190 IKVKPVRGSAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWIRN 240
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 106/206 (51%), Gaps = 26/206 (12%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
VL P + F T ++C+ +I A LR S L + IRTS G+F
Sbjct: 25 VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKV------VSEIRTSRGMFFE- 77
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ-R 136
E+E+ + IE++I+ + +P + E +L Y GQ+Y +HYD F P P S R
Sbjct: 78 -EEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN--SPSASNNR 134
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+++ ++YL D+E GGET+FP + L+VKP +G L F +
Sbjct: 135 ISTLIIYLNDVEAGGETVFPL---------------LDLEVKPERGSALYFEYFYRQQEL 179
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ ++H S PVV+GEKWVAT+W+R Q
Sbjct: 180 NNLTLHSSVPVVRGEKWVATQWMRRQ 205
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 69/215 (32%), Positives = 108/215 (50%), Gaps = 25/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 82 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 139
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 140 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEA 197
Query: 133 K-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 198 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 242
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+G +D ++H PV +GEKW+ATKW+R++
Sbjct: 243 VYKRPDGMLDDNTLHAGLPVERGEKWIATKWLRER 277
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNEFTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K ++ S + + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSR-----DVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDGLIELSKNKIERSKIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKWVAT+W+R
Sbjct: 194 GGAPVTKGEKWVATQWVR 211
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K ++ S + + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSR-----DVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 103/206 (50%), Gaps = 25/206 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-NTQGIRTSSGVFISAAEDE 81
PR L N +C +++ +A+ L+ S + +T D N RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVV--NPDTGDENLIDARTSMGAMFQVGE-- 156
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQR 136
L IE +IA VT P +GE F +L YK G +Y H+D F+P+ G + QR
Sbjct: 157 HALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQR 216
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
VA+ ++YL GG T FP IGL+V P +G+ +LF LP+G +
Sbjct: 217 VATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGAL 261
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQ 222
D ++H PV GEKW+ATKW+R+
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREH 287
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 90/156 (57%), Gaps = 12/156 (7%)
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+FI +D + IE+KIA T LP+ NGE +LRY+ G+KY+ H+D F + +
Sbjct: 1 MFIPKGKD--AIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVR 58
Query: 133 KSQRVASFLVYLTDLEEGGETMFP---------FENGMNADGSYDYQKCIGLKVKPRQGD 183
RVA+ L+YLTD+ +GGET+FP + + D D K G VKP++GD
Sbjct: 59 GGHRVATVLMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAK-RGTAVKPKRGD 117
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
LLF+SL D S+H CPV++GEKW TKWI
Sbjct: 118 ALLFFSLTTQAKPDTRSLHAGCPVIEGEKWSVTKWI 153
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 103/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQGQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 103/206 (50%), Gaps = 25/206 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-NTQGIRTSSGVFISAAEDE 81
PR L N +C +++ +A+ L+ S + +T D N RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVV--NPDTGDENLIDARTSMGAMFQVGE-- 156
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQR 136
L IE +IA VT P +GE F +L YK G +Y H+D F+P+ G + QR
Sbjct: 157 HALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQR 216
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
VA+ ++YL GG T FP IGL+V P +G+ +LF LP+G +
Sbjct: 217 VATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGAL 261
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQ 222
D ++H PV GEKW+ATKW+R+
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREH 287
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 109
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 110 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWMR 227
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVI 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 110/208 (52%), Gaps = 15/208 (7%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
++W PR + NF T + K +I +A ++ ST+ G++V+++ ++GV
Sbjct: 4 VAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDSYRTLYTAGV----R 59
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+ ++ IE ++A T + ++ E ILRY IGQ+Y H D E G RVA
Sbjct: 60 RYQDDVVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDDEAG----VRVA 115
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGS----YDYQKCIGLKV--KPRQGDGLLFYSLLP 192
+ L+YL + E GGET FP +N + ++ C V P++GD LLF+S+ P
Sbjct: 116 TVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWSIGP 175
Query: 193 NGTI-DPTSIHGSCPVVKGEKWVATKWI 219
+GT D + H CPV+ G KW ATKWI
Sbjct: 176 DGTTEDYHASHTGCPVLSGVKWTATKWI 203
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 76/192 (39%), Positives = 109/192 (56%), Gaps = 15/192 (7%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PRA F NF T + ++ +A L+ ST+ GE V + IRTS G+FI D
Sbjct: 61 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGNDGEGVVDE--IRTSYGMFIRRLADP- 117
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGPQKSQRVASFL 141
+ IE++I+ T LP + E +LRY GQ Y +HYD+ D E GP+ R+A+FL
Sbjct: 118 -VITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPK--WRLATFL 174
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQ-----KCI--GLKVKPRQGDGLLFYSLLPNG 194
+YL+D+EEGGET FP +N + D + + +C + KP+ GD +LFYS PN
Sbjct: 175 MYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFYPNL 233
Query: 195 TIDPTSIHGSCP 206
T+DP ++H CP
Sbjct: 234 TMDPAAMHTGCP 245
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I M+K + S + + + IRTSSG F+ ED
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFL---EDNK 106
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
T IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 107 LT-SKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K S LA K + + IRTS G F+ D++
Sbjct: 52 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFL----DDN 102
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 103 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 161
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 162 YLNDVEEGGETFFP---------------KLNLSVNPRKGMAVYFEYFYQDQSLNELTLH 206
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 207 GGAPVTKGEKWIATQWVR 224
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 110/207 (53%), Gaps = 27/207 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDE 81
PR + F N +PE+C ++I A+ + R T+A + G N RTS G+F +
Sbjct: 116 PRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINAD--RTSDGMFFQ--RGQ 171
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQR 136
S + IEE+IA++ P NGE +L Y+ G +Y HYD FDP E G + QR
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQR 231
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
V + ++YL ++GG T FP + L+V P++G+ + F P+ +
Sbjct: 232 VGTLVMYLNTPDKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 276
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQE 223
++HG PV+ G+KW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVIAGDKWIATKWLRERE 301
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDGLIELSKNKIERSKIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKWVAT+W+R
Sbjct: 194 GGAPVTKGEKWVATQWMR 211
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 110/212 (51%), Gaps = 37/212 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 76
PR + F N + E+C +II A RP +++ TVDN G RTS+G+F
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAA----RPR---MQRSLTVDNQSGGEAVNDDRTSNGMFFQ 164
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 131
E++ + +E++IA++ P NGE +L Y+ G +Y HYD F P E G
Sbjct: 165 RGEND--LISRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILK 222
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ QRV + ++YL + GG T FP +GL+V PR+G+ + F
Sbjct: 223 RGGQRVGTLVMYLNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNR 267
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P ++HG PV++GEKW+ATKW+R++E
Sbjct: 268 PEPATK--TLHGGAPVLEGEKWIATKWLRERE 297
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K S LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECGELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTTKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDRSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D+
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDD 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFHQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 4/127 (3%)
Query: 98 LPRI-NGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP 156
+P I NGE+ IL Y+ G+KY HYD F + R+A+ L+YL+D+ +GGET+FP
Sbjct: 3 IPSIENGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFP 62
Query: 157 -FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKW 213
E+ ++ + +C G VKPR+GD LLF+SL N T D S+HGSCPV++GEKW
Sbjct: 63 NAESKLSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKW 122
Query: 214 VATKWIR 220
ATKWI
Sbjct: 123 SATKWIH 129
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K S LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWMR 211
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 102/201 (50%), Gaps = 24/201 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSGVFISAAEDE 81
PR +FP F + E+C + A+ L PS L G + + IRTS G I +E
Sbjct: 140 PRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHP--IRTSDGGAIGPT-NE 196
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 141
+ + I +IA T GE+ +LRY GQ+Y H D E +QR+A+F+
Sbjct: 197 NLVVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAE-----NQRIATFI 251
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
VYL D EGGET FP N ++V+PR GD + F ++ P+GT DP +
Sbjct: 252 VYLNDGFEGGETHFPLLN---------------IQVRPRIGDAIRFDTIRPDGTPDPRLV 296
Query: 202 HGSCPVVKGEKWVATKWIRDQ 222
H PV G KW+AT+WIR +
Sbjct: 297 HAGQPVRNGVKWIATRWIRRE 317
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+ T+W+R
Sbjct: 210 GGAPVTKGEKWITTQWVR 227
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 119/226 (52%), Gaps = 36/226 (15%)
Query: 7 GDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVD 62
GD SV +VL M PR + F N +PE+C+++I A + R T+A + G E V+
Sbjct: 118 GDRSV-----EVLLTMAHPRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVN 172
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+ RTS G+F ES + IEE+IA + P NGE +L Y+ G +Y HYD
Sbjct: 173 DD---RTSHGMFFQ--RGESPLVQRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYD 227
Query: 123 AFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 177
FDP E G + QRV + ++YL E+GG T FP D Q ++V
Sbjct: 228 YFDPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFP-----------DAQ----IEV 272
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P++G+ F P T ++HG PV+ G+KW+ATKW+R++E
Sbjct: 273 APQRGNAAFFSYERP--TPSTRTLHGGAPVLAGDKWIATKWLRERE 316
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEISKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 117/217 (53%), Gaps = 31/217 (14%)
Query: 16 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSS 71
QVL M PR + F N + E+C +II A+ + R T+A + G E +++ RTS+
Sbjct: 98 VQVLVSMRNPRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDD---RTSN 154
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG- 130
G+F E +G + +EE+IA++ P +GE +L Y G +Y H+D F P E G
Sbjct: 155 GMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGT 212
Query: 131 ----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ QRV + ++YL + E GG T+FP + L+V PR+G+ +
Sbjct: 213 PTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVPRRGNAVF 257
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F P+ + ++HG PV+ GEKW+ATKW+R++E
Sbjct: 258 FSYERPDPST--RTLHGGAPVLAGEKWIATKWLRERE 292
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSII-NMAKLNLRPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 80
PR + F N +PE+C ++I + A R T+A + G E +++ RTS G+F +
Sbjct: 116 PRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDD---RTSDGMFFQRGQ- 171
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQ 135
S + IEE+IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 172 -SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQ 230
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RV + ++YL E+GG T FP + ++V P++G+ + F P+ +
Sbjct: 231 RVGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQRGNAVFFSYERPHPS 275
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQE 223
++HG PV+ GEKW+ATKW+R++E
Sbjct: 276 T--RTLHGGAPVLAGEKWIATKWLRERE 301
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 106/227 (46%), Gaps = 29/227 (12%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
G D + IP + P+ + F N ++C +I + L ST ET
Sbjct: 77 GNVIDAADRRIPVLIRCERPQIVVFGNVLDQDECDEMIQRSMHKLEQSTTV--NAET--G 132
Query: 64 TQGI---RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
TQ + RTS G + ED + IE ++A + P NGE +LRY G +Y SH
Sbjct: 133 TQEVIRHRTSHGTWFQNGED--ALIRRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSH 190
Query: 121 YDAFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 175
YD F P G QRVA+ +VYL D+ GGET+FP G+
Sbjct: 191 YDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFPEA---------------GI 235
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
V PR+GD + F + +DP ++H PV GEKW+ TKW+R++
Sbjct: 236 SVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKWIMTKWVRER 282
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 117/217 (53%), Gaps = 31/217 (14%)
Query: 16 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSS 71
QVL M PR + F N + E+C +II A+ + R T+A + G E +++ RTS+
Sbjct: 98 VQVLVSMRNPRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDD---RTSN 154
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG- 130
G+F E +G + +EE+IA++ P +GE +L Y G +Y H+D F P E G
Sbjct: 155 GMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGT 212
Query: 131 ----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ QRV + ++YL + E GG T+FP + L+V PR+G+ +
Sbjct: 213 PTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVPRRGNAVF 257
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
F P+ + ++HG PV+ GEKW+ATKW+R++E
Sbjct: 258 FSYERPDPST--RTLHGGAPVLAGEKWIATKWLRERE 292
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECGELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWMR 211
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 112/217 (51%), Gaps = 22/217 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSGVFIS 76
LS P+A + F + E+C +I + +L+ ST+ K +T +D+ +RTS G F+
Sbjct: 4 LSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDTGVLDD---VRTSFGTFLP 60
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
D+ L IE ++ + + N E +L+Y GQ+Y H D P +R
Sbjct: 61 KKYDD--VLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGL----TSPNGGRR 114
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNA-----DGSYD------YQKCIGLKVKPRQGDGL 185
+A+ L++L + E+GGET FP + A G D ++ GL VKPR+GD +
Sbjct: 115 IATVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDELSDCAWRDGRGLAVKPRRGDAV 174
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
LF+S NG D S H SCP V G KW ATKWI ++
Sbjct: 175 LFFSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEK 211
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 110/219 (50%), Gaps = 27/219 (12%)
Query: 9 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 68
D I F+ P+ + F + + E+C +I A+ L+ ST + + D Q +R
Sbjct: 86 DVTVRIRFE----RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQ-LR 140
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE 128
TS G + ED ++ ++ +I+ + P +GE IL Y+ G +Y H+D F P +
Sbjct: 141 TSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQ 198
Query: 129 YG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
G + QRVA+ +VYL+D+E GGET+FP GL V RQG
Sbjct: 199 NGSVLHTARGGQRVATLIVYLSDVEGGGETVFP---------------DAGLAVMARQGG 243
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ F + +DP ++HG PV G+KW+ TKW+R++
Sbjct: 244 AIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 282
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 106/208 (50%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI---RTSSGVFISAAE 79
P+ + F N +PE+C +I ++ L+ ST+ + +G+ RTS G++ E
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIV----DPATGQEGVIRNRTSEGIWYQRGE 166
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 134
D ++ ++++IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 167 D--AFIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGG 224
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
QRVA+ +VYL D+ +GGET+FP GL V +QG + F +
Sbjct: 225 QRVATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQR 269
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP ++HG PV G+KW+ TKW+R++
Sbjct: 270 QLDPLTLHGGAPVHAGDKWIMTKWMRER 297
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 105
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 109/215 (50%), Gaps = 23/215 (10%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
+IP PR + F +F + ++C +I + + L+ S + + +N RTS G
Sbjct: 86 DIPILFAIETPRIVLFQHFLSDQECDELIAIGRNRLKRSPV-VNPDTGEENLISARTSQG 144
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 145 GMFQVGEHP--LIAKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEA 202
Query: 133 K-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 203 RQLEVGGQRVATMVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 247
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 VYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 110/219 (50%), Gaps = 27/219 (12%)
Query: 9 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 68
D I F+ P+ + F + + E+C +I A+ L+ ST + + D Q +R
Sbjct: 83 DVTVRIRFE----RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQ-LR 137
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE 128
TS G + ED ++ ++ +I+ + P +GE IL Y+ G +Y H+D F P +
Sbjct: 138 TSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQ 195
Query: 129 YG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
G + QRVA+ +VYL+D+E GGET+FP GL V RQG
Sbjct: 196 NGSVLHTARGGQRVATLIVYLSDVEGGGETVFP---------------DAGLAVMARQGG 240
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ F + +DP ++HG PV G+KW+ TKW+R++
Sbjct: 241 AIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 279
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 109/208 (52%), Gaps = 27/208 (12%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAED 80
+P + F N + +C++++ +A+ L R T+ ++ G N RTS G+F A
Sbjct: 89 LPDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRD--RTSQGMFF--ARG 144
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 135
E+ + +E +IA++ P GE +LRY+ G +Y HYD FDP E G + Q
Sbjct: 145 ENPLVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQ 204
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RVA+ ++YL + E+GG T+FP IGL+V PR+G + F P
Sbjct: 205 RVATLIMYLNEPEQGGATVFP---------------DIGLQVTPRRGTAVFFS--YPAAN 247
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+ HG PV GEKW+ATKW+R++E
Sbjct: 248 PASLTRHGGEPVKAGEKWIATKWLRERE 275
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 109/212 (51%), Gaps = 37/212 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 76
PR + F N + E+C +II A ++ S TVDN G RTS+G+F
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRSL-------TVDNQSGGEAVNDDRTSNGMFFQ 164
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 131
E++ + +E++IA++ P NGE +L Y+ G +Y HYD F P E G
Sbjct: 165 RGEND--LICRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILK 222
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ QRV + ++YL + GG T FP +GL+V PR+G+ + F
Sbjct: 223 RGGQRVGTLVMYLNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNR 267
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P+ ++HG PV++GEKW+ATKW+R++E
Sbjct: 268 PDPATK--TLHGGAPVLEGEKWIATKWLRERE 297
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 105/208 (50%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI---RTSSGVFISAAE 79
P+ + F N +PE+C +I ++ L+ ST+ + +G+ RTS G++ E
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIV----DPATGQEGVIRNRTSEGIWYQRGE 166
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 134
D ++ ++ +IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 167 D--AFIERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGG 224
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
QRVA+ +VYL D+ +GGET+FP GL V +QG + F +
Sbjct: 225 QRVATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQR 269
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP ++HG PV G+KW+ TKW+R++
Sbjct: 270 QLDPLTLHGGAPVRAGDKWIMTKWMRER 297
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 69/215 (32%), Positives = 110/215 (51%), Gaps = 23/215 (10%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
++ +V P+ + F + + ++C +I A+ L+ ST + D Q +RTS G
Sbjct: 106 DVAVRVRFERPQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQ-LRTSEG 164
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP- 131
+ ED ++ ++ +I+ + P +GE IL Y G +Y H+D F P + G
Sbjct: 165 FWFQRCED--AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSV 222
Query: 132 ----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ +VYL+D+ GGET+FP NA GL V RQG + F
Sbjct: 223 LHTSRGGQRVATLIVYLSDVAGGGETVFP-----NA----------GLAVMARQGGAIYF 267
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
L + +DP ++HG PV GEKW+ TKW+R++
Sbjct: 268 RYLNGHRQLDPLTLHGGAPVTNGEKWIMTKWMRER 302
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 115 bits (289), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 108/208 (51%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 79
P+ + F N +P++C +I ++ L+ ST+ A + + + N RTS G++ E
Sbjct: 124 PQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRN----RTSEGIWYQRGE 179
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 134
D ++ ++++IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 180 D--ALIERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGG 237
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
QRVA+ +VYL D+ +GGET+FP GL V +QG + F +
Sbjct: 238 QRVATLVVYLNDVPDGGETIFPEA---------------GLSVAAQQGGAVYFRYMNGRR 282
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 283 QLDPLTLHGGAPVLSGDKWIMTKWVRER 310
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 112/220 (50%), Gaps = 32/220 (14%)
Query: 10 SVTNIPFQV-LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI- 67
S NIP + +S PR P F T E+C+ +I +K L+P + + G+
Sbjct: 50 STDNIPKLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCN---------EISSGVH 100
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP- 126
R+ G+F+ E++ I ++ L + E ++RY G++ ++H+D F+P
Sbjct: 101 RSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLTE-SSEVMQVIRYNPGEETSAHFDYFNPL 159
Query: 127 QEYGPQK----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 182
G K QR+ + L+YL D+EEGGET FP N +KVKP +G
Sbjct: 160 TTNGAMKIGLYGQRICTILMYLADVEEGGETSFPEVN---------------VKVKPIKG 204
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
D +LFY+ PNG +DP S+H PV+KG KW+A K + +
Sbjct: 205 DAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVNQK 244
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 108/215 (50%), Gaps = 23/215 (10%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
+IP PR + F +F + +C +I + + L+ S + + +N RTS G
Sbjct: 86 DIPILFAIETPRIVLFQHFLSDAECDELIAIGRNRLKRSPV-VNPDTGEENLISARTSQG 144
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
E + IE +IA+ +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 145 GMFQVGEHP--LIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEA 202
Query: 133 K-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 203 RQLEVGGQRVATMVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 247
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 VYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 112/219 (51%), Gaps = 17/219 (7%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
++ + LS P F ++ I+ ++ +L+PST+ L G RTS+
Sbjct: 105 DVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTT 164
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+S+++ LD I++++A +T +P + E +LRY+ QKY+ H D F P E+
Sbjct: 165 YFLSSSK--HSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYF-PVEHHKN 221
Query: 133 KSQ-----------RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC-IGLKVKPR 180
R+ + Y++D+ +GG T+FP G A + C GLKV P+
Sbjct: 222 SPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGG--APRPQSMKDCSTGLKVSPK 279
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+ ++FYS+LPNG DP S+HG CPV G K+ KW+
Sbjct: 280 KRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 105/201 (52%), Gaps = 27/201 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C ++I ++K L+ S + N +RTSS F+ E ES
Sbjct: 37 PLIVVLGNVLSDEECDALIGLSKDKLKRSKIG-----NTRNENDMRTSSSTFME--EGES 89
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ +E++I+++ +P NGE IL YKIGQ+Y +H+D F + R+++ ++
Sbjct: 90 EVVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFK-----NASNPRISTLVM 144
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + V P++G + F N ++ ++H
Sbjct: 145 YLNDVEEGGETYFP---------------KLNFSVSPQKGMAVYFEYFYDNQELNDLTLH 189
Query: 203 GSCPVVKGEKWVATKWIRDQE 223
G PV+ G+KW AT+W+R ++
Sbjct: 190 GGAPVIIGDKWAATQWMRRKQ 210
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 106/200 (53%), Gaps = 24/200 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I +A ++ S + + E +RTSS +FI +DE+
Sbjct: 33 PLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTTREEN-----ELRTSSSMFIE--DDEN 85
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ ++++I+ + +P +GE ILRY GQ+Y +H+D F + R+++ ++
Sbjct: 86 LIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSD--SKITNNRISTLVM 143
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+E+GGET FP + V PR+G + F + T++ ++H
Sbjct: 144 YLNDVEQGGETFFPH---------------LKFSVSPRKGMAVYFEYFYSDQTLNDFTLH 188
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
G PVV+GEKWVAT+W+R Q
Sbjct: 189 GGAPVVEGEKWVATQWMRKQ 208
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 118/226 (52%), Gaps = 34/226 (15%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDN 63
AGD +V NI + PR + F N + E+C ++I A + R T+A + G E V++
Sbjct: 98 AGDRAV-NILLAIAK--PRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGGEEVND 154
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
RTS G+F E+ + IEE+IA++ P NGE +L Y+ G +Y HYD
Sbjct: 155 D---RTSDGMFFQRGENP--VVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDY 209
Query: 124 FDPQEYG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK 178
FDP E G + QRV + ++YL E+GG T FP + ++V
Sbjct: 210 FDPGEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFP---------------DVHVEVA 254
Query: 179 PRQGDGLLF-YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
P++G+ + F Y T ++HG PV+ GEKW+ATKW+R++E
Sbjct: 255 PQRGNAVFFSYERAHPAT---RTLHGGAPVIAGEKWIATKWLRERE 297
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 95/154 (61%), Gaps = 7/154 (4%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQG 66
DDS +++SW PRA + NF E+CK +I +AK ++ ST+ K G++ D+
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 125
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQKY HYD F
Sbjct: 126 VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 183
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG 160
+ QR+A+ L+YL+D+EEGGET+FP G
Sbjct: 184 EYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKG 217
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 106/202 (52%), Gaps = 25/202 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P L N + E+C +I ++K ++ S + + IRTSSG+F +E+E
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAER-----EVNSIRTSSGMFFEESENE- 92
Query: 83 GTLDLIEEKIAKVTMLPRIN-GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 141
+ IE +++K+ M P I E IL+Y Q+Y +H+D F K+ R+++ +
Sbjct: 93 -LVHQIERRLSKI-MGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKAS-KNNRISTLV 149
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL D+EEGGET FP +GL + P +G + F + ++ ++
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSISPTKGMAVYFEYFYSDAELNDRTL 194
Query: 202 HGSCPVVKGEKWVATKWIRDQE 223
HG PV+KGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 115/212 (54%), Gaps = 21/212 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ +++ P + F + +P++ + N+A+ L+ +T+ + G+ V ++ +RTS G
Sbjct: 308 LKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTVHV-NGKYV--SRRVRTSKGA 364
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQ 132
++ D + IE ++ +T L EA+NI+ Y +G Y +HYD F+ ++ +
Sbjct: 365 WLE--RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSE 422
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+A+ L YL+D+E+GG T+FP + L V P +G L +Y+LL
Sbjct: 423 TGDRIATVLFYLSDVEQGGATVFP---------------NLKLAVSPERGMALFWYNLLD 467
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NGT D ++HG CPV+ G KWV T WI ++ Q
Sbjct: 468 NGTGDTRTLHGGCPVLVGSKWVMTLWIHERAQ 499
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 79
P+ + F N +PE+C +I ++ L+ ST+ A + + + N RTS G++ E
Sbjct: 118 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRN----RTSEGIWYQRGE 173
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 134
D ++ ++++IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 174 D--AFIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGG 231
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
QRVA+ ++YL D+ +GGET+FP GL V +QG + F +
Sbjct: 232 QRVATLVIYLNDVPDGGETIFPEA---------------GLSVAAKQGGAVYFRYMNGQR 276
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP ++HG PV G+KW+ TKW+R++
Sbjct: 277 QLDPLTLHGGAPVRAGDKWIMTKWMRER 304
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 106/202 (52%), Gaps = 25/202 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P L N + E+C +I ++K ++ S + + IRTSSG+F +E+E
Sbjct: 39 PLVLVLGNVLSNEECDELIRLSKDKMQRSKIGAAR-----EVNSIRTSSGMFFDESENE- 92
Query: 83 GTLDLIEEKIAKVTMLPRIN-GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 141
+ IE +++K+ M P I E IL+Y Q+Y +H+D F K+ R+++ +
Sbjct: 93 -LVHQIERRLSKI-MGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKAS-KNNRISTLV 149
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL D+EEGGET FP +GL V P +G + F + ++ ++
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 202 HGSCPVVKGEKWVATKWIRDQE 223
HG PV+KGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 113/210 (53%), Gaps = 24/210 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I +VL P + F N + E+C+++I+ A L S LA ++ IRTSSG+
Sbjct: 21 ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKE------ISSIRTSSGM 74
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F E+E+ + IE++I+ + LP + E +L Y+ GQ++ +H+D F P +
Sbjct: 75 FFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGPN-HPSSS 131
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
+ R+++ +VYL D+EEGG T FP +G+ P++G + F +
Sbjct: 132 NNRISTLVVYLNDVEEGGVTTFP---------------NLGIVNVPKKGTAVYFEYFYND 176
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
++ ++H PV++GEKWVAT+W+R ++
Sbjct: 177 QKLNELTLHSGEPVIQGEKWVATQWMRKKQ 206
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 104/191 (54%), Gaps = 25/191 (13%)
Query: 36 QCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAK 94
+C +I + + ++ R S + G+ + T R S G F++A+ D ++ I+ +IA+
Sbjct: 107 ECDRLIEIGREHVQRSSVVDPDSGKEI--TIEERRSEGAFVNASTD--ALVETIDRRIAE 162
Query: 95 VTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVYLTDLEE 149
+ P NGE +ILRY +G +Y HYD F ++ G + QR+A+ ++YL ++E+
Sbjct: 163 LFRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQ 222
Query: 150 GGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVK 209
GG+T FP IGL + PR+G L F + G DP ++H PV K
Sbjct: 223 GGDTTFP---------------DIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEK 267
Query: 210 GEKWVATKWIR 220
GEKW+ATKWIR
Sbjct: 268 GEKWIATKWIR 278
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 106/202 (52%), Gaps = 25/202 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P L N + E+C +I ++K ++ S + + IRTSSG+F +E+E
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAAR-----EVNSIRTSSGMFFEESENE- 92
Query: 83 GTLDLIEEKIAKVTMLPRIN-GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 141
+ IE +++K+ M P I E +L+Y Q+Y +H+D F K+ R+++ +
Sbjct: 93 -LVHQIERRLSKI-MGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKAS-KNNRISTLV 149
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL D+EEGGET FP +GL V P +G + F + ++ ++
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 202 HGSCPVVKGEKWVATKWIRDQE 223
HG PV+KGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 112/215 (52%), Gaps = 27/215 (12%)
Query: 17 QVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGV 73
+V+S M P A+ +F + +C+ +I++A+ L ST+ G V G R+S G+
Sbjct: 94 RVISRMQRPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNV--VAGHRSSDGM 151
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDP--QE 128
F E + + +E +IA++T LP NGE +L Y++G + H D A +P QE
Sbjct: 152 FFRLGE--TPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQE 209
Query: 129 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
+ QRV + L+YL D+E GGETMFP G V PR+G L F
Sbjct: 210 SIARSGQRVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFE 254
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
G DP+S+H S P+ GEKWVATKWIR +
Sbjct: 255 YGNRFGLADPSSLHTSTPLRVGEKWVATKWIRTRR 289
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 110/227 (48%), Gaps = 32/227 (14%)
Query: 5 QAGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
QAGD V QVL+ + PR + F N E+C ++I +A+ ++ S + D
Sbjct: 81 QAGDRQV-----QVLASLLHPRVIVFGNLLAAEECDALIALARRQIKRSPV-FDPDTGQD 134
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
RTS G+F + +E +IA + P NGE +LRY G +Y HYD
Sbjct: 135 QQHQARTSEGMFFGRGANP--LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYD 192
Query: 123 AFDPQEYGPQKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 177
FDP G + + QRVAS ++YL +GG T FP + L+V
Sbjct: 193 YFDPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAH---------------LEV 237
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
P +G+ + F P+ ++HG PVV+GEKWVATKW+R++
Sbjct: 238 APIKGNAVYFSYDRPHPMTG--TLHGGAPVVEGEKWVATKWLRERRH 282
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 104/205 (50%), Gaps = 27/205 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLAL--RKGETVDNTQGIRTSSGVFISAAED 80
P + +P +C +I + + +R S++ GE + + R S G F++ + D
Sbjct: 91 PVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDA---RKSEGAFVNGSTD 147
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQ 135
+ I+ +IA++ P NGE +ILRY G +Y H+D F ++ G + Q
Sbjct: 148 P--LVATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQ 205
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+A+ ++YL +EEGG+T FP IGL + PR+G L F + G
Sbjct: 206 RIATLILYLNQVEEGGDTTFPD---------------IGLTIHPRRGAALYFEYVNALGQ 250
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIR 220
DP ++H PV +GEKW+ATKW+R
Sbjct: 251 TDPRTLHAGMPVERGEKWIATKWMR 275
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 111/210 (52%), Gaps = 24/210 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I +VL P + F N + E+C+++I+ A L S LA ++ IRTSSG+
Sbjct: 21 ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKE------ISSIRTSSGM 74
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F E+E+ + IE++I+ + LP + E +L Y+ GQ++ H+D F P +
Sbjct: 75 FFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGPN-HPSSS 131
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
+ R+ + +VYL D+EEGG T FP +G+ P++G + F +
Sbjct: 132 NNRICTLVVYLNDVEEGGVTTFP---------------NLGIVNVPKKGTAVYFEYFYND 176
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
++ ++H PV++GEKWVAT+W+R ++
Sbjct: 177 QKLNELTLHSGEPVIQGEKWVATQWMRKKQ 206
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 106/195 (54%), Gaps = 23/195 (11%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVD 62
A D + + +V+SW PR + F NF + E+C + +A+ L ST+ A KG D
Sbjct: 50 AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD 109
Query: 63 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
+RTSSG+F+++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D
Sbjct: 110 ----VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHD 165
Query: 123 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-------GL 175
F + QRVA+ L+YLTD EGGET FP DG +CI GL
Sbjct: 166 YFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP----QAGDG-----ECICGGRLVRGL 216
Query: 176 KVKPRQGDGLLFYSL 190
VKP +GD +LF+S+
Sbjct: 217 CVKPNKGDAVLFWSM 231
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 105/211 (49%), Gaps = 37/211 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + E+C ++ +A RP L + ETVDN+ G RTS G+F
Sbjct: 91 LPRVVVFGGLLSDEECDELVALA----RPR---LARSETVDNSTGGSEVNAARTSDGMFF 143
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
E ++ IE +IA++ P GE +LRY+ G +Y H+D FDP G
Sbjct: 144 ERGEKP--LIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANIL 201
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
+ QRV + ++YL GG T FP +GL+V+P +G+ + F
Sbjct: 202 RRGGQRVGTVVMYLNTPAGGGATTFP---------------EVGLEVQPVKGNAVFFSYE 246
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
P + ++HG PV+ GEKWVATKW+R+
Sbjct: 247 RPLAST--RTLHGGAPVLDGEKWVATKWMRE 275
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 104/207 (50%), Gaps = 27/207 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSSGVFISAAED 80
P+ + F + + +C +I ++ L+ ST L E V RTS GV+ ED
Sbjct: 97 PQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRN---RTSEGVWYRRGED 153
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 135
+ + +E +IA +T P NGE +L Y +Y+ H+D F P + G Q Q
Sbjct: 154 Q--LIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQ 211
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
RVA+ ++YL D+ +GGET+FP GL V + G + F +
Sbjct: 212 RVATLIIYLNDVADGGETVFP---------------TAGLSVAAQAGGAVYFRYMNAERQ 256
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP+++HG PV+ G+KW+ TKW+R++
Sbjct: 257 LDPSTLHGGAPVLAGDKWIMTKWMRER 283
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 89/150 (59%), Gaps = 2/150 (1%)
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
+F++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F +
Sbjct: 1 MFLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKR 60
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
QR+A+ L+YL+D EGGET FP N + S + GL VKP +G+ +LF+S+
Sbjct: 61 GGQRIATMLMYLSDNVEGGETYFP--NIGSGQCSCGGKTVEGLSVKPTKGNAVLFWSMGL 118
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+G DP S+HG C V+ GEKW ATKW+R +
Sbjct: 119 DGQSDPLSVHGGCEVLAGEKWSATKWMRQK 148
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 67/225 (29%), Positives = 114/225 (50%), Gaps = 17/225 (7%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
++ + LS P F ++ I+N++ +L+PS + L G RTS+
Sbjct: 198 DVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTT 257
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+ + D +D I+++++ +T +P + E +LRY+ QKY+ H D F P E+
Sbjct: 258 YFLPS--DAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYF-PVEHHKN 314
Query: 133 KSQ-----------RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-GLKVKPR 180
R+ + Y++D+ +GG T+FP G A + C GL V P+
Sbjct: 315 APHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGG--APRPTSMKDCTTGLNVPPK 372
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
+ ++FYS+LPNG DP S+HG CPV +G K+ KW+ ++ +Y
Sbjct: 373 KRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWVWNKARY 417
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 107/198 (54%), Gaps = 24/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I ++K + S +A VDN +RTSS FI E+E+
Sbjct: 39 PLIVVLGNVLSDEECDELIRLSKDRINRSKIA---NANVDN---MRTSSSTFIE--ENEN 90
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE++I+++ +P GE IL Y++GQ+Y SH+D F + + R+++ ++
Sbjct: 91 IIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFS-SPHNAINNPRISTLVM 149
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL+D+E+GGET FP + V P++G + F + T++ ++H
Sbjct: 150 YLSDVEQGGETYFP---------------KLHFSVSPQKGMAVYFEYFYNDQTLNELTLH 194
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV+ G+KW AT+W+R
Sbjct: 195 GGAPVIVGDKWAATQWMR 212
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 108/208 (51%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 79
P+ + F + +P++C +I ++ L+ ST A K + + N RTS G++ E
Sbjct: 103 PQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRN----RTSEGIWYQRGE 158
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 134
D ++ ++ +I+ + P NGE +LRY +Y H+D F P + G Q
Sbjct: 159 DP--FIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGG 216
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
QRVA+ ++YL D+ +GGET+FP E GM+ S QG + F +
Sbjct: 217 QRVATLVIYLNDVPDGGETIFP-EAGMSVAAS--------------QGGAVYFRYMNGRR 261
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 262 QLDPLTLHGGAPVLSGDKWIMTKWMRER 289
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 87/144 (60%), Gaps = 15/144 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LSW PRA + F + ++C ++N+AK + S +A DN G +RTSSG
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVA-------DNDSGKSIMSQVRTSSG 92
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+S ED+ + IE+++A T LP N E+ IL Y++GQKY++H+D F + +
Sbjct: 93 TFLSKHEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKR 150
Query: 133 KSQRVASFLVYLTDLEEGGETMFP 156
RVA+ L+YLTD+++GGET+FP
Sbjct: 151 GGHRVATVLMYLTDVKKGGETVFP 174
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 106/216 (49%), Gaps = 40/216 (18%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 76
PR + F F + ++C +++ +A+ L + ETVDN G RTS G+F
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLA-------RSETVDNDTGGSEVNEARTSQGMFFM 152
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 131
E E + IE +IA + P NGE +L Y+ G +Y HYD FDP + G
Sbjct: 153 RGEGE--LISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILK 210
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF-YSL 190
+ QRV + ++YL E GG T FP N L+V P +G+ + F Y
Sbjct: 211 RGGQRVGTLVMYLNTPERGGGTTFPDVN---------------LEVAPIKGNAVFFSYER 255
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
T S+HG PV+ GEKWVATKW+R Q ++D
Sbjct: 256 AHPST---RSLHGGAPVLAGEKWVATKWLR-QARFD 287
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 101/191 (52%), Gaps = 22/191 (11%)
Query: 51 STLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 110
ST+A E + RTSS ++S D + I ++A++ LP E +L
Sbjct: 4 STIAEAGNEAKNGVGSARTSSTAWLSKTADP--LVAKIRTRVAELVKLPMELAEDMQVLH 61
Query: 111 YKIGQKYNSHYDAFDPQEY-----GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADG 165
Y Q Y +H+D FDP Y P +++ + F YL+D+EEGGET+FPF NG +
Sbjct: 62 YSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFF-YLSDVEEGGETVFPFANGDDRRV 120
Query: 166 SYDYQKCI-GLKVKPRQGDGLLFYSLLPN------------GTIDPTSIHGSCPVVKGEK 212
+ D+ C GLKVKP+ G+ ++FYS+L +D S+HG C V+KG+K
Sbjct: 121 T-DFADCSRGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIKGDK 179
Query: 213 WVATKWIRDQE 223
W A WI +++
Sbjct: 180 WAANYWIANKK 190
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 102/209 (48%), Gaps = 35/209 (16%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFIS 76
P + F + E+C+ +I ++ L PS + VD G R+S G +
Sbjct: 96 PDIVVVDEFMSGEECEQLIEQSRRKLTPSAI-------VDPQTGKFQVIADRSSEGTYFQ 148
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 131
E S + ++ +I+++ P +GE IL Y +G +Y H+D F E G
Sbjct: 149 RGE--SPLISRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMT 206
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
Q QRVA+ ++YL ++ EGGET+FP +G+ + P++G F
Sbjct: 207 QSGQRVATLVMYLNEVTEGGETVFPD---------------VGISITPKRGSAAYFAYCN 251
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G +DP ++HG PV+ GEKW+ATKW+R
Sbjct: 252 SLGQVDPATLHGGAPVLTGEKWIATKWMR 280
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 79
P+ + F + +P++C +I ++ L+ ST A K + + N RTS G++ E
Sbjct: 103 PQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRN----RTSEGIWYQRGE 158
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 134
D ++ ++ +I+ + P NGE IL Y +Y H+D F P + G Q
Sbjct: 159 DP--FIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGG 216
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
QRVA+ ++YL D+ +GGET+FP E GM+ S QG + F +
Sbjct: 217 QRVATLVIYLNDVPDGGETIFP-EAGMSVAAS--------------QGGAVYFRYMNDRR 261
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 262 QLDPLTLHGGAPVLAGDKWIMTKWMRER 289
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 105/208 (50%), Gaps = 29/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA---LRKGETVDNTQGIRTSSGVFISAAE 79
P+ + F + +P++C +I ++ L+ ST K + + N RTS G++ E
Sbjct: 106 PQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRN----RTSEGIWYQRGE 161
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 134
D ++ ++ +I+ + P NGE IL Y +Y H+D F P + G Q
Sbjct: 162 D--AFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGG 219
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
QRVA+ ++YL D+ +GGET+FP G+ V RQG + F +
Sbjct: 220 QRVATLVIYLNDVPDGGETIFPEA---------------GISVAARQGGAVYFRYMNGQR 264
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 265 QLDPLTLHGGAPVLGGDKWIMTKWMRER 292
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 99/206 (48%), Gaps = 23/206 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P A+ F T +C +I +A+ L ST+ + D G R+S G F AE +
Sbjct: 102 PAAVLLDEFLTGSECDQLIALARPRLSRSTV-VDPVTGRDVAAGHRSSDGTFFRLAE--T 158
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRV 137
+ +E +IA +T L NGE +LRY+ G + H D +E + QRV
Sbjct: 159 PLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQRV 218
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
+ L+YL D+E GGET+FP +G V PR+G L F G D
Sbjct: 219 GTLLMYLNDVEGGGETVFP---------------QVGCSVVPRRGQALYFEYCNRAGVCD 263
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQE 223
P S+H S P+ GEKWVATKWIR +
Sbjct: 264 PASLHASTPLRSGEKWVATKWIRARR 289
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 103/198 (52%), Gaps = 23/198 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C+ +I M++ L+ S + TVD+ IRTSS +F E+E
Sbjct: 39 PLIVILGNVLSDEECEGLIRMSEDKLKRSKIG--NTRTVDD---IRTSSSMFFEEGENE- 92
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE +++++ +P +GE +L Y IGQ+Y +H+D + R+++ ++
Sbjct: 93 -LVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDF-FSSSSRAASNPRISTLVM 150
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + V P++G + F N ++ ++H
Sbjct: 151 YLNDVEEGGETYFP---------------KLNFSVNPQKGSAVYFEYFYDNQDLNDLTLH 195
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV+KG KW AT+W+R
Sbjct: 196 GGAPVIKGSKWAATQWMR 213
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 110/213 (51%), Gaps = 20/213 (9%)
Query: 15 PF--QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF ++LS P + + + TP + ++ N++K +++ + K + RTS+
Sbjct: 375 PFKTELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 434
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGP 131
V++++ E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ PQ
Sbjct: 435 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQLEHR 492
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R+A+ L YL+D+ +GG T+FP N + V+PRQGD LL+Y+L
Sbjct: 493 GGGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 537
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G + ++H SCP++KG KW KWI + Q
Sbjct: 538 DRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 570
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 77/215 (35%), Positives = 110/215 (51%), Gaps = 27/215 (12%)
Query: 17 QVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGV 73
+V+S M P A+ +F + +C+ +I +A+ L ST+ G V G R+S G+
Sbjct: 94 RVISRMQRPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNV--VAGHRSSDGM 151
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDP--QE 128
F E + + +E +IA++T LP NGE +L Y+ G + H D A +P +E
Sbjct: 152 FFRLGE--TPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRE 209
Query: 129 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
+ QRV + L+YL D+E GGETMFP G V PR+G L F
Sbjct: 210 SIARSGQRVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFE 254
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
G DP+S+H S P+ GEKWVATKWIR +
Sbjct: 255 YGNRFGLADPSSLHTSTPLRAGEKWVATKWIRTRR 289
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 100/207 (48%), Gaps = 25/207 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 81
P A+ F + +C+ +I +A+ L ST+ G + G R+S G+F E
Sbjct: 102 PAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNI--VAGHRSSDGMFFRLGE-- 157
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQR 136
+ + IE++IA +T P NGE +L Y+ G + H D P E + QR
Sbjct: 158 TPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQR 217
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
V + L+YL D+E GGET+FP +G V PR+G F +G
Sbjct: 218 VGTLLMYLNDVESGGETLFP---------------QVGCSVVPRRGQAFYFEYGNGSGRS 262
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQE 223
DP S+H S P+ G+KWVATKWIR +
Sbjct: 263 DPASLHASSPIGSGDKWVATKWIRTRR 289
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 73/230 (31%), Positives = 112/230 (48%), Gaps = 44/230 (19%)
Query: 5 QAGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 62
+AGD V VL+ M PR + F + ++C ++ +A+ L + ETVD
Sbjct: 22 RAGDREV-----HVLATMALPRVVVFGGLLSEQECDELVALAQPRLL-------RSETVD 69
Query: 63 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 116
N+ G RTS G+F E+ ++ IE +IA++ P GE +L Y+ G +
Sbjct: 70 NSTGGSEVNAARTSDGMFFE--RGETPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQ 127
Query: 117 YNSHYDAFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK 171
Y H+D FDP G + QRV + ++YL GG T FP
Sbjct: 128 YKPHHDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFP--------------- 172
Query: 172 CIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+GL+V+P +G+ + F P + ++HG PV+ GEKWVATKW+R+
Sbjct: 173 EVGLEVQPIKGNAVFFSYERPLASTR--TLHGGAPVLDGEKWVATKWLRE 220
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 103/212 (48%), Gaps = 35/212 (16%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFI 75
+PR + N + ++C +I M++ ST T+DN GI RTS I
Sbjct: 91 IPRIVVLGNVLSDDECDAIAAMSRTRFARST-------TIDNASGINRFDDSRTSESAHI 143
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-- 133
E E + I+ ++A ++ P +GE + +Y+ G +Y H+D FDP G K
Sbjct: 144 QRGETE--LIARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHL 201
Query: 134 ---SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QR+A+ ++YLTD+EEGG T FP IGL V P++G L F +
Sbjct: 202 EKSGQRLATIILYLTDVEEGGGTSFP---------------GIGLDVHPQKGGALFFRNT 246
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P G D + H PV KG K +A KW+R++
Sbjct: 247 TPYGVPDRKTQHAGLPVEKGTKIIANKWLREK 278
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 111/213 (52%), Gaps = 22/213 (10%)
Query: 15 PF--QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF ++LS P + + + TP + ++ N++K +++ + K + RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 371
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGP 131
V++++ E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ PQ G
Sbjct: 372 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQHRG- 428
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R+A+ L YL+D+ +GG T+FP N + V+PRQGD LL+Y+L
Sbjct: 429 -GGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 472
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G + ++H SCP+++G KW KWI + Q
Sbjct: 473 DRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 505
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 104/201 (51%), Gaps = 25/201 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P+ N + E+C+++I ++K + S + + + IRTSS F+ +DE
Sbjct: 34 PKIAILGNVVSEEECEALIRLSKDKVNRSKIG-----SDHDVSDIRTSSSAFL--PDDE- 85
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
IE+++A++ +P +GE +IL YK GQ+Y +H+D F K+ R+++ ++
Sbjct: 86 -LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAA-KNPRISTLVL 143
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP N L V P +G + F + I+ ++H
Sbjct: 144 YLNDVEEGGETYFPEMN---------------LTVSPHKGMAVYFEYFYNDPAINERTLH 188
Query: 203 GSCPVVKGEKWVATKWIRDQE 223
G PV GEKW AT W+R Q+
Sbjct: 189 GGSPVTAGEKWAATMWVRRQQ 209
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 120/260 (46%), Gaps = 50/260 (19%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL----ALRKGE 59
G D V + + LS PRA F F T E+C +I ++ +L+ ST+ AL GE
Sbjct: 71 GPTRDIGVGDARVEKLSDSPRAYLFREFLTKEECAHLIEISTPHLKRSTVVGDDAL-LGE 129
Query: 60 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGE---AFNILRYKIGQK 116
RTS+G F+ D+ + +E ++ + LP N E A ++LRY++GQ+
Sbjct: 130 ADGRRSDYRTSTGAFLPKLYDD--VVTRVERRVEAFSRLPFENQEQLQARSLLRYELGQE 187
Query: 117 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--------- 167
Y H D F + G +RVA+ L++L + EEGGET FP NG ++
Sbjct: 188 YRDHVDGFATENGG----KRVATVLMFLAEPEEGGETAFP--NGEPSEAVAARVAAQRAR 241
Query: 168 -DYQKCI-----------------GLKVKPRQGDGLLFYSLLPNGT-------IDPTSIH 202
+ C G VKPR GD +LF+S + + S H
Sbjct: 242 GELSDCAWRGGGGGTAGGGRGNLRGFAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTH 301
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
SCP +G KW ATKWI ++
Sbjct: 302 ASCPTTRGVKWTATKWIHER 321
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P N + E+C +I+++K + S +A + IRTS+ VF+ ED S
Sbjct: 33 PFVAVLGNVLSDEECDELISLSKDRMNRSKIA------GNQENDIRTSTSVFL--PEDAS 84
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ +E++I+++ +P +GE +L Y+IGQ+Y +H+D F P++ ++ R+++ ++
Sbjct: 85 EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKL--IENPRISTLVL 142
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGG+T FP + L V P +G + F + ++ ++H
Sbjct: 143 YLNDVEEGGDTYFP---------------NLKLSVSPHKGMAVYFEYFYDDPMLNELTLH 187
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV G+KW AT W+R
Sbjct: 188 GGAPVTIGDKWAATMWMR 205
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 103/196 (52%), Gaps = 26/196 (13%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
N + E+C+ +I ++K + S + + E D IRTSS F+ E + IE
Sbjct: 41 NVVSEEECEELIFLSKNKMNRSKIG-SQHEVSD----IRTSSSTFLP----EDDLTNRIE 91
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEE 149
+++A++ +P +GE +IL YK GQ+Y +HYD F + + R+++ ++YL D+EE
Sbjct: 92 KRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLNDVEE 150
Query: 150 GGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVK 209
GGET FP N L + P +G + F + I+ ++HG PV
Sbjct: 151 GGETYFPHMN---------------LSISPHKGMAVYFEYFYSDPLINERTLHGGSPVTS 195
Query: 210 GEKWVATKWIRDQEQY 225
GEKW AT W+R ++QY
Sbjct: 196 GEKWAATMWVR-RKQY 210
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 113/222 (50%), Gaps = 22/222 (9%)
Query: 5 QAGDDSVTNIP---FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETV 61
Q+ S++ +P Q +SW PRA+ + NF + ++ + II++A ++ ST+ K E V
Sbjct: 27 QSHFQSLSQLPTCRIQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGV 86
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
IRTS G F+ A+D + IEE++A + +P + E +LRY KY H
Sbjct: 87 --VDDIRTSYGTFLRRAQDP--VIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHI 142
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCIG-LKVK 178
D +RVA+ L+YL E G + P M A+ S G + K
Sbjct: 143 DGL----------ERVATVLMYLVG-ESPGPDLAPVSACECMYAEQSNPSACAKGHVAYK 191
Query: 179 PRQGDGLLFYSLLPN-GTIDPTSIHGSCPVVKGEKWVATKWI 219
P++GD L+F+ + P+ T D S+H CPVV G KW A KWI
Sbjct: 192 PKRGDALMFFDVKPDYTTTDGHSMHTGCPVVAGVKWNAVKWI 233
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 102/200 (51%), Gaps = 25/200 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I +K L+ S + GE Q IRTSSGVF +E+
Sbjct: 36 PLIVILGNVLSNEECDELIEHSKERLQRSKI----GEERSVNQ-IRTSSGVFC----EEN 86
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
T+ IE++I+++ +P +G+ +L Y GQ+Y H+D F + R+++ ++
Sbjct: 87 ETVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRAS-ANNRISTLVM 145
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP N L V P +G + F N ++ ++H
Sbjct: 146 YLNDVEEGGETTFPMLN---------------LSVFPSKGMAVYFEYFYSNHELNERTLH 190
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
PV KGEKWVAT W+R Q
Sbjct: 191 AGAPVRKGEKWVATMWMRRQ 210
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 102/198 (51%), Gaps = 23/198 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I +K ++ S +A VD +RTSS F E+E
Sbjct: 38 PLIVILGNVLSDEECDQLIQQSKDRMQRSKVA--NSLEVDE---LRTSSSTFFHEGENE- 91
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ IE++I+++ +P +GE IL YKIGQ+Y +H+D F + R+++ ++
Sbjct: 92 -IVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAA-SNPRISTLVM 149
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+E+GGET FP + V P++G + F + ++ ++H
Sbjct: 150 YLNDVEQGGETYFP---------------KLNFSVSPQKGMAVYFEYFYNDQNLNDLTLH 194
Query: 203 GSCPVVKGEKWVATKWIR 220
G PVV G+KW AT+W+R
Sbjct: 195 GGAPVVMGDKWAATQWMR 212
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 100/205 (48%), Gaps = 32/205 (15%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS----AAEDESG 83
+ TPE C+++I + + LRP+T+ D G + G +S D+
Sbjct: 68 WAGLLTPENCQNLIAIGQSLLRPATV-------TDEQTGQEVAHGERVSEMAWPKRDDYP 120
Query: 84 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---QKSQRVASF 140
L + E IA++T +P E IL Y+ G +Y HYDAF P Q R A+
Sbjct: 121 ILQSLAEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAAD--APTLRQGGNRQATL 178
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 200
++YL +EEGGET FP +GL+V P G G+ F +L G P S
Sbjct: 179 ILYLNAVEEGGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLS 223
Query: 201 IHGSCPVVKGEKWVATKWIRDQEQY 225
+H PV KGEKW+AT+WIR QE Y
Sbjct: 224 LHAGLPVRKGEKWIATQWIR-QEAY 247
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 104/198 (52%), Gaps = 24/198 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + E+C +I+++K + S ++ + + +RTSS +F AE++
Sbjct: 44 PLIVLLGNVLSEEECDQLISLSKDRIERSKISNK------SVHDLRTSSSMFFDDAEND- 96
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ +E++++++ +P +GE IL Y IGQ+Y +HYD F + R+++ ++
Sbjct: 97 -VVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGN-SKVNNPRISTLVM 154
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+E GGET FP + V P++G + F + T++ ++H
Sbjct: 155 YLNDVEAGGETYFP---------------KLNFYVAPKKGMAVYFEYFYNDTTLNELTLH 199
Query: 203 GSCPVVKGEKWVATKWIR 220
G PVV G+KW AT+W+R
Sbjct: 200 GGAPVVIGDKWAATQWMR 217
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 100/204 (49%), Gaps = 21/204 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR N T E+C+S+ ++ + L + G RT++ ++ +
Sbjct: 88 PRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESSTRTNTAAWLEY--HQG 145
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----SQRVA 138
+ +E +AKVT NGE IL Y+ Q++ H+D FDP P+ R+A
Sbjct: 146 PVVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFDPATDPPENFEPGGNRLA 205
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ ++YL + EEGGET D+ K I KVKP G +LFY L P+G++D
Sbjct: 206 TAIIYLQNAEEGGET--------------DFMK-IDTKVKPEAGSAVLFYDLKPDGSVDK 250
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQ 222
+IH P GEKWVATKWI ++
Sbjct: 251 LTIHSGNPPKGGEKWVATKWIHER 274
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 98/199 (49%), Gaps = 23/199 (11%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
N +CK++I MAK L PSTL + D R S G+F E++ + ++
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTL-VDPMSGRDVVSDKRASWGMFFRLCEND--LVARLD 163
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVYL 144
+++ + LP NGE ++L Y G H+D P ++S QRV++ + YL
Sbjct: 164 RRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTYL 223
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
D EGG+T+FP +GL V P +G+ F NG +D S+H S
Sbjct: 224 NDAPEGGQTVFPQ---------------LGLAVSPIRGNACYFEYCDGNGRVDARSLHAS 268
Query: 205 CPVVKGEKWVATKWIRDQE 223
PV +G+KWV TKW+R++
Sbjct: 269 APVTRGDKWVMTKWMRERR 287
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 99/199 (49%), Gaps = 35/199 (17%)
Query: 35 EQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLI 88
E+C +I R S L++ TVD G R+S G F D+ + +
Sbjct: 109 EECDELI-------RRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADD--FIARL 159
Query: 89 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRVASFLVY 143
+ +IA++ P NGE +L Y G +Y H+D F P + G + QRV++ L+Y
Sbjct: 160 DRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIY 219
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
L D+ +GG T+FP +GL+V PR+G + F +G +DP ++HG
Sbjct: 220 LNDVAQGGATVFP---------------TLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHG 264
Query: 204 SCPVVKGEKWVATKWIRDQ 222
PV KGEKW+ TKW+R +
Sbjct: 265 GEPVEKGEKWIITKWMRQR 283
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 99/205 (48%), Gaps = 32/205 (15%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA----AEDESG 83
+ TPE C+++I + + LRP+T+ D G + G +S D+
Sbjct: 16 WAGLLTPENCQNLIAIGQSLLRPATV-------TDEQTGQEVAHGERVSEMAWPKRDDHP 68
Query: 84 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---QKSQRVASF 140
L + E IA++T +P E IL Y+ G +Y HYDAF P Q R +
Sbjct: 69 ILQSLAEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAAD--APTLRQGGNRQGTL 126
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 200
++YL +EEGGET FP +GL+V P G G+ F +L G P S
Sbjct: 127 ILYLNAVEEGGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLS 171
Query: 201 IHGSCPVVKGEKWVATKWIRDQEQY 225
+H PV KGEKW+AT+WIR QE Y
Sbjct: 172 LHAGLPVRKGEKWIATQWIR-QEAY 195
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 99/204 (48%), Gaps = 28/204 (13%)
Query: 24 RALYFPNFATPEQCKSIINMAKLNLRPSTL----ALRKGETVDNTQGIRTSSGVFISAAE 79
R F NFA+ ++C + + L + A R E R S+ ++
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKLSRAVAWTDGAFRPVE-------FRISTAAWLQPDH 357
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVAS 139
D+ T + +IA T L EA + Y IG Y +HYD +E + R+A+
Sbjct: 358 DDVVTN--LHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRIAT 415
Query: 140 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 199
F++YL +E+GG T FP +G V+P GD + +Y+LLP+G D
Sbjct: 416 FMIYLNQVEQGGYTAFPR---------------LGAAVEPGHGDAVFWYNLLPDGESDNN 460
Query: 200 SIHGSCPVVKGEKWVATKWIRDQE 223
++HG+CPV++G KWVA KWI +++
Sbjct: 461 TLHGACPVLQGSKWVANKWIHEKK 484
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 100/205 (48%), Gaps = 23/205 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + +C +++ +A+ L S + + +N RTS G E
Sbjct: 90 PSIRLYQHLLSDAECDALVELARGRLARSPV-INPDTGDENLIDARTSMGAMFQVGEHT- 147
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRV 137
+ IE++IA V +P +GE IL YK G +Y H+D F+P+ G + QR
Sbjct: 148 -LIQRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRT 206
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ ++YL + GG T FP IGL+V P +G+ + F L P+G +D
Sbjct: 207 ATLVIYLNTPQAGGATAFP---------------RIGLEVAPVKGNAVYFSYLQPDGKLD 251
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQ 222
++H PV GEKW+ATKW+R+
Sbjct: 252 ERTLHAGLPVQSGEKWIATKWLREH 276
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 107/202 (52%), Gaps = 20/202 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LSW PR + F + +C +++M + N+ S+LA T G R SS I
Sbjct: 63 LSWHPRIFLYEGFLSDMECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI--- 110
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
E + IE++I+ + LP+ NGE+ +L+Y + + + + + R+A
Sbjct: 111 --EDIVVSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLA 163
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDY-QKCIGLKVKPRQGDGLLFYSLLPNGTID 197
+ L+YL+D+++GGET+FP +A +C G V+P +G+ +L ++L P+G D
Sbjct: 164 TILMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETD 223
Query: 198 PTSIHGSCPVVKGEKWVATKWI 219
S + CPV++GEKW+A K I
Sbjct: 224 KDSQYEECPVLEGEKWLAIKHI 245
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 100/200 (50%), Gaps = 25/200 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + +C +I ++ L+ S + + + IRTSSGVF E
Sbjct: 36 PLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDR-----SVNSIRTSSGVFCEQTE--- 87
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
T+ IE++I+++ +P +G+ +LRY GQ+Y HYD F + + R+++ ++
Sbjct: 88 -TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFF-AETSRASTNNRISTLVM 145
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+E+GGET+FP + L V P +G + F N ++ ++H
Sbjct: 146 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYRNQEVNEFTLH 190
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
V+ GEKWVAT W+R Q
Sbjct: 191 AGAQVIHGEKWVATMWMRRQ 210
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + N + +C +I ++ L+ S + GE + IRTSSGVF E
Sbjct: 41 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-GSVNSIRTSSGVFCEQTE--- 92
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
T+ IE++I+++ +P +G+ +LRY GQ+Y HYD F + + R+++ ++
Sbjct: 93 -TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFF-AETSRASTNNRISTLVM 150
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+E+GGET+FP + L V P +G + F N ++ ++H
Sbjct: 151 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYSNQELNDFTLH 195
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
V+ GEKWVAT W+R Q
Sbjct: 196 AGTQVIHGEKWVATMWMRRQ 215
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 105 bits (263), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 102/199 (51%), Gaps = 30/199 (15%)
Query: 35 EQCKSIINMAKLNLRPSTLA---LRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEK 91
E+C +I A L+ ST+ K ET+ + R+S G F D+ + ++ +
Sbjct: 107 EECDELIRRAAAKLQRSTIVDPTTGKHETIAD----RSSEGTFFEINADD--FIARLDRR 160
Query: 92 IAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQRVASFLVYLTD 146
I+ + LP +GE IL Y G +Y H+D F P + G QRV++ ++YL +
Sbjct: 161 ISALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVSTLVMYLNE 220
Query: 147 LEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 206
+E+GG T+FP +GL V P++G + F G +DP ++HG P
Sbjct: 221 VEDGGATIFP---------------ELGLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAP 265
Query: 207 VVKGEKWVATKWIRDQEQY 225
V++GEKW+ TKW+R Q +Y
Sbjct: 266 VLRGEKWIVTKWMR-QRRY 283
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/213 (33%), Positives = 105/213 (49%), Gaps = 39/213 (18%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 76
PR + F + E+C +++++A RP L + ETV N G RTS G+F
Sbjct: 108 PRVMVFGGLLSDEECDAMVDLA----RPR---LARSETVHNGSGGSEVNAARTSDGMFFD 160
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 131
E IE++IA + P NGE +LRY+ G +Y +H+D FDP + G
Sbjct: 161 --RGEFPLCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILK 218
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF-YSL 190
+ QRV + ++YL GG T FP +GL+V P +G+ + F Y
Sbjct: 219 RGGQRVGTVVMYLNHPIRGGGTAFP---------------DVGLEVAPFKGNAVFFSYDR 263
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
T ++H PV++GEKWVATKW+R+ E
Sbjct: 264 AHPMT---RTLHAGTPVLEGEKWVATKWVREGE 293
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/212 (33%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + +C ++I +A RP L + TVDN G RTS G+ +
Sbjct: 95 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHLVHAARTSDGMCL 147
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILL 205
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 27/215 (12%)
Query: 16 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSG 72
+VLS + P A++ NF + ++C+ +I +A+ L R + + G V T R+S G
Sbjct: 93 VRVLSRLQRPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATH--RSSHG 150
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQ 127
+F E + + IE +IA++T P NGE +L Y+ G + H D +
Sbjct: 151 MFFRLGE--TPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANR 208
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
E + QR+ + L+YL D+E GGET+FP +G + P++G L F
Sbjct: 209 ESIARSGQRMGTLLMYLKDVEGGGETVFP---------------QVGWSIVPQRGHALYF 253
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
G DP+S+H S P+ G+KWVATKWIR +
Sbjct: 254 EYGNRYGMCDPSSLHASTPLRTGDKWVATKWIRTR 288
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 98/179 (54%), Gaps = 20/179 (11%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
LS PRA + F + +C ++++AK ++ S +A DN G RTSSG
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVA-------DNDSGKSVASQARTSSG 90
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F++ EDE + IE+++A T LP N E+ +LRY+ GQKY++H+D F +
Sbjct: 91 TFLAKREDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKL 148
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
QRVA+ L+YLTD+++GGE +FP +A+GS+ K R G +F+S L
Sbjct: 149 GGQRVATVLMYLTDVKKGGEAVFP-----DAEGSHLQYKDETWSDCSRSGLAGIFFSEL 202
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 112/232 (48%), Gaps = 29/232 (12%)
Query: 1 MPHGQAGDDSV----TNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-L 55
+PH ++ V + + + P +F +PE+C ++I+ A L+ S +
Sbjct: 53 IPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQKLKASRVVDP 112
Query: 56 RKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQ 115
G V+++ TS+G E + IE +IA + P +GE +LRY+ G
Sbjct: 113 EDGSFVEHSARTSTSTGYH----RGEIDIIKTIEARIADLINWPVDHGEGLQVLRYEDGG 168
Query: 116 KYNSHYDAFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 170
+Y H+D FDP + Q QRV +FL+YL++++ GG T FP
Sbjct: 169 EYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFP-------------- 214
Query: 171 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ +++P +G L F + I+P ++H PV +G K++ATKW+R++
Sbjct: 215 -NLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKYLATKWLREK 265
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + E+C ++I +A RP L + TVDN G RTS + +
Sbjct: 105 LPRVVVLGGFLSDEECDALIALA----RPR---LARSRTVDNANGEHVVHAARTSDSMCL 157
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 158 RLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLV 215
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 216 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 260
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ G+KWVATKW+R++
Sbjct: 261 RPHPMT--RSLHAGAPVLAGDKWVATKWLRER 290
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + E+C ++I +A RP L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGFLSDEECDALIALA----RPH---LARSRTVDNANGEHVVHAARTSDSMCL 147
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLV 205
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ G+KWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 95/209 (45%), Gaps = 37/209 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFIS 76
PR + F + +C I+ +A L + TVD G RTS G+F +
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLA-------RSHTVDTATGASEVNAARTSDGMFFT 154
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP----- 131
E E +IA + P NGE +L Y+ G +Y HYD FDP + G
Sbjct: 155 RGEHP--VCARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLR 212
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ QRVA+ + YL GG T FP IGL+V P +G + F
Sbjct: 213 RGGQRVATLVTYLNTPTRGGGTTFP---------------DIGLEVTPLKGHAVFFSYDR 257
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+ + S+HG PV++G+KWVATKW+R
Sbjct: 258 PHPST--RSLHGGAPVLEGDKWVATKWLR 284
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 85/155 (54%), Gaps = 25/155 (16%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
IRTS VF+ +E GT+ I ++IA +T L + E ++ Y IG +Y H+D
Sbjct: 346 IRTSQSVFL----EEVGTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT--- 398
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
G + ++R A+FL+Y++D+E GG T+F +G+ VKP +G +
Sbjct: 399 ---GDEVNERTATFLIYMSDVEVGGATVFT---------------NVGVAVKPEKGSAVF 440
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+Y+L NG +D + H CPV+ G KWVA KWI +
Sbjct: 441 WYNLHKNGELDLKTKHAGCPVLVGNKWVANKWIHE 475
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 107/208 (51%), Gaps = 19/208 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
L+W PR + F + +C ++ +A+LN+ S L + T + A
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNITQNS---TDARFKFQLA 119
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---Q 135
+ + + IE++I+ + +P+ +GE+ IL+Y Q D + G Q S
Sbjct: 120 DSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS--------DHNKDGTQSSSGGN 171
Query: 136 RVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+ + L+YL+D+++GGET+FP ++ +G+ +C G VKP +GD +L ++L P
Sbjct: 172 RLVTILMYLSDVKQGGETVFPRSELKDTQAKEGAL--SECAGYAVKPVKGDAILLFNLRP 229
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+G D S + C V++GEKW+A K +
Sbjct: 230 DGVTDSDSHYEDCSVLEGEKWLAIKHLH 257
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + E+C ++I +A+ +L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGFLSDEECDALIALAQPHLA-------RSRTVDNANGEHVVHAARTSDSMCL 147
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLV 205
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ G+KWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 85/155 (54%), Gaps = 25/155 (16%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
IRTS VF+ +E GT+ I ++IA +T L + E ++ Y IG +Y H+D
Sbjct: 364 IRTSQSVFL----EEVGTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT--- 416
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
G + ++R A+FL+Y++D+E GG T+F +G+ VKP +G +
Sbjct: 417 ---GDEVNERTATFLIYMSDVEVGGATVFT---------------NVGVAVKPEKGSAVF 458
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+Y+L NG +D + H CPV+ G KWVA KWI +
Sbjct: 459 WYNLHKNGELDLKTKHAGCPVLVGNKWVANKWIHE 493
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 105/200 (52%), Gaps = 25/200 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P L N + +C +I++A ++ + + G + D ++ +RTSS +F +E+E
Sbjct: 32 PLILILDNVLSWAECDLLIDLASARMQRAKI----GSSHDVSE-VRTSSSMFFEESENE- 85
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ +E ++A++ +P + E +LRY+ G++Y+ H+D F G + R+++ ++
Sbjct: 86 -CIGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQ---GSSMNNRISTLVM 141
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+EEGGET FP + V P++G + F + ++ ++H
Sbjct: 142 YLNDVEEGGETYFP---------------SLHFSVTPKKGSAVYFEYFYNDTRLNELTLH 186
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
PV GEKWVAT+W+R Q
Sbjct: 187 AGHPVEAGEKWVATQWMRRQ 206
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 90/185 (48%), Gaps = 22/185 (11%)
Query: 36 QCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKV 95
+C + M+ LRPST+ L +RTS G +S E E + ++ +IA
Sbjct: 142 ECAYLQQMSAPRLRPSTI-LDPQTGARRPDPVRTSVGAALSPVE-EDLVVGMLNRRIAAA 199
Query: 96 TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMF 155
T R+ GE +ILRY Q+Y H+DA E +QR + +VYLT EGGET F
Sbjct: 200 TGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE-----NQRSHTLIVYLTADYEGGETAF 254
Query: 156 PFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVA 215
P +G +++ RQGD LLF +L +G D H P G KW+A
Sbjct: 255 PE---------------LGFRLRGRQGDALLFANLREDGRPDLRMRHAGLPATSGAKWIA 299
Query: 216 TKWIR 220
T+WIR
Sbjct: 300 TRWIR 304
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/148 (39%), Positives = 89/148 (60%), Gaps = 10/148 (6%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 77
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+DE + IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADG 165
A+ L+YL+++E+GGET+FP NA+G
Sbjct: 155 ATVLMYLSNVEKGGETIFP-----NAEG 177
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 101/201 (50%), Gaps = 35/201 (17%)
Query: 34 PEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFISAAEDESGTLDL 87
P++C+ +I +A+ L PST TVD G R+S G+F E+ +
Sbjct: 110 PQECEELIALARPRLAPST-------TVDPLSGRDLVGEQRSSLGMFFRLREN--AFIAR 160
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLV 142
++++++++ LP NGE +L Y G + H+D P + S QRV++ +
Sbjct: 161 LDQRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVS 220
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL ++EEGGET+FP +C G V PR+G + F G +D S+H
Sbjct: 221 YLNEVEEGGETIFP--------------EC-GWSVPPRRGSAVYFEYCNSLGQVDHASLH 265
Query: 203 GSCPVVKGEKWVATKWIRDQE 223
PV+ GEKWVATKW+R +
Sbjct: 266 AGGPVLHGEKWVATKWMRQRR 286
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 96/176 (54%), Gaps = 12/176 (6%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 73
+V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG D +RTSSG+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD----VRTSSGM 113
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+D F +
Sbjct: 114 FVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRG 173
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
QRVA+ L+YLTD GGET FP E A + C+ Q G LF S
Sbjct: 174 GQRVATMLMYLTDGVVGGETHFPQEMESAAVEETWSKDCV-----LSQTKGTLFSS 224
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 99/200 (49%), Gaps = 35/200 (17%)
Query: 35 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFISAAEDESGTLDLI 88
E+C+++I +A+ L PST +VD G R+S G+F E+ + +
Sbjct: 111 EECEALIALARPRLAPST-------SVDPLTGRNRLGAQRSSLGMFFRLREN--AFVARL 161
Query: 89 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVY 143
+E+++++ LP NGE +L Y G + H+D P Q S QRV++ + Y
Sbjct: 162 DERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAY 221
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
L ++EEGGET+FP G V P++G + F G +D S+H
Sbjct: 222 LNEVEEGGETVFPE---------------TGWSVSPQRGGAVYFEYCNSLGQVDHASLHA 266
Query: 204 SCPVVKGEKWVATKWIRDQE 223
PV+ GEKWVATKW+R +
Sbjct: 267 GAPVLSGEKWVATKWMRQRR 286
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 105/220 (47%), Gaps = 25/220 (11%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 65
A D V + F VL P+ + N + E+C +II ST+ + +
Sbjct: 68 AAPDRVAEVLF-VLK-QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHE 125
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 125
G RTS FI E E + IE ++A + P E F + +Y Q+Y HYD D
Sbjct: 126 G-RTSEMAFIQRGEAE--VAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLD 182
Query: 126 PQEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
P G + QR+A+F++YL+D+E+GG T+FP +GL+V P+
Sbjct: 183 PDSSGHRSHLARGGQRLATFILYLSDVEQGGGTVFP---------------GLGLEVYPK 227
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+G L F + N D ++HG PVV+G K +A KW+R
Sbjct: 228 KGSALWFLNTDINHQPDKRTLHGGAPVVRGTKIIANKWLR 267
>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
Length = 235
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 104/217 (47%), Gaps = 26/217 (11%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P +L P + T +C +I++A+ L+PS + G + T +RTS +
Sbjct: 39 PSNLLHDNPFIGCYEKVVTQTECHQLIDLARHGLQPSKVI---GNSEQKTSAVRTSDTIG 95
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEY 129
E TL + + +IA + LP E I RY++G K+N+H+D F+P + Y
Sbjct: 96 FQHHLTEL-TLQICK-RIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMY 153
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
+ QR+ + L+YL ++ GGET FP N ++V P +G L+F +
Sbjct: 154 LSENGQRIITALLYLNNVSAGGETSFPLLN---------------IQVAPSEGTLLVFEN 198
Query: 190 LLPNGT-IDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
N SIH C V +GEKW+AT W ++ QY
Sbjct: 199 CKKNSNERHALSIHEGCAVHEGEKWIATLWFHEKSQY 235
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 25 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHLVHAARTSDSMCL 77
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 78 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILL 135
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 136 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 180
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 181 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 210
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 110/217 (50%), Gaps = 26/217 (11%)
Query: 15 PF--QVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVDNTQGIRTSS 71
PF ++LS P + + + TP + ++ N++K L R + + + + RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 371
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD----PQ 127
V++ A E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ P+
Sbjct: 372 SVWL--ASHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQAPE 429
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
G R+A+ L YL+D+ +GG T+FP N + V+PRQGD LL+
Sbjct: 430 HRG--GGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLW 472
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
Y+L G + ++H SCP+++G KW KWI + Q
Sbjct: 473 YNLNDRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 509
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 101/224 (45%), Gaps = 27/224 (12%)
Query: 5 QAGDDSV----TNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
QAG ++ IP PR F T +C +++ +A+ L S + +
Sbjct: 110 QAGRNAAHFAGREIPILFTLAAPRVTLFQQLLTDAECDALVALARGRLARSPV-INPDTG 168
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+N RTS G E ++ IE+ IA VT + GE IL YK G +Y H
Sbjct: 169 DENLIEARTSLGAMFQVGEHP--LIERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPH 226
Query: 121 YDAFDPQEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 175
YD F+PQ G + QRV + ++YL GG T FP +GL
Sbjct: 227 YDFFNPQRPGEARQLKVGGQRVGTLVIYLNSPLAGGATAFPK---------------LGL 271
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+V P +G+ + F +G +D ++H PV GEKW+ATKW+
Sbjct: 272 EVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKWIATKWL 315
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 115 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHMVHAARTSDSMCL 167
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 168 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILL 225
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 226 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 270
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 271 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 300
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHMVHAARTSDSMCL 147
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILL 205
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|410637601|ref|ZP_11348175.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
gi|410142794|dbj|GAC15380.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
Length = 280
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 100/197 (50%), Gaps = 25/197 (12%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
NF T ++C++++ + K LRPS + R+G D +G RTSS + +D I+
Sbjct: 92 NFLTAQECEALVALTKSKLRPSEIPEREG---DQYKGFRTSSTCDLPFTKDPLA--HEID 146
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
+KI L E Y IGQ++ +H D F P + Y QR +F++YL
Sbjct: 147 QKIVDALGLGVGEKEVIQAQHYAIGQEFKAHCDYFVPGSKDFKTYSKDGGQRTWTFMIYL 206
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
+L EGGET F +G+K KP+QG L++ +L +G+I+ ++H +
Sbjct: 207 NELCEGGETEFV---------------KLGIKFKPKQGTALVWNNLHEDGSINEDTLHHA 251
Query: 205 CPVVKGEKWVATKWIRD 221
P+ GEK V TKW R+
Sbjct: 252 HPIESGEKVVITKWFRE 268
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 105/202 (51%), Gaps = 20/202 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LSW PR + F + +C +++ + N+ S+LA T G R SS I
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNM-DSSLAF--------TDGDRNSSYNNI--- 356
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
E + IE++I+ + LP+ NGE +L+Y + ++ + + +A
Sbjct: 357 --EDIVVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLA 409
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDY-QKCIGLKVKPRQGDGLLFYSLLPNGTID 197
+ L+YL+D+++GGET+FP +A +C G V+P +G+ LL ++L P+G ID
Sbjct: 410 TILIYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNALLLFNLRPDGEID 469
Query: 198 PTSIHGSCPVVKGEKWVATKWI 219
S + CPV++GEKW+A K I
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHI 491
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 108/215 (50%), Gaps = 27/215 (12%)
Query: 17 QVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGV 73
+VLS + P A++ +F + ++C+ +I +A+ L ST+ G V G R+S G+
Sbjct: 94 RVLSRLQRPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNV--VAGHRSSHGM 151
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQE 128
F E + + IE +IA +T P NGE +L Y+ G + H D +E
Sbjct: 152 FFRLGE--TPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRE 209
Query: 129 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
+ QR+ + L+YL D+E GGET+FP IG V P++G L F
Sbjct: 210 SIARSGQRMGTLLMYLKDVEGGGETVFP---------------QIGWSVAPQRGHALYFE 254
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
G DP+S+H S P+ G+KWVATKWIR +
Sbjct: 255 YGNRFGLCDPSSLHASTPLRVGDKWVATKWIRTRR 289
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFI 75
+PR + F + +C ++I +A+ L + TVDN G RTS + +
Sbjct: 95 LPRVMVLGGFLSDAECDAMIALAQPRLA-------RSRTVDNANGAHVVHAARTSDSMCL 147
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P NGE +LRY G +Y HYD FDP G
Sbjct: 148 QLGQD--ALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLL 205
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL + GG T FP + L + +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPDRGGATRFPD---------------VHLDIAAIKGNAVFFSYD 250
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 84/155 (54%), Gaps = 25/155 (16%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
IRTS VF+ DE GT+ I ++IA +T L + E ++ Y IG +Y H+DA
Sbjct: 91 IRTSQSVFL----DEVGTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDA--- 143
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
G ++R A+FL+Y++D+E GG T+F +G+ VKP +G +
Sbjct: 144 ---GGDVNERTATFLIYMSDVEVGGATVFT---------------NVGVAVKPEKGSAVF 185
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+ +L NG +D + H CPV+ G KWVA KWI +
Sbjct: 186 WNNLHKNGELDLKTKHAGCPVLVGNKWVANKWIHE 220
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 104/199 (52%), Gaps = 30/199 (15%)
Query: 30 NFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 88
N + ++C+ +I +A+ L R T+ + VD RTS G+F + +E + I
Sbjct: 102 NILSTQECEELIALARPRLQRALTVDSEGRQQVDRR---RTSEGMFFTL--NEVPLVGRI 156
Query: 89 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE--YGPQKS---QRVASFLVY 143
E+++A + +P +GE IL Y GQ+Y H+D FDP++ YG + QR+AS ++Y
Sbjct: 157 EQRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMY 216
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
L GG T FP +GL V R+G + F G DP+S+H
Sbjct: 217 LNTPARGGGTAFP---------------ELGLTVTARRGSAVYFA--YEGG--DPSSLHA 257
Query: 204 SCPVVKGEKWVATKWIRDQ 222
PV+ GEKW+ATKW+R++
Sbjct: 258 GLPVLDGEKWIATKWLRER 276
>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
Length = 239
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 109/208 (52%), Gaps = 23/208 (11%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LS P + +FA + C+ +I A+ +L + + R+G IR +S +++A
Sbjct: 31 LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEVQTRRGSAART--AIRRASSCWLAAR 88
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYK--IGQKYNSHYDAFDPQEYGPQKS-Q 135
DE+ L+ +E+ I P E F+++RY+ G++Y +H DAF+ ++ Q
Sbjct: 89 GDEA--LEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQ 146
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+ + L+YL+D+ GG T+FP +GL V PR+G L+F ++ + T
Sbjct: 147 RLTTALLYLSDVGAGGATVFP---------------ALGLSVAPRRGRLLVFANVADDTT 191
Query: 196 IDPTSIHGSCPVV-KGEKWVATKWIRDQ 222
+D ++H P+ EKW+A KW+R++
Sbjct: 192 VDARTVHAGEPIAGDTEKWIANKWVRER 219
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 108/210 (51%), Gaps = 34/210 (16%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL--ALRKGETVDNTQGIRTSSGVFIS 76
L+W PR + F + +C ++ +A+LN+ S L A + T ++T I S
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNITQNSTDDIVVSK----- 117
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-- 134
IE++I+ + +P+ +GE+ IL+Y Q D + G Q S
Sbjct: 118 -----------IEDRISLWSFIPKEHGESMQILKYGSNQS--------DHNKDGTQSSSG 158
Query: 135 -QRVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
R+ + L+YL+D+++GGET+FP ++ +G+ +C G VKP +GD +L ++L
Sbjct: 159 GNRLVTILMYLSDVKQGGETVFPRSELKDTQAKEGAL--SECAGYAVKPVKGDAILLFNL 216
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
P+G D S + C V++GEKW+A K +
Sbjct: 217 RPDGVTDSDSHYEDCSVLEGEKWLAIKHLH 246
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 108/211 (51%), Gaps = 20/211 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG---IRTSSGVFISAAE 79
PR Y NF + ++ ++ ++ PST K + RTS F
Sbjct: 2 PRVFYVHNFLSADEADELV---AFSMAPSTGGTHKAWNQGGSNAKLTTRTSMNAF-DITT 57
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE-----YGPQK- 133
S + ++ ++ + ILRY++GQ Y +H+D F ++ + P K
Sbjct: 58 KLSFRIKRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPSKG 117
Query: 134 -SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD---YQKCIG-LKVKPRQGDGLLFY 188
S R A+ +YL+D+E GG+T+ + G++A GS++ +C L V PR+GD +LFY
Sbjct: 118 GSNRFATIFLYLSDVEVGGQTLEK-DAGVDA-GSWEDKLVDQCYSKLAVPPRRGDAILFY 175
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
S P+G +DP S+HG+CP++KG KW A W+
Sbjct: 176 SQYPDGHLDPNSLHGACPILKGTKWGANLWV 206
>gi|90022913|ref|YP_528740.1| hypothetical protein Sde_3273 [Saccharophagus degradans 2-40]
gi|89952513|gb|ABD82528.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 478
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 108/221 (48%), Gaps = 41/221 (18%)
Query: 23 PRALYFPN----------------FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
PR ++ PN F T E+C+ II + LRPS L+ ++ + +
Sbjct: 86 PRKIFIPNALKLNSDKLEMYALGEFLTTEECERIIANIRSKLRPSELSSQESD-----KT 140
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS + +D + ++ +I K+ + E Y++GQ++ +H D F+
Sbjct: 141 YRTSRTCDLGTIDDP--FIHYVDSRICKLVGIDPSYSEVIQGQLYEVGQEFKAHTDYFEI 198
Query: 127 QE---YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
+E +G QR + ++YL D+EEGGET FP ADG+ +KPR G
Sbjct: 199 KEMPEHGAVMGQRTYTVMIYLNDVEEGGETDFP-----AADGA----------IKPRAGL 243
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L++ SL NG +P S+H + PV+KG K V TKW R Q +
Sbjct: 244 ALIWNSLQSNGAPNPHSMHQAYPVLKGHKAVITKWFRSQSR 284
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFI 75
+PR + +F + +C ++I +A+ L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGDFLSDAECDALIALAQPRLA-------RSRTVDNDNGAQIVHAARTSDSMCL 147
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 QLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLL 205
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QR+AS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRLASLVMYLNTPERGGATRFPD---------------VHLDVAAVKGNAVFFSYD 250
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 110/217 (50%), Gaps = 24/217 (11%)
Query: 14 IPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG-IRTS 70
+P+ + +S P + + N + + +S+ +A L+P+ + N +G R +
Sbjct: 93 MPYAIEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIA 152
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
FI ++ES I +++ VT L E ++ Y I +Y HYD F P + G
Sbjct: 153 KMAFI--LDEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTF-PAKSG 209
Query: 131 PQKS---QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ R+A+ ++YL+D+E GG T+F I ++V PR+G+ +++
Sbjct: 210 DRSHPSHDRLATAILYLSDVERGGATVF---------------TNINVRVLPRKGNVIIW 254
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
Y+ LP+G + P ++H CPV+ G KW+A KWI+ + Q
Sbjct: 255 YNYLPDGNLHPGTLHAGCPVLVGSKWIANKWIQSKGQ 291
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/227 (31%), Positives = 106/227 (46%), Gaps = 33/227 (14%)
Query: 2 PHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGET 60
PH G+ SV + + P F + ++C ++I +AK L R T+A +
Sbjct: 77 PHAVIGERSVR---VMLAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDGAQQ 133
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
+D RTS G+F E ++ IE +IA + +P +GE +L Y GQ+Y H
Sbjct: 134 IDEH---RTSDGMFFGLGEQP--LIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPH 188
Query: 121 YDAFDPQEYG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 175
D FDP + G QR+AS ++YL + GG T FP IGL
Sbjct: 189 QDWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFPE---------------IGL 233
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
V +G + F + D S+H PV +GEKW+ATKW+R++
Sbjct: 234 TVTALRGSAVCFT----YESGDVFSLHAGLPVTRGEKWIATKWLRER 276
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 105/221 (47%), Gaps = 32/221 (14%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQG 66
D IP V + P + +C +I +A+ L R T+ + +D
Sbjct: 80 DGRTIGIPLSVDA--PALRVLDGLLSERECADLIELARPRLQRALTVDSDGKQQIDQR-- 135
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS G+F A E + + IE+++A++ +P +GE IL Y GQ+Y HYD FDP
Sbjct: 136 -RTSEGMFFRAGE--TPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDP 192
Query: 127 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 181
G K QR+AS ++YL E GG T FP IGL V R+
Sbjct: 193 ALPGYDKLTARAGQRIASVVMYLNTPERGGGTAFP---------------EIGLTVTARR 237
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
G + F G D +S+H PV++GEKW+AT W+R++
Sbjct: 238 GAAVYF--AYEGG--DQSSLHAGLPVLQGEKWIATHWLRER 274
>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 105/213 (49%), Gaps = 15/213 (7%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVD---NTQGIRT 69
+ +VLS PRA NF + + I+ + + L ST A T D +T+ RT
Sbjct: 1 MTLKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRT 60
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTMLPR-INGEAFNILRYKIGQKYNSHYDAFDPQE 128
S ++ ++S +D I + A + ++ + EA ++ Y +GQ+Y +H+D P
Sbjct: 61 SLNTWVY--REKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDI 118
Query: 129 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
+ R + L+YL + EGG T FP +NA+ GL V+P+ G +LFY
Sbjct: 119 DNEYQPARYCTLLLYLNEGMEGGATQFP--RWVNAETRN------GLDVEPKIGKAVLFY 170
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
S LP+G +D S H + PV GEKW+ W D
Sbjct: 171 SQLPDGNMDDWSHHAAMPVRVGEKWLMNLWTWD 203
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 101/206 (49%), Gaps = 30/206 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDE 81
P N E+C+ +I +A+ L R T+A VD RTS G+F + +E
Sbjct: 95 PALRVLENLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR---RTSEGMFFTL--NE 149
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQR 136
+ IE+++A + +P +GE IL Y GQ+Y H+D FDPQ+ G QR
Sbjct: 150 LPLVGRIEQRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQR 209
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
VAS ++YL +GG T FP +GL V R+G + F G
Sbjct: 210 VASVVMYLNTPAQGGGTAFP---------------ELGLTVTARRGAAVYF--AYEGG-- 250
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQ 222
D S+H PV +GEKW+ATKW+R++
Sbjct: 251 DQQSLHAGLPVQRGEKWIATKWLRER 276
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 102/208 (49%), Gaps = 26/208 (12%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF--- 74
+LS P + F NF + E+ +I++ AK ST + + RTSS +
Sbjct: 64 MLSEDPPVIQFNNFISQERIDAILHFAKPKFARSTSGIER-----EVSNYRTSSTAWMLP 118
Query: 75 -ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
+ + L +EE+IA++ LP N E F +L+Y+ Q Y H D + Q P
Sbjct: 119 DVLGNDPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDYIEEQRQQP-C 177
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
RVA+F +YL D+EEGG T FP + L V+P +G+ +L+YS PN
Sbjct: 178 GIRVATFFLYLNDVEEGGGTRFP---------------NLNLTVQPAKGNAVLWYSAYPN 222
Query: 194 GT-IDPTSIHGSCPVVKGEKWVATKWIR 220
T +D + H + PV KG K+ A KWI
Sbjct: 223 TTRMDSRTDHEAMPVAKGMKYGANKWIH 250
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 97/211 (45%), Gaps = 37/211 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + + E+C +++ +++ LR ST TVD G RTS G F
Sbjct: 101 LPRVVVLGGLLSDEECDALVELSRPRLRRST-------TVDAQTGGSQVHADRTSRGTFF 153
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
IE +IA++ P NGE +L Y G ++ HYD FDP E G
Sbjct: 154 ERGAHP--VCATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLL 211
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
Q QRVA+ ++YL GG T FP + L+V +G+ + F
Sbjct: 212 RQGGQRVATVVMYLNTPARGGATTFPDAH---------------LEVAAVKGNAVFFSYD 256
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
P+ ++HG PV +GEKW+ATKW+R+
Sbjct: 257 RPHPMT--RTLHGGAPVTEGEKWIATKWLRE 285
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 112/232 (48%), Gaps = 43/232 (18%)
Query: 15 PFQVLSWMPR------------ALYFPNFA------TPEQCKSIINMAKLNLRPSTLALR 56
P +++S +PR + PN A + E+C +I +++ ++ S + R
Sbjct: 95 PIRLISQLPRFTVADREVELAAVMSNPNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDR 154
Query: 57 K-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQ 115
+ G + +++ +R S G E+E + IE +++ + LP GE IL Y G
Sbjct: 155 ESGGSYESS--VRKSEGSHFERGENE--LVRRIEARLSALVDLPVNRGEPLQILHYGPGG 210
Query: 116 KYNSHYDAFDPQEYGPQ-----KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 170
+Y +H D F+P++ G QR+ + ++YL D+ EGGET FP
Sbjct: 211 EYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFP-------------- 256
Query: 171 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
IG KP +G + F +G +D +H PV++G+KW+ TKW+R++
Sbjct: 257 -DIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKWIMTKWLRER 307
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 55/136 (40%), Positives = 81/136 (59%), Gaps = 5/136 (3%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 77
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D+ + +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 138 ASFLVYLTDLEEGGET 153
A+ L+YL+++E+GGET
Sbjct: 175 ATVLMYLSNVEKGGET 190
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 108/204 (52%), Gaps = 21/204 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LSW PR + F + +C +I+MA + S+L + G +N+QG A
Sbjct: 62 LSWHPRVFLYEGFLSDMECDHLISMAH-GKKQSSLVV-GGSAGNNSQG-----------A 108
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
E + IE++I+ + LP+ GE+ IL+Y++ + S Y+ ++ Q R+
Sbjct: 109 SIEDTIVSTIEDRISVWSFLPKDFGESMQILKYEVNK---SDYNNYESQSSSGH--DRLV 163
Query: 139 SFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+ L+YL+D++ GGET FP G + + +C G V+P +G+ +L ++L P+G I
Sbjct: 164 TVLMYLSDVKRGGETAFPRSELKGTKVELAAP-SECAGYAVQPVRGNAILLFNLKPDGVI 222
Query: 197 DPTSIHGSCPVVKGEKWVATKWIR 220
D S + C V++GE+W+A K I
Sbjct: 223 DKDSQYEMCSVLEGEEWLAIKHIH 246
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGLLSDDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 147
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 148 QPGQD--ALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 205
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAAVKGNAVFFSYD 250
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 280
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+P + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 105 LPCVVVLGGFLSGGECDALIALA----RPR---LARSRTVDNANGEHVVHAARTSDSMCL 157
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 158 RVGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLL 215
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 216 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 260
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 261 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 105/218 (48%), Gaps = 29/218 (13%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ +S PR N T E+C ++++A ++L G RT+ ++
Sbjct: 75 IETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVESTTRTNKQAWL 134
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D+ + +E+KIAK+T GE +L Y Q++ H+D FDP P+ +
Sbjct: 135 DFQQDD--VVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPENYE 192
Query: 136 ----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R+ + +VYL EEGGET F N LK+ +GD ++FY+ L
Sbjct: 193 KGGNRLITVIVYLQAAEEGGETHFGAAN---------------LKLTAAKGDAVMFYN-L 236
Query: 192 PNGT--IDPT-----SIHGSCPVVKGEKWVATKWIRDQ 222
+G IDPT ++H P +KGEKWVATKWI ++
Sbjct: 237 KHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHER 274
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+P + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 105 LPCVVVLGGFLSGGECDALIALA----RPR---LARSRTVDNANGEHVVHAARTSDSMCL 157
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 158 RVGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLL 215
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 216 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 260
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 261 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 100/212 (47%), Gaps = 21/212 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I ++ P + F + +P + + +A+ L +T+ K D+ RTS G
Sbjct: 294 IKMEIRLLNPFIIVFHDVLSPREIDELQKLARPLLERTTVVKFKKYEKDSR---RTSKGT 350
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQ 132
+I D + IE +I + L E F ++ Y +G Y +H D D +
Sbjct: 351 WIE--RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDTWADKKE 408
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
+ R+A+ L YLTD+E+GG T+F N V P++G L +Y+L
Sbjct: 409 EDDRIATVLFYLTDVEQGGATVFTILNQ---------------AVSPKRGTALFWYNLHR 453
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NGT D ++HG CPV+ G KW+ T WIR++ Q
Sbjct: 454 NGTGDTRTLHGGCPVLVGSKWIMTLWIRERMQ 485
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 117 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 169
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 170 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 227
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 228 QHGGQRVASLVMYLNTPERGGATRFPD---------------VHLDVAAVKGNAVFFSYD 272
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 273 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 302
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 97 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 149
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 150 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 207
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 208 QHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAAVKGNAVFFSYD 252
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 253 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 282
>gi|332187533|ref|ZP_08389270.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
gi|332012462|gb|EGI54530.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
Length = 228
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 28/200 (14%)
Query: 27 YFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLD 86
Y +F TP QC ++I M N RPSTL + + G RTS ++ E +
Sbjct: 47 YQADFLTPAQCDALIAMIDANRRPSTLLSDRPDY-----GFRTSESCDMNRWSPE---VQ 98
Query: 87 LIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRVASFL 141
I+E IA++ +P GE RY GQ++ +H+D F E +K QR + +
Sbjct: 99 PIDESIAQLLGIPPEQGETMQGQRYAPGQQFRAHHDYFHESESYWEKVKVHGGQRTWTAM 158
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL D+ EGG T FP G++V PR+G L + ++L +G+ + ++
Sbjct: 159 IYLNDVPEGGATWFP---------------QAGIRVAPRRGLLLAWNNMLLDGSPNDATL 203
Query: 202 HGSCPVVKGEKWVATKWIRD 221
H PVV+G K+V TKW R+
Sbjct: 204 HEGMPVVEGVKYVITKWFRE 223
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 98/199 (49%), Gaps = 30/199 (15%)
Query: 30 NFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 88
N + +C +I +A+ L R T+ + VD RTS G+F + DE + I
Sbjct: 102 NILSARECDELIALARPRLQRALTVDSEGRQQVDRR---RTSEGMFFTL--DEVPLVGRI 156
Query: 89 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRVASFLVY 143
E ++A + +P +GE IL Y GQ Y H+D FDP + G + QR+AS ++Y
Sbjct: 157 ERRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASVVMY 216
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
L GG T FP +GL V R+G + F G D +S+H
Sbjct: 217 LNTPARGGGTAFP---------------ALGLTVTARRGAAVYFA--YEGG--DCSSLHA 257
Query: 204 SCPVVKGEKWVATKWIRDQ 222
PV++GEKW+ATKW+R++
Sbjct: 258 GLPVLEGEKWIATKWLRER 276
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 98/215 (45%), Gaps = 23/215 (10%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSG 72
+ Q +S PRA + + ++C ++I A+ L S + G+ V N S
Sbjct: 123 VSVQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCSCA 182
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F E+ S I E+ A++ P + E RY G+++ H D F
Sbjct: 183 SF--PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLNND 240
Query: 133 K-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
K R+A+ L+YL ++E GG T FP G +V+P++G L F
Sbjct: 241 KIMGSSGHRIATVLLYLNEVEAGGATFFPNP---------------GFEVRPQKGGALYF 285
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+G++DPTS+H C V +GEKW+AT W R++
Sbjct: 286 AYQQADGSMDPTSLHEGCAVTQGEKWIATLWFRER 320
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 113/213 (53%), Gaps = 24/213 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF++ + P+ + F + + + + + +AK L +T+A ++ + ++ + S
Sbjct: 319 PFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSS 378
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F ++ T+ I +++A +T L E ++ Y +G +Y+ H+D F +G
Sbjct: 379 WF---PDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFF---HWGKL 432
Query: 133 KS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
K R+A+ L Y++D+ GG T+FP +G+ ++ R+G +Y+L
Sbjct: 433 KEVNRIATVLFYMSDVSIGGATVFP---------------KLGVTLEARKGTAAFWYNLH 477
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G +D +++HG+CPV+ GEKWVA KWIR++ Q
Sbjct: 478 SSGELDYSTLHGACPVLIGEKWVANKWIRERGQ 510
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 105/210 (50%), Gaps = 27/210 (12%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-RTSSG 72
I +VL P + F + + + + +A+ +L S + + N QG R S+G
Sbjct: 310 IKMEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERSMVVKYRA----NVQGKHRISAG 365
Query: 73 VFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
++ E + L IE +IA + L E F ++ Y IG +Y +H+D F
Sbjct: 366 TWV---ERKYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTV-- 420
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ R+A+ L Y+ D+E+GG T+FP +G V+ ++G+ L +Y++
Sbjct: 421 -EDNRLATVLFYMNDVEQGGATVFP---------------RLGQTVRAKRGNALFWYNMQ 464
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
NGT+D ++HG CP++ G KW+ T+WI D
Sbjct: 465 HNGTVDDRTLHGGCPILVGSKWIFTQWISD 494
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 100/202 (49%), Gaps = 29/202 (14%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTS-SGVFISAAEDESGTL 85
F F T ++C +I+ + L P+ + R G + + +RTS G+F A ED +
Sbjct: 126 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--VRTSDGGIFGPAREDL--VI 181
Query: 86 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVASFLVYL 144
I +IA + GE +LRY +GQ+Y H+D P ++QR + L+YL
Sbjct: 182 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 235
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
+ GGET+FP +GL VK R+GD LLF + G ++H
Sbjct: 236 NEGYAGGETIFPR---------------LGLSVKGRKGDALLFRNTDAQGQAAEAAVHLG 280
Query: 205 CPVVKGEKWVATKWIRDQEQYD 226
PV+ G+KW+ T+WIR +++D
Sbjct: 281 APVMAGQKWLCTRWIR-HDRHD 301
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 84/145 (57%), Gaps = 5/145 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
+ +SW PR + F + +C ++ +AK ++ S +A + G++V + +RTSSG+F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D + IEE+IA T LP+ N E +LRY+ GQKY H+D F + +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 136 RVASFLVYLTDLEEGGETMFPFENG 160
R A+ L+YL+ + EGGET+FP G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 100/198 (50%), Gaps = 29/198 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
NF T E+C ++I+ AK ++ P+ + G RTSS ++ A ED L +
Sbjct: 8 NFLTEEECDALIDSAKDHMTPAPVV---GPGNGEVSVSRTSSTCYL-ARED----LPSVC 59
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
K+ +T P + E + RY+ G+ Y HYDAFD + + QRVA+ LVYL
Sbjct: 60 TKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATVLVYL 119
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
D+E GGET F +G+++KPR+G+ L+F+ +G +D +H +
Sbjct: 120 NDVERGGETSF---------------SKLGVRIKPRKGNALIFFPATLDGVLDQNYLHAA 164
Query: 205 CPVVKGEKWVATKWIRDQ 222
P V KWV+ WIR +
Sbjct: 165 EPAVD-PKWVSQIWIRQR 181
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 100/202 (49%), Gaps = 29/202 (14%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTS-SGVFISAAEDESGTL 85
F F T ++C +I+ + L P+ + R G + + IRTS G+F A ED +
Sbjct: 143 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--IRTSDGGIFGPAREDL--VI 198
Query: 86 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVASFLVYL 144
I +IA + GE +LRY +GQ+Y H+D P ++QR + L+YL
Sbjct: 199 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 252
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
+ GGET+FP +GL VK R+G+ LLF + G ++H
Sbjct: 253 NEGYAGGETIFPR---------------LGLSVKGRKGNALLFRNTDAQGQAAEAAVHLG 297
Query: 205 CPVVKGEKWVATKWIRDQEQYD 226
PV+ G+KW+ T+WIR +++D
Sbjct: 298 APVMAGQKWLCTRWIR-HDRHD 318
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 98/200 (49%), Gaps = 24/200 (12%)
Query: 24 RALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
R F FA+PE+C+ + + K L R + + V+ R S+ ++ D
Sbjct: 339 RLQVFRQFASPEECRHLQHAGKRRLERAVAWTDGRFQPVE----FRISTAAWLQP--DHD 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
+ I +I T + EA I Y +G Y H+D + P +R+A+F++
Sbjct: 393 AIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFD-HSSRGTNP-DGERLATFMI 450
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL +++GG T FP +G V+P GD + +Y+L P+G DP ++H
Sbjct: 451 YLNPVKQGGFTAFPR---------------LGAAVQPGYGDAVFWYNLQPSGVGDPLTLH 495
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
G+CPV++G KWVA KWI ++
Sbjct: 496 GACPVLRGSKWVANKWIHER 515
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 102/215 (47%), Gaps = 24/215 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I ++ S PR + + N T E+ ++ +A+ LR ST+ + T+ R +
Sbjct: 333 IKMELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTK-YRIAKAA 391
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
F+ +E + + + +I VT L E + Y IG Y HYD E QK
Sbjct: 392 FLQNSEHDH--IVKMTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEV--QK 447
Query: 134 S----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
R+A+++ Y++D+E GG T+FP I L + P++G +++
Sbjct: 448 DFGWGNRIATWMFYMSDVEAGGATVFP---------------QINLALWPQKGSAAFWFN 492
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L PNG D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 493 LHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQ 527
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 96/212 (45%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 117 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 169
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 170 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 227
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T P + L V +G+ + F
Sbjct: 228 QHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAAVKGNAVFFSYD 272
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 273 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 302
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 83/145 (57%), Gaps = 5/145 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 75
+ +SW PR + F + +C ++ +AK + S +A + G++V + +RTSSG+F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+D + IEE+IA T LP+ N E +LRY+ GQKY H+D F + +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 136 RVASFLVYLTDLEEGGETMFPFENG 160
R A+ L+YL+ + EGGET+FP G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 96/212 (45%), Gaps = 37/212 (17%)
Query: 22 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 75
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 97 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 149
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 131
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 150 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 207
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QRVAS ++YL E GG T P + L V +G+ + F
Sbjct: 208 QHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAAVKGNAVFFSYD 252
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 253 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 282
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 101/216 (46%), Gaps = 24/216 (11%)
Query: 7 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
G D+V + F+ L+ PR + NF + E+C + A+ P+T+ + V
Sbjct: 83 GADAV--VTFEQLA--PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAH- 137
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
R++ + AA E + +E +I ++T P E + RY GQ Y HYD F
Sbjct: 138 FRSNDSAQLPAAGSE--LVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQ 195
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
Q QR+A+ ++YL E GG T F +G+++ PR+G L
Sbjct: 196 DMVEAQGGQRLATLILYLRAPEAGGATYF---------------ANLGMRIAPRKGSALF 240
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
F P+ + ++HG V+ GEKW+AT+W RD+
Sbjct: 241 F--TYPDPGNNSGTLHGGEAVLAGEKWIATQWFRDR 274
>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
Length = 201
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 106/217 (48%), Gaps = 25/217 (11%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P Q+L+ P +P+ + E C+S+IN+A+ L P+T+ + G V + +R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE----YG 130
+E + I ++IA++ P E + Y G K+ +H D +D QE +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFL 118
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QR+ + ++YL D+ GGET FP + ++V P G L+F +
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163
Query: 191 LPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
P+ +I D S+HGS + GEKW+ T W ++ QY
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
Length = 201
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 106/217 (48%), Gaps = 25/217 (11%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P Q+L+ P +P+ + E C+S+IN+A+ L P+T+ + G V + +R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE----YG 130
+E + I ++IA++ P E + Y G K+ +H D +D QE +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFL 118
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
QR+ + ++YL D+ GGET FP + ++V P G L+F +
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163
Query: 191 LPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
P+ +I D S+HGS + GEKW+ T W ++ QY
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 55/144 (38%), Positives = 81/144 (56%), Gaps = 15/144 (10%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 72
+S PR + +F + ++ +I++A+ L+ S +A DN G +RTSSG
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLSEVRTSSG 106
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
F+ +D ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 107 TFLRKGQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVR 164
Query: 133 KSQRVASFLVYLTDLEEGGETMFP 156
R A+ L+YLTD+ EGGET+FP
Sbjct: 165 GGHRYATVLLYLTDVPEGGETVFP 188
>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
Length = 239
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 93/199 (46%), Gaps = 30/199 (15%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
NF E+CK I+N + L S + K + IR S ++S + + +
Sbjct: 61 NFINKEKCKEIMNNTQNKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYL 144
+KI++ +P N E ++RY GQ YN H+DA E+ + QR + LVYL
Sbjct: 112 QKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVYL 171
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-IDPTSIHG 203
+ EGG T F K + LKVKP GD ++FY L N + P S+H
Sbjct: 172 NNEFEGGHTFF---------------KNLNLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 204 SCPVVKGEKWVATKWIRDQ 222
PV GEKW+A W R++
Sbjct: 217 GMPVTSGEKWIANLWFRER 235
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 112/228 (49%), Gaps = 34/228 (14%)
Query: 4 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKG--ETV 61
G A S +I ++ PR F + + +C ++I ++ L+ S + +G E V
Sbjct: 107 GNAIALSDRDIKVVMVCTAPRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEFV 166
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
D+T RTS G + + + E+ + I+ +IA++T P + E IL Y +G +Y H+
Sbjct: 167 DDT---RTSYGAYFN--KGENSLVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHF 221
Query: 122 DAFDPQEYG---PQKS--QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLK 176
D F+PQ+ G P +S QR+A+ ++YL D+E GG T+FP N L+
Sbjct: 222 DYFEPQQPGLPSPLESGGQRIATVVMYLNDVEAGGGTIFPHLN---------------LE 266
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV---VKGEKWVATKWIRD 221
+PR+G + F L SI C + KW+AT+W RD
Sbjct: 267 TRPRKGGAIYFSYQLAVA----RSIRSRCMAARRIARRKWIATQWFRD 310
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/223 (30%), Positives = 107/223 (47%), Gaps = 29/223 (13%)
Query: 12 TNIPF-QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTS 70
+N+P+ + ++ PR N T ++C+ ++ +A T+ + G RT+
Sbjct: 52 SNLPYLEQINASPRVFRIRNLLTKQECEHLMLLAFRKGLSKTMIMPYGTHKLVESTTRTN 111
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIG-QKYNSHYDAFDPQEY 129
G ++ +D+ + +EE + K+T GE +L Y G Q + HYD FDP
Sbjct: 112 DGAWLDFLQDD--VVRRLEETLGKLTKTTPQQGENLQVLHYSNGAQFFQEHYDYFDPARD 169
Query: 130 GP----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 185
P Q R + +VYL EGGET FP +GLK+ + GD L
Sbjct: 170 PPESFEQGGNRYITVIVYLEAALEGGETHFP---------------ELGLKLTAQPGDAL 214
Query: 186 LFYSLLPN--GT----IDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+FY+L + GT ++ +IH + P V+GEKWVA KWI ++
Sbjct: 215 MFYNLKEHCSGTDPDCVEKKTIHAALPPVRGEKWVAVKWIHEK 257
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 104/211 (49%), Gaps = 21/211 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ ++LS P + F + + + SI N+AK L + + G ++ RT+ G
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKDGSYEEDPA--RTTKGT 331
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
++ + S + + + +T L + + F +L Y IG Y +H+D E G
Sbjct: 332 WLV---ENSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADTEMG-NF 387
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
S R+A+ + YL+D+ +GG T+FP +GL V P++G LL+Y+L
Sbjct: 388 SNRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 432
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G D + H +CP + G +WV TKWI ++EQ
Sbjct: 433 GDGDNRTAHSACPTIVGSRWVMTKWINEREQ 463
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR +++ N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 208 PRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 264
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 265 KHVAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 369
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 109/217 (50%), Gaps = 26/217 (11%)
Query: 16 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 75
+ LS P +Y F P +C+++I++A+ ++ + ++L V +QG RT S ++
Sbjct: 50 METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGSSGV--SQG-RTGSNCWL 106
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---PQ--EYG 130
E+ I E++AK P E ++ Y Q+Y HYDA+D P+
Sbjct: 107 RYQEEPLARR--IGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCT 164
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
Q QR+ + L+YL ++EEGG T FP NA G++V PR+G +F ++
Sbjct: 165 RQGGQRMVTALLYLNEVEEGGATAFP-----NA----------GVEVAPRKGRIAIFNNV 209
Query: 191 LPN-GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+ G P S+HG PV GEKW A+ W R + ++
Sbjct: 210 GADPGRPHPRSLHGGMPVKSGEKWAASIWFRARPAHE 246
>gi|451927223|gb|AGF85101.1| 4-hydroxylase [Moumouvirus goulette]
Length = 239
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 93/199 (46%), Gaps = 30/199 (15%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
NF E+C I+N + L S + K + IR S ++S + + +
Sbjct: 61 NFINKEKCGEIMNNTQSKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYL 144
+KI++ +P N E ++RY GQ YN H+DA E+ + QR + L+YL
Sbjct: 112 QKISQQFNIPIQNAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLIYL 171
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-IDPTSIHG 203
+ EGG T F K +GLKVKP GD ++FY L N + P S+H
Sbjct: 172 NNEFEGGHTFF---------------KNLGLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 204 SCPVVKGEKWVATKWIRDQ 222
PV GEKW+A W R++
Sbjct: 217 GMPVTNGEKWIANLWFRER 235
>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 508
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 108/224 (48%), Gaps = 26/224 (11%)
Query: 9 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMA-KLNLRPSTLALRKGETVDNTQGI 67
D N+ VLS +PR +F + + + ++N+A K L+ ST+
Sbjct: 274 DPTMNMTMTVLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDT 333
Query: 68 RTSSGVFISAAED--------ESGTLDLIEEKIAKV---TMLPRIN------GEAFNILR 110
RTS+ +I +D + L ++E + + + +P E ++
Sbjct: 334 RTSTNDWIPRHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVN 393
Query: 111 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 170
Y++GQ+Y H+D P Q S R A+ L YL D +GGET FP ++AD +
Sbjct: 394 YQVGQQYTPHHDFTMPGLVNMQPS-RFATLLFYLNDDMDGGETAFP--RWLHAD-----E 445
Query: 171 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 214
+ LKVKP +G +LFY+LLP+G D S H + PV +GEKW+
Sbjct: 446 EGGSLKVKPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKWL 489
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 97/209 (46%), Gaps = 26/209 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P NF + ++C++ + M K + + + + D ++ + + F S
Sbjct: 17 PIVYVVNNFLSDDECEAFVEMGKGKMERAKVI-----SDDESEFHASRTNDFCWLEHSAS 71
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRV 137
+ + ++ + + +P N E F ++ Y G +Y H+DAFD Q + QR+
Sbjct: 72 DVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGGQRM 131
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-I 196
+ L YL D+EEGG T FP I + VKP +GD ++F++ + T I
Sbjct: 132 VTALAYLNDVEEGGATDFP---------------KINVSVKPNKGDVVVFHNCIEGTTEI 176
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
+P ++HG PVV GEKW W R+ Y
Sbjct: 177 NPQALHGGSPVVAGEKWAVNLWFRESAIY 205
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ ++LS P + F + + + SI N+AK L + + G D RT+ G
Sbjct: 310 LKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQDGN--DKEDPARTTKGT 367
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
++ + S + + + +T + + F +L Y IG Y +H+D + E G
Sbjct: 368 WLV---ENSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFDFLEDTEMG-HF 423
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
S R+A+ + YL+D+ +GG T FP +GL V P +G LL+Y+L
Sbjct: 424 SDRIATAVFYLSDVPQGGATTFP---------------DLGLSVFPEKGAALLWYNLDHK 468
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G D + H +CP + G +WV TKWI ++EQ
Sbjct: 469 GVGDNRTAHSACPTIVGSRWVMTKWINEREQ 499
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/187 (29%), Positives = 89/187 (47%), Gaps = 38/187 (20%)
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP--------RING--------- 103
+ QG+ T S + + SG ++++ + +KV P R+N
Sbjct: 134 IKELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFN 193
Query: 104 ----EAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLTDLEEGGETMFPF 157
E ++ Y +G Y+ HYD F+ R+A+ L YLTD+E+GG T+FP
Sbjct: 194 LYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP- 252
Query: 158 ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATK 217
I V P++G +++Y+L NG ID ++H +CPV+ G KWV K
Sbjct: 253 --------------NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNK 298
Query: 218 WIRDQEQ 224
WIR++EQ
Sbjct: 299 WIREREQ 305
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 77/134 (57%), Gaps = 15/134 (11%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ +D+ + IE++IA T +P GE +L Y++GQKY H+D F
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 131 PQKSQRVASFLVYL 144
QR+A+ L+YL
Sbjct: 208 KNGGQRIATLLMYL 221
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 103/206 (50%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + + + E+ + +AK LR +T++ ++ Q R + ++S ED
Sbjct: 327 PRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQ-YRITKSAWLSGYEDP- 384
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQEYGPQKS-QRVA 138
+ + +I VT L E + Y IG +Y H+D ++P + + RVA
Sbjct: 385 -VVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVA 443
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+E GG T+FP +G V P++G + +Y+LL +G D
Sbjct: 444 TWLFYMSDVEAGGATVFPE---------------VGAAVYPKKGTAVFWYNLLESGEGDY 488
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 489 STRHAACPVLVGNKWVSNKWIHERGQ 514
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 84/146 (57%), Gaps = 5/146 (3%)
Query: 3 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETV 61
H DD +V+SW PRA + NF T E+C+ +I++AK N+ S++ G++
Sbjct: 66 HTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSK 125
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
D+ +RTSSG F++ D+ + IE++IA + +P +GE +L Y++GQKY HY
Sbjct: 126 DSR--VRTSSGTFLARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHY 181
Query: 122 DAFDPQEYGPQKSQRVASFLVYLTDL 147
D F QR+A+ L+YLTD+
Sbjct: 182 DYFLDDFNTKNGGQRIATVLMYLTDV 207
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 104/207 (50%), Gaps = 25/207 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + N + ++ + II ++K L+ S + + V N RTS +++ + E
Sbjct: 13 PLIVIYHNAISDKEIEQIIQVSKPMLKRSMVGESFSKEVSNE---RTSQNAWLADYDFE- 68
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-EYGPQKS----QRV 137
+ ++ + +T L R + E+ + Y IG Y H+D P K R+
Sbjct: 69 -LVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPYKDMGLGNRI 127
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ + YL+D+E+GG T+FP IG+ V P++G + +Y+LLP+GT D
Sbjct: 128 ATLMYYLSDVEQGGATVFP---------------QIGVGVFPKKGSAIFWYNLLPDGTGD 172
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
++HG+CPV+ G KWVA KWI Q
Sbjct: 173 ERTLHGACPVLLGSKWVANKWIHQYHQ 199
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 106/211 (50%), Gaps = 23/211 (10%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+VL+ P + + A+ + +I +AK + +T+ V N RTS ++
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRISRATIRDDGEPQVSNA---RTSQNAWLD 359
Query: 77 AAEDESGTLDLIEEKIAKVTM-LPRINGEAFNILRYKIGQKYNSHYD-AFDPQEY-GPQK 133
A +D T ++ ++ +T L + + E + Y +G Y +H+D A + Y G +
Sbjct: 360 AGDDRVVTT--LDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLRV 417
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ + YL+D+E GG T+FP +GL V PR+G +L+Y+L N
Sbjct: 418 GNRIATVMFYLSDVEIGGATVFP---------------QLGLAVFPRKGSAILWYNLYRN 462
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G D ++H +CPV+ G KWVA +WI + Q
Sbjct: 463 GKGDRRTLHAACPVLSGSKWVANQWIHEYHQ 493
>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
Length = 226
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 98/201 (48%), Gaps = 28/201 (13%)
Query: 26 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 85
Y P+F C ++ + N R ST+ ++ Q RTS + + S +
Sbjct: 43 FYHPDFLDAATCDRLVALIDANRRRSTVLAE-----ESVQDFRTSDSCDM---DRWSPDV 94
Query: 86 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASF 140
+E IA + + ++GE RY +GQ + +H+D F + Q Y P+ QR +
Sbjct: 95 RPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWTA 154
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 200
++YL D+EEGG T FP IG++V P++G L + ++ P+G + +
Sbjct: 155 MIYLNDVEEGGATWFP---------------TIGIRVAPKKGLLLTWNNMKPDGDRNTAT 199
Query: 201 IHGSCPVVKGEKWVATKWIRD 221
+H PVV+G K++ TKW R+
Sbjct: 200 LHEGMPVVQGTKYIVTKWFRE 220
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 111/227 (48%), Gaps = 35/227 (15%)
Query: 12 TNIPF--------QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK--GETV 61
TN PF ++L+ P + + + TP + + + +A L+ +T+ +K TV
Sbjct: 310 TNSPFLRLAPLKTELLALDPYMVLYHDVITPSEIRELQYLAVPTLKRATVFNQKMGRNTV 369
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 120
T RTS +++ D L + + +I+ +T E ++ Y +G Y+ H
Sbjct: 370 VKT---RTSKVTWLT---DSLNPLTVRLNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLH 423
Query: 121 YDAFDP---QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 177
+D F+ ++ R+A+ L YLTD+E+GG T+FP I +
Sbjct: 424 FDYFNATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFP---------------NIKQAI 468
Query: 178 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
P++G +++Y+L N DP ++H +CPV+ G KWV KWIR+ +Q
Sbjct: 469 FPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWIREHQQ 515
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + T ++ + I ++K LR +T++ V T R S +++A E
Sbjct: 342 PRIIRYHEIITEQEIEKIKELSKPRLRRATIS-NPITGVLETAHYRISKSAWLAAYE--H 398
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+D I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 399 PVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 458
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G VKP +G + +Y+L P+G D
Sbjct: 459 TWLFYMSDVAAGGATVFP---------------EVGAAVKPLKGTAVFWYNLFPSGEGDY 503
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 504 STRHAACPVLVGNKWVSNKWIHERGQ 529
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 99/185 (53%), Gaps = 20/185 (10%)
Query: 36 QCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKV 95
+C +++M + N+ S+LA T G R SS I E + IE++I+
Sbjct: 2 ECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI-----EDIVVSKIEDRISLW 47
Query: 96 TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMF 155
+ LP+ NGE+ +L+Y + + + + + R+A+ L+YL+D+++GGET+F
Sbjct: 48 SFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVF 102
Query: 156 PFENGMNADGSYDY-QKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 214
P +A +C G V+P +G+ +L ++L P+G D S + CPV++GEKW+
Sbjct: 103 PRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLEGEKWL 162
Query: 215 ATKWI 219
A K I
Sbjct: 163 AIKHI 167
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 105/206 (50%), Gaps = 25/206 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LSW PR + F T E+C +I++A + + +G + + + ++++
Sbjct: 61 LSWRPRVFLYKGFLTDEECDRLISLA-----------HGAKEISKGKGDGSRNNIQLASS 109
Query: 79 EDESGTLD----LIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
E S D IEE+I+ T +P+ N + ++ Y I ++ H+D FD + S
Sbjct: 110 ESRSHIYDDLLARIEERISAWTFIPKENSKPLQVMHYGI-EEAREHFDYFDNKTLISNVS 168
Query: 135 QRVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+A+ ++YL+++ GGE +FP ++ + +D + D ++P +G+ +L ++
Sbjct: 169 L-MATLVLYLSNVTRGGEILFPKSELKDKVWSDCTKDSSI-----LRPVKGNAVLIFNAH 222
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATK 217
N + D S HG CPV++GE W ATK
Sbjct: 223 LNASADSRSTHGRCPVLEGEMWCATK 248
>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
Length = 197
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 102/214 (47%), Gaps = 35/214 (16%)
Query: 14 IPFQVLSWM-------PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
+ F ++ W PR L N + ++CK I N+A L+ ST++ K +D +
Sbjct: 8 VSFLLIIWFFIPIYEKPRVL--KNVLSEDECKHIQNIASKKLQTSTVS--KSRDID--ES 61
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
IR S ++ A+ED +D + K +T P N E +L+YK G Y H D F P
Sbjct: 62 IRKSETAWLKASED--PVVDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDTF-P 118
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ K++R+ +F++ L D EGGET FP N SY +K GD L
Sbjct: 119 DD----KNKRMYTFIIALNDEYEGGETEFP-----NIKKSYRLEK----------GDALF 159
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F +L I ++HG PV GEKWV W+R
Sbjct: 160 FNTLNNYECITKKALHGGTPVKSGEKWVCNLWVR 193
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 107/224 (47%), Gaps = 30/224 (13%)
Query: 12 TNIPFQVLSWM--------PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 63
T PF +L+ + P + + + +P++ K + MA L+ +T+ + +
Sbjct: 310 TTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIKELQGMATPGLKRATV-YQASSGRNE 368
Query: 64 TQGIRTSSGVFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 122
RTS ++ D L + + +I+ +T E ++ Y +G Y+ HYD
Sbjct: 369 VVKTRTSK---VAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYD 425
Query: 123 AFDP--QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
F+ R+A+ L YLTD+E+GG T+FP I V P+
Sbjct: 426 FFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP---------------NIRKAVFPQ 470
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G +++Y+L NG ID ++H +CPV+ G KWV KWIR++EQ
Sbjct: 471 RGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQ 514
>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
Length = 421
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 107/230 (46%), Gaps = 39/230 (16%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI- 75
+V S PR L +F TPE+C +I+ AK + ST++ V + RTSS ++
Sbjct: 195 KVRSISPRVLEVEDFLTPEECHELISSAKPLMSRSTVSAEGDSAVSLQESSRTSSTAWLP 254
Query: 76 -------SAAEDESGTL---DLIEEKIAKVTMLPRINGE------AFNILRYKIGQKYNS 119
+ D +L D + + V L I+ A+ +LRY++ Q Y+
Sbjct: 255 PHSHTLANKLYDRVSSLVGIDFRKHEHVVVEDLQAIDKRGGSSVTAWQVLRYEVNQHYHI 314
Query: 120 HYDAFDPQEY-----GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC-I 173
H+D FDP + G +++ + +F YLTD+E G DY C
Sbjct: 315 HHDYFDPVLHRGFLQGDGRNRFITAFF-YLTDVERGDPRPIT-----------DYSDCNR 362
Query: 174 GLKVKPRQGDGLLFYSLLPNGT----IDPTSIHGSCPVVKGEKWVATKWI 219
GL+V P++G ++FYSLL +G +D S HG C V G KW A WI
Sbjct: 363 GLRVPPKRGKAIIFYSLLADGQRSGGLDVASWHGGCDVHNGTKWAANYWI 412
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 73/139 (52%), Gaps = 17/139 (12%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLT 145
+ +I+ +T E ++ Y +G Y+ HYD F+ R+A+ L YLT
Sbjct: 391 LNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLT 450
Query: 146 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 205
D+E+GG T+FP I V P++G +++Y+L NG ID ++H +C
Sbjct: 451 DVEQGGATVFP---------------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAAC 495
Query: 206 PVVKGEKWVATKWIRDQEQ 224
PV+ G KWV KWIR++EQ
Sbjct: 496 PVIVGSKWVCNKWIREREQ 514
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 73/139 (52%), Gaps = 17/139 (12%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLT 145
+ +I+ +T E ++ Y +G Y+ HYD F+ R+A+ L YLT
Sbjct: 391 LNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGDRIATVLFYLT 450
Query: 146 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 205
D+E+GG T+FP I V P++G +++Y+L NG ID ++H +C
Sbjct: 451 DVEQGGATVFP---------------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAAC 495
Query: 206 PVVKGEKWVATKWIRDQEQ 224
PV+ G KWV KWIR++EQ
Sbjct: 496 PVIVGSKWVCNKWIREREQ 514
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 17/139 (12%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLT 145
+ +IA +T E ++ Y +G Y+ HYD F+ R+A+ L YLT
Sbjct: 391 LNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNTINSNLTAMSGDRIATVLFYLT 450
Query: 146 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 205
D+E+GG T+FP I V P++G +++Y+L NG D ++H +C
Sbjct: 451 DVEQGGATVFP---------------NIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAAC 495
Query: 206 PVVKGEKWVATKWIRDQEQ 224
PV+ G KWV KWIR++EQ
Sbjct: 496 PVIVGSKWVCNKWIREREQ 514
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 269 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 325
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 326 KHVAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 385
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 386 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 430
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 431 KTRHAACPVLTGSKWVANKWLHERGQ 456
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 104/215 (48%), Gaps = 23/215 (10%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ +++S P + + + +P + + ++A L+ +T+ + ++ RTS
Sbjct: 321 LKMELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATV-FNQQSMRNHVVKTRTSKVT 379
Query: 74 FISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ---EY 129
++ D L + + +I +T E ++ Y +G Y+ HYD F+ +
Sbjct: 380 WLL---DTLNQLTIRLNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADL 436
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
R+A+ L YLTD+E+GG T+FP I V P+ G +++Y+
Sbjct: 437 TRLNGDRIATVLFYLTDVEQGGATVFP---------------NIEKAVFPKSGTAVVWYN 481
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L +G DP ++H +CPV+ G KWV KWIR+++Q
Sbjct: 482 LRHDGNGDPQTLHAACPVIVGSKWVCNKWIRERQQ 516
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 71/137 (51%), Gaps = 15/137 (10%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 147
+ ++I ++ E ++ Y +G Y SHYD + R+A+ + YL+D+
Sbjct: 383 LNKRIEDMSGFTMYGSEMLQVMNYGLGGHYASHYDFLNATSKTRLNGDRIATVMFYLSDV 442
Query: 148 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV 207
E+GG T+FP QK V P++G +++Y+L NG D +IH +CPV
Sbjct: 443 EQGGATVFP-----------KIQKA----VFPQRGTAIIWYNLKENGDFDTNTIHAACPV 487
Query: 208 VKGEKWVATKWIRDQEQ 224
+ G KWV KWIR+ EQ
Sbjct: 488 IVGSKWVCNKWIRENEQ 504
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 101/203 (49%), Gaps = 13/203 (6%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
+SW PR + F T E+C +I++A+ S E D+ G + +F S+
Sbjct: 60 TVSWQPRVFVYKGFLTDEECDHLISLAQGTKETS-------EGKDDDSGRIERNRLFASS 112
Query: 78 AE---DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
+ L IEE+++ T+LP+ N + ++ Y I N ++D F +
Sbjct: 113 TSLLNMDDNILSRIEERVSAWTLLPKENSKPLQVMHYGIEDAKN-YFDYFGNKSAIISSE 171
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
+A+ + YL+++ +GGE FP N S D K I ++P +G+ +LF+++ PN
Sbjct: 172 PLMATLVFYLSNVTQGGEIFFPKSEVKNKIWS-DCTK-ISDSLRPIKGNAILFFTVHPNT 229
Query: 195 TIDPTSIHGSCPVVKGEKWVATK 217
+ D S H CPV++GE W ATK
Sbjct: 230 SPDMGSSHSRCPVLEGEMWYATK 252
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + + + + + + +AK LR +T++ V T R S +++A ED
Sbjct: 346 PRIIRYHDVLSNSEIEKVKELAKPRLRRATIS-NPITGVLETAHYRISKSAWLTAYEDP- 403
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+D I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 404 -VVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 462
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L+Y++D+ GG T+F +G V P++G + +Y+L P+G D
Sbjct: 463 TWLIYMSDVPSGGATVFT---------------DVGAAVWPKKGSAVFWYNLFPSGEGDY 507
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 508 STRHAACPVLVGNKWVSNKWIHERGQ 533
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 110/256 (42%), Gaps = 59/256 (23%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR Y NF + + + + P +A G T + + G ++ E+
Sbjct: 205 PRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGT--HKAWNQGGDGAVLTTRTSEN 262
Query: 83 GTLDLIEEKIAKVT----MLPRING------EAFNILRYKIGQKYNSHYDAF-------- 124
D+ ++ V L R+NG + ILRYK+GQ Y +H+D F
Sbjct: 263 -AFDITTKQSFDVKKRAFRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKDF 321
Query: 125 --DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD------------------ 164
DP G S R A+ +YL+D+ GG+T+FP ++A+
Sbjct: 322 NWDPLSGG---SNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESPSASEL 378
Query: 165 -----------GSYD---YQKCI-GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVK 209
GS++ KC V PR+GD +LFYS P+G +D S+HG+CP++
Sbjct: 379 KEFVSNAGLMEGSWEDNLIHKCYEKFAVPPRRGDAILFYSQRPDGLLDTNSLHGACPILN 438
Query: 210 GEKWVATKWIRDQEQY 225
G KW A W+ + +Y
Sbjct: 439 GTKWGANLWVWNACRY 454
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 79/129 (61%), Gaps = 5/129 (3%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+V+SW PRA + NF T E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSR--VRTSSGTFL 136
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+ D+ + IE+KIA T +P +GE +L Y++GQKY HYD F + Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 136 RVASFLVYL 144
R+A+ L+YL
Sbjct: 195 RIATVLMYL 203
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 86/154 (55%), Gaps = 18/154 (11%)
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
R S ++ +++S T++ +I+++T L E + Y IG +Y HYD + +
Sbjct: 367 RVSKSAWLK--DEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYD-YSRR 423
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
E+ ++R+A++L YLT +E+GG T+F +GL ++ +G + +
Sbjct: 424 EWDIYNNRRIATWLSYLTTVEQGGGTVFT---------------ELGLHIRSIKGSAVFW 468
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
Y+LLPNG+ D + H +CPV++G KWV+ KWI +
Sbjct: 469 YNLLPNGSGDERTRHAACPVLRGNKWVSNKWIHE 502
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P+ + F + + + + +A LR +T+ ++ + R S ++S ED+
Sbjct: 333 PKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAE-YRISKSAWLS--EDDG 389
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +I + T L E + Y +G Y H+D +E KS R+A
Sbjct: 390 DVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEINAFKSLNTGNRIA 449
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+FL Y++D+E GG T+FP +G ++ P +G +Y+LL NG D
Sbjct: 450 TFLFYMSDVEAGGATVFP---------------QVGARLIPEKGSAAFWYNLLKNGEGDY 494
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 STRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 193
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 99/208 (47%), Gaps = 21/208 (10%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+ LS PRA NF T + I+ + + +++ T + RTSS +++
Sbjct: 2 KALSCAPRAFQVENFLTDVEADHIVGLVQ-----KKNDMQRSSTNGHISETRTSSTTWLA 56
Query: 77 AAEDESGTLDLIEEKIAKV-----TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
D +D I ++A ML R E I+ Y +GQ+Y +H+D F + P
Sbjct: 57 RHSDP--VIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHD-FGYPKGDP 113
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R +F +YL D+ GG+T FP +G+ L V P++G ++FY +
Sbjct: 114 GSPSRSINFCMYLNDVPAGGQTSFPRWRNAETNGA--------LNVVPKKGTAMIFYMVN 165
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWI 219
P+G +D + H + PV++GEK+ + WI
Sbjct: 166 PDGNLDDLTHHAALPVIEGEKFFSNLWI 193
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 29/211 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + +I+ A N++ + + ET T RT+ G ++ +E
Sbjct: 327 PYVVLYHEVLSAREISMLISKAAQNMKNTRV---HRETKPKTNRGRTAKGHWLKKESNE- 382
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---PQEYGPQKSQ---- 135
I +I +T + E F ++ Y IG Y H D FD GP+ Q
Sbjct: 383 -LTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVL 441
Query: 136 --RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ L YL+D+E+GG T+F +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLSDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDTD 486
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G DP + H SCPV+ G KWV T+WIR+ Q
Sbjct: 487 GNGDPLTRHASCPVIVGSKWVMTEWIRESRQ 517
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 28/211 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + +I A N++ + + +G N RT+ G + +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKATQNMKNTRVHKEQGVPKKNRG--RTAKGFWFKKESNE- 382
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQK 133
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDL 441
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ L YLTD+E+GG T+F AD +G V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------AD--------VGYSVYPQAGTAIFWYNLDTN 486
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTKHAACPVIVGSKWVMTEWIREKRQ 517
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 343 PRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 399
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +++ +T L E ++ Y IG Y H+D +E KS R+A
Sbjct: 400 KHVAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 459
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + PR+G +++L PNG D
Sbjct: 460 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWFNLKPNGEGDL 504
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 505 RTRHAACPVLTGSKWVANKWLHERGQ 530
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 97/211 (45%), Gaps = 29/211 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + ++ A N++ + + + E NT RT+ G ++ +E
Sbjct: 326 PYVVLYHEVLSAREISMLMGKAAQNMKNTRV---QSEKAVNTNRERTAKGYWLKKESNE- 381
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--AFDPQEYGPQKSQ----- 135
I +I +T + E F ++ Y IG Y+ H+D F Y ++S
Sbjct: 382 -MTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIVL 440
Query: 136 --RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ L YLTD+E+GG T+F +G V P+ G + +Y+L +
Sbjct: 441 GDRIATVLFYLTDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDTD 485
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G DP + H SCPVV G KWV T+WI + Q
Sbjct: 486 GNGDPLTRHASCPVVVGSKWVMTEWIHEARQ 516
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK LR +T++ ++ T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 88/190 (46%), Gaps = 38/190 (20%)
Query: 58 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP--------RING------ 103
+ + QG+ T + + SG +++ + +KV P R+N
Sbjct: 343 AKEIKELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAWFPDGYSPLTVRLNARITDMT 402
Query: 104 -------EAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLTDLEEGGETM 154
E ++ Y +G Y+ HYD F+ R+A+ L YLTD+E+GG T+
Sbjct: 403 GFNLHGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRIATVLFYLTDVEQGGATV 462
Query: 155 FPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 214
FP I V P++G +++Y+L +G ID ++H +CPV+ G KWV
Sbjct: 463 FP---------------NIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKWV 507
Query: 215 ATKWIRDQEQ 224
KWIR++EQ
Sbjct: 508 CNKWIREREQ 517
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 208 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 264
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 265 KHVAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + PR+G +++L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWHNLKPNGEGDF 369
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + + + +++ MAK LR +T++ V T R S ++S E
Sbjct: 337 PRIVRYHEIISDSEIETVKEMAKPRLRRATIS-NPITGVLETAPYRISKSAWLSGYE--H 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
T++ I ++I VT L E + Y +G +Y H+D E K R+A
Sbjct: 394 STIERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+F +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFT---------------DVGAAVWPKKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWIHERGQ 524
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 102/221 (46%), Gaps = 27/221 (12%)
Query: 13 NIPFQVLSWM--------PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 64
N PF++L+ + P + + + + ++ + + MA +R ST+ G +
Sbjct: 313 NHPFRLLAPLKLEEHNLDPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQ-NKK 371
Query: 65 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 124
R S ++ A + T++ + + T L E + Y +G Y H+D F
Sbjct: 372 SAFRVSKNAWL--AYESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFF 429
Query: 125 -DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
DP Y ++ R+A+ + YL+D+E+GG T FPF + VKP+ G+
Sbjct: 430 RDPNHYPAEEGNRIATAIFYLSDVEQGGATAFPF---------------LDFAVKPQLGN 474
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L +Y+L + +D + H CPV+KG KW+ WI D Q
Sbjct: 475 VLFWYNLHRSLDMDYRTKHAGCPVLKGSKWIGNVWIHDMTQ 515
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 79/134 (58%), Gaps = 15/134 (11%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 70
+V+SW PRA + NF + E+C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
SG+F+ D+ + +IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 131 PQKSQRVASFLVYL 144
QR+A+ L+YL
Sbjct: 209 KNGGQRMATLLMYL 222
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 101/214 (47%), Gaps = 20/214 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I +V P+ F + T + + + +A L +T+ GE + T R S
Sbjct: 317 IKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATVHGENGELLHAT--YRISKSG 374
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YG 130
++S ++D G +D I+++I VT L E ++ Y IG +Y HYD E
Sbjct: 375 WLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDTFTS 434
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
R+++ L+Y++D+E+GG T+FP +G ++ P + +++L
Sbjct: 435 LGSGNRISTLLIYMSDVEKGGATVFP---------------GVGARLVPIKRAAAYWWNL 479
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G D ++ H CPV+ G KWV KWI ++ Q
Sbjct: 480 KRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQ 513
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK LR +T++ ++ T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|87199403|ref|YP_496660.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
gi|87135084|gb|ABD25826.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
Length = 211
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 97/198 (48%), Gaps = 28/198 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
+F + +C +I + + RPST+A G+ RTS + + E LD
Sbjct: 34 DFLSQAECNGLIARIERDRRPSTIADANGDHY-----FRTSETCDLPMDDPEIVALD--- 85
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
EK+ ++ + R GE RY+ GQ++ +H D FDP Q + QR +F+VYL
Sbjct: 86 EKLCALSGIGRPFGEPIQGQRYESGQEFKAHTDYFDPHGADFQRFCSVAGQRTWTFMVYL 145
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
D+E GG T F K I ++P +G + + + P+GT++P ++H +
Sbjct: 146 NDVEAGGATRF---------------KVIDKTIQPERGKLVCWNNRRPDGTVNPCTLHHA 190
Query: 205 CPVVKGEKWVATKWIRDQ 222
V KG K+V TKW R++
Sbjct: 191 MKVRKGLKYVITKWYREK 208
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK LR +T++ ++ T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 99/220 (45%), Gaps = 28/220 (12%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
P +V LS P + + + + ++ + K + +T+ V N RTS
Sbjct: 320 PLKVEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVTGNNASVVSNA---RTSQF 376
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
FI + L I++++A +T L + E + Y IG Y H D F P +
Sbjct: 377 TFIPKTRHK--VLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHYAQHMDWFSPNAFETK 434
Query: 130 ---GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ R+A+ L YLTD+E+GG T FP + +KP++
Sbjct: 435 QVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQL---------------LKPKKYAAAF 479
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+Y+L +G D ++HG+CP++ G KWV +WIR+ Q D
Sbjct: 480 WYNLHASGAGDVRTMHGACPIIVGSKWVLNRWIREFVQSD 519
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 84/148 (56%), Gaps = 3/148 (2%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 75
+VL+W PR + NF + E+C + +A L ST+ + G+ + + +RTSSG+F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNA 163
R+A+ L+YL+D E GET FP +NA
Sbjct: 200 RIATMLMYLSDNIERGETYFPLAGSVNA 227
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 101/218 (46%), Gaps = 24/218 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF++ LS P + + + I MA+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQG 375
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
+ + + ++ + ++ L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSF-PENHIYD 432
Query: 130 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
G + R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDDRGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWY 477
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+L +G D + H +CPV++G KW+A WIR++ Q++
Sbjct: 478 NLHESGDQDYRTKHAACPVLQGSKWIANVWIRERNQHN 515
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 28/211 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + +I A N++ + + +G N RT+ G + +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKEQGVPKKNRG--RTAKGFWFKKESNE- 382
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---------YGPQK 133
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDL 441
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ L YLTD+E+GG T+F AD +G V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------AD--------VGYSVYPQAGTAIFWYNLDTN 486
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 85/163 (52%), Gaps = 22/163 (13%)
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEA--FNILRYKIGQKYNSHYDA 123
RTS+ VF+ + S +D++ +++A +T L + ++ Y +G Y H+D
Sbjct: 329 NFRTSNSVFL--LNNASYLVDILRQRVADMTHLNVFKNSSDDLQVMNYGLGGYYRYHFDF 386
Query: 124 FDPQEYGPQK--SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 181
F E P K R+ + L+Y+TD+++GG T+FP + + P++
Sbjct: 387 FGKDE-SPNKLLGDRIITVLIYMTDVQQGGATVFP---------------ALRITNFPKK 430
Query: 182 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G L+F +L N + DP+++H CPV+ G KW ATKWI EQ
Sbjct: 431 GSALIFRNLDNNISPDPSTLHAGCPVLFGSKWAATKWIYSAEQ 473
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 99/206 (48%), Gaps = 23/206 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + +P + + MAK L+ +T+ T + Q ++T + + +
Sbjct: 325 PYMVLYHDVISPNEIAELQEMAKPELKRATVY---NSTKNTNQFVKTRTAKVAWFLDTFN 381
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ----RVA 138
+ + ++I +T E ++ Y +G Y H+D F+ P SQ R+A
Sbjct: 382 QLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTT-NPHISQINGDRIA 440
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L YL D+E+GG T+FP I V P++G +++Y+L +G +
Sbjct: 441 TVLFYLNDVEQGGATVFP---------------EIKKAVFPKRGSAIMWYNLKDDGEGNR 485
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++H +CPV+ G KWV KWIR++EQ
Sbjct: 486 DTLHAACPVIVGSKWVCNKWIREREQ 511
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 100/220 (45%), Gaps = 28/220 (12%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
P +V LS P + F + + +++ +AK + +T+ V N RTS
Sbjct: 324 PLKVEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVTGHNSSVVSNA---RTSQF 380
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP------ 126
F+ + L I++++A +T L E + Y IG Y H D F P
Sbjct: 381 TFLPKTRHK--VLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETK 438
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
Q P+ R+ + L YL+D+E+GG T FP + ++P++
Sbjct: 439 QVSNPEMGNRIGTVLFYLSDVEQGGATAFP---------------ALKQLLRPKKHAAAF 483
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+Y+L +G D ++HG+CP++ G KWV +WIR+ Q D
Sbjct: 484 WYNLHASGVGDARTMHGACPIIVGSKWVLNRWIREFVQSD 523
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 104/217 (47%), Gaps = 19/217 (8%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P F ++ I+ ++ +L PS + L+ G RTS+ ++ ++
Sbjct: 3 PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHP- 61
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ------------EYG 130
+ I+++ A + +P + E+ +LRY+ Q Y+ H D F + EYG
Sbjct: 62 -VVQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYG 120
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-GLKVKPRQGDGLLFYS 189
+ R+ + Y++D+ +GG T F G+ S + C G+ V P++ ++FYS
Sbjct: 121 YK--NRMITVFWYMSDVAKGGHTNFARSGGLPRPSSN--KDCSQGISVAPKKRKVVVFYS 176
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+LPNG DP S+H CPV +G K KWI ++ + D
Sbjct: 177 MLPNGEGDPMSLHAGCPVEEGIKLSGNKWIWNKPRSD 213
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 88/194 (45%), Gaps = 28/194 (14%)
Query: 40 IINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP 99
+I A N++ + + K V RT+ G ++ +E I +I +T
Sbjct: 2 LIGKAAQNMKNT--KIHKERAVPKKNRGRTAKGFWLKKESNE--LTKRITRRIMDMTGFD 57
Query: 100 RINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQKSQRVASFLVYLTDLEEG 150
+ E F ++ Y IG Y H D FD Y R+A+ L YLTD+E+G
Sbjct: 58 LADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQG 117
Query: 151 GETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKG 210
G T+F +G V P+ G + +Y+L +G DP + H +CPV+ G
Sbjct: 118 GATVFG---------------DVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVG 162
Query: 211 EKWVATKWIRDQEQ 224
KWV T+WIR++ Q
Sbjct: 163 SKWVMTEWIREKRQ 176
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 94/211 (44%), Gaps = 28/211 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + +I A N++ + + K V RT+ G ++ +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNT--KIHKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQK 133
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ L YLTD+E+GG T+F D +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------GD--------VGYYVSPQAGTAIFWYNLDTD 486
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 94/211 (44%), Gaps = 28/211 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + +I A N++ + + K V RT+ G ++ +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNT--KIHKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQK 133
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ L YLTD+E+GG T+F D +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------GD--------VGYYVSPQAGTAIFWYNLDTD 486
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'Black Sea 11']
gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Black Sea 11']
Length = 354
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/213 (32%), Positives = 102/213 (47%), Gaps = 26/213 (12%)
Query: 15 PFQVLSW-MPRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSS 71
P +VL +P LY + + +C +I L+PS + L VDN +RTS
Sbjct: 146 PTEVLDQTLPVELYV-DVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDN---VRTSY 201
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
I+ + + T L ++ I++VT PR NGEA N+LRY GQ+Y HYDA + G
Sbjct: 202 VAIIAPSYCDWITRKL-DKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGS 260
Query: 132 QK---SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
QR+ + LVYL + +GGET FP + + V P G+ ++F
Sbjct: 261 MYKDGKQRIKTALVYLNTVRQGGETRFPK---------------LDISVSPTLGNMVVFS 305
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+ +G + S H P KW+ TKWIR+
Sbjct: 306 NSDESGKLLLNSYHLGAPTFSENKWLVTKWIRE 338
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 88/178 (49%), Gaps = 24/178 (13%)
Query: 56 RKGETVDNTQGI---RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYK 112
R G +++T + RTS +FI+A + L I++++A +T L E + Y
Sbjct: 362 RAGVVINSTSTVSKKRTSQHIFIAATRHK--VLRTIDQRVADMTNLNMQYAEDHQLADYG 419
Query: 113 IGQKYNSHYDAFDPQEYGPQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYD 168
IG Y+ H+D F + K R+A+ L YL+D+ +GG T FP +
Sbjct: 420 IGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPILKQL------- 472
Query: 169 YQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+KP++ +Y+L +G D ++HG CP++ G KWV +WIR+ +Q D
Sbjct: 473 --------LKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWIREYDQSD 522
>gi|359400227|ref|ZP_09193216.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
gi|357598467|gb|EHJ60196.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
Length = 193
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 100/198 (50%), Gaps = 28/198 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
+F QC ++I + + RPST+A G+ V RTSS +S + G + +
Sbjct: 16 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSP---DVGAVAALA 67
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
K+ ++ + + E RY++GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 68 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 127
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
D++ GG T F K I ++P +G + + + P+G+++P ++H +
Sbjct: 128 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 172
Query: 205 CPVVKGEKWVATKWIRDQ 222
V +G K+V T+W R++
Sbjct: 173 MKVRQGRKYVVTQWFRER 190
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 98/206 (47%), Gaps = 28/206 (13%)
Query: 25 ALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAEDE 81
LY + PE + +MA L+ +T+ + R+ E V RTS + +E
Sbjct: 333 VLYHDVISAPE-ISQLQDMATPGLKRATVYKASGRRSEVVKT----RTSKVAWFPDTFNE 387
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ---EYGPQKSQRVA 138
+ + +IA +T + E + Y +G Y+ HYD F+ R+A
Sbjct: 388 --LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLTQMNGDRIA 445
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L YLTD+E+GG T+FP I V P++G +++Y+L +G +P
Sbjct: 446 TVLFYLTDVEQGGATVFP---------------NIRKAVFPQRGSAIIWYNLKDDGDPNP 490
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++H +CPV+ G KWV KWIR++ Q
Sbjct: 491 QTLHAACPVLVGSKWVCNKWIRERAQ 516
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 100/207 (48%), Gaps = 24/207 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 81
P + + + + E+ + I +AK L +T+ R +T V T R S ++ ED
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRLARATV--RDPKTGVLTTANYRVSKSAWLEGEEDP 394
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRV 137
+D + ++I +T L E + Y +G +Y H+D E K RV
Sbjct: 395 --VIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRV 452
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+FL Y++D+E GG T+FP D+ G + PR+G + +Y+L +G D
Sbjct: 453 ATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTSVFWYNLFRSGEGD 497
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWIHERGQ 524
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + N + + + + +AK LR +T++ V T R S ++ A E
Sbjct: 395 PHIVRYHNIVSEKDMEKVKELAKPRLRRATIS-NPVTGVLETAHYRISKSAWLGAYE--H 451
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 134
+D I + I VT L E + Y +G +Y H+D AF+ G
Sbjct: 452 PVVDKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTG---- 507
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
R+A++L+Y+TD++ GG T+F IG VKP++G + +Y+L P+G
Sbjct: 508 NRIATWLLYMTDVQAGGATVFT---------------DIGAAVKPKKGTAVFWYNLYPSG 552
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 553 EGDYRTRHAACPVLLGNKWVSNKWIHERGQ 582
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 102/194 (52%), Gaps = 25/194 (12%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
+F P++C+++I + + +PST+ E D Q RTSS + +D + I+
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTIT---SENPD--QQFRTSSTCHLGNMQDP--VIRKID 155
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQKSQRVASFLVYLTD 146
+I + + E Y++GQ++ H D F+P E YG + QR +F++YL +
Sbjct: 156 LQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNE 215
Query: 147 LEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 206
+E+GG+T+FP + IG K K +G +++ ++ P+G+++ ++H P
Sbjct: 216 VEQGGDTVFP-------------ELAIGFKAK--KGMAVIWNNINPDGSVNYQTLHQGMP 260
Query: 207 VVKGEKWVATKWIR 220
V KGEK + TKW R
Sbjct: 261 VQKGEKLIITKWFR 274
>gi|89092696|ref|ZP_01165649.1| hypothetical protein MED92_15358 [Neptuniibacter caesariensis]
gi|89083208|gb|EAR62427.1| hypothetical protein MED92_15358 [Oceanospirillum sp. MED92]
Length = 441
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 99/207 (47%), Gaps = 26/207 (12%)
Query: 25 ALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGT 84
Y NF +PE+C +I + + + RPST G + + RTS +S E S
Sbjct: 72 VFYVNNFLSPEECAQMIELIQHHQRPSTTTNETG----HYKHYRTSKTCDLSLLE--STF 125
Query: 85 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-----EYGPQKSQRVAS 139
+ I+++I K+ + E Y IG+++ H D F+P+ E+ + QR +
Sbjct: 126 VAEIDQRICKMLGIEPSYSEGIQGQWYDIGEEFKPHTDYFEPKSDEFLEHAEARGQRTWT 185
Query: 140 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 199
F++YL + +EGG T FP +G + P QG +++ +L +G+ +P
Sbjct: 186 FMIYLNNTQEGGGTFFPE---------------LGQRFLPSQGKAVIWNNLTTDGSPNPA 230
Query: 200 SIHGSCPVVKGEKWVATKWIRDQEQYD 226
++H PV +G K + TKW R + D
Sbjct: 231 TLHHGEPVKRGYKAIITKWFRSKGTAD 257
>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 750
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 110/233 (47%), Gaps = 56/233 (24%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLA---LRKGETVDNTQGIRTSSGVFISAAEDESGT 84
F +F + +C ++ +A +LR S + L +G RTSS F++ + E
Sbjct: 534 FDHFLSAVECDDLVAIAAPDLRRSRVTDGKLSEG---------RTSSSTFLTGCKQEEPL 584
Query: 85 LDLIEEKIAK-------VTMLPRI--------------------------NGEAFNILRY 111
+ IE+++ + + P + E ++RY
Sbjct: 585 VRAIEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRY 644
Query: 112 KIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNA-DGSYDYQ 170
GQ Y +HYD + G + R A+F++YLTD+ GG T FP ++ DG D
Sbjct: 645 TEGQMYTAHYD----NKQGCLR--RTATFMMYLTDVHSGGATHFPRAVPVSMRDGCGDA- 697
Query: 171 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
G+++ P++G L+F+S + G D S+H + PV++GEKW+ATKW+R+ E
Sbjct: 698 --AGIRIWPKRGRALVFWS-VSGGIEDVRSLHEAEPVIEGEKWIATKWLREDE 747
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ + + ++K LR +T++ V T R S ++S E+
Sbjct: 344 PRIVRFLDIISNEEIEKVKELSKPRLRRATIS-NPITGVLETAHYRISKSAWLSGYENP- 401
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 402 -VVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 460
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 461 TWLFYMSDVAAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 505
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 506 STRHAACPVLVGNKWVSNKWIHERGQ 531
>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
Length = 248
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 99/231 (42%), Gaps = 34/231 (14%)
Query: 9 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 68
++ +I + +S PR +F + + I+ + + + R
Sbjct: 30 NATLDITLRTVSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTR 89
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTML-------------PRIN-----GEAFNILR 110
TS +I +E+ +D I ++A V + PR+ E ++
Sbjct: 90 TSMNTWI--YREETAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVH 147
Query: 111 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 170
Y G++Y +H+D P + R + L+YL D+EEGGET FP
Sbjct: 148 YDPGEEYTAHHDFGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP-------------- 193
Query: 171 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+ GL VKP +G +LFY L +G D S H + PV+KGEKW++ WI D
Sbjct: 194 RWGGLDVKPVKGKAVLFYMLTADGNSDDLSQHAALPVIKGEKWMSNLWIWD 244
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 96/204 (47%), Gaps = 34/204 (16%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDE 81
+ + + +C+ +I L+PS + VD G +RTS I A +
Sbjct: 181 YESILSEYECRYLITKFNALLKPSMV-------VDPVTGRGKIDSVRTSYVAVIEPAHCD 233
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF----DPQEYGPQKSQRV 137
T L ++ I+++T R NGEA N+LRY GQ+Y HYD D + K QR+
Sbjct: 234 WITRKL-DKTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDALMFKDGK-QRI 291
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
+ LVYL + EGGET+FP + +++ P+ G ++F + NG +
Sbjct: 292 KTALVYLNTISEGGETLFPK---------------LDIRIAPKSGTMVVFSNSDENGKLL 336
Query: 198 PTSIHGSCPVVKGEKWVATKWIRD 221
S H P V KW+ TKWIR+
Sbjct: 337 LNSYHAGAPTVSENKWLVTKWIRE 360
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 104/213 (48%), Gaps = 35/213 (16%)
Query: 15 PF--QVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVDNTQGIRTSS 71
PF ++LS P + + + TP + ++ N++K L R + + + + RTS+
Sbjct: 227 PFKTELLSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 286
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
V++++ E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ P
Sbjct: 287 SVWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TP 340
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
Q L+D+ +GG T+FP N + V+PRQGD LL+Y+L
Sbjct: 341 Q-----------LSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 374
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G + ++H SCP++KG KW KWI + Q
Sbjct: 375 DRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 407
>gi|156405954|ref|XP_001640996.1| predicted protein [Nematostella vectensis]
gi|156228133|gb|EDO48933.1| predicted protein [Nematostella vectensis]
Length = 182
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 85/166 (51%), Gaps = 15/166 (9%)
Query: 70 SSGVFISAAEDESGT-LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE 128
SS +++ ED T L I + K++ E + +YK+GQKY+ HYD+
Sbjct: 9 SSSLYLKNKEDSKITILRDIAQLAGKLSNTQWRFAEPVALTKYKVGQKYSLHYDS--GFL 66
Query: 129 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGM--------NADGSYDYQKCIG----LK 176
++ +R A+FLVYL D++ GGET+FP + N D C +K
Sbjct: 67 MNQRRVKRTATFLVYLNDVKSGGETIFPLATNISSIQLKKENVDKPSLDSICGKENNMVK 126
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
V P LLF++ + +D S+HGSCPVV GEKW+A W+ ++
Sbjct: 127 VSPEAQSCLLFWNHVDGDDVDAFSLHGSCPVVSGEKWIAQIWLHNE 172
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 90/196 (45%), Gaps = 30/196 (15%)
Query: 26 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 85
+ + NF + +C+ II++A ++ ST+ K V IRTS G F+ D +
Sbjct: 1 MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGV--VDDIRTSYGTFLRRVPDP--VI 56
Query: 86 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLT 145
IE ++A + LP + E +LRY KY H D +RVA+ L+YL
Sbjct: 57 AAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHIDGL----------ERVATVLIYLG 106
Query: 146 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN-GTIDPTSIHGS 204
E + A G Y KP++GD L+F+ +P+ D S+H
Sbjct: 107 QAERANLSQC-------ARGRVAY--------KPKRGDALMFFDTMPDYKQTDVHSMHTG 151
Query: 205 CPVVKGEKWVATKWIR 220
CPVV+G KW A KW+
Sbjct: 152 CPVVEGVKWNAVKWLH 167
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 96/213 (45%), Gaps = 30/213 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + T Q + + + + L +T R GE IRTS ++ E E+
Sbjct: 349 PMIVVLHDLITERQTEILRQLGEPKL--ATSLHRGGEGKFVRSMIRTSKNAWLQ--EHEN 404
Query: 83 GTLDLIEEKIAKVTML---PRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ---- 135
+L I ++ T L P E F I Y IG Y +H D + P+
Sbjct: 405 ASLPAIRHRMELATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHPDVRPEDQDPWNL 464
Query: 136 ----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R+A+ +VYL+D+E GG T+FP G+ PR+G +++L
Sbjct: 465 YVGDRIATLMVYLSDVEAGGATVFPRA---------------GVTCWPRKGSAAFWWNLY 509
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G D T+ HG+CPV+ G KWV+ KWIR +Q
Sbjct: 510 KSGEPDLTTRHGACPVLHGSKWVSNKWIRQYDQ 542
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ ++LS P + F + + + SI N+ K L + + G ++ RT+ G
Sbjct: 187 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 244
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
++ + + + + + +T + + F +L Y IG Y H+D + E
Sbjct: 245 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 300
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
S R+A+ + YL+D+ +GG T+FP +GL V P++G LL+Y+L
Sbjct: 301 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 345
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G D + H +CP V G +WV TKWI ++EQ
Sbjct: 346 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 376
>gi|313768105|ref|YP_004061536.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
gi|312599712|gb|ADQ91733.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
Length = 197
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 98/214 (45%), Gaps = 35/214 (16%)
Query: 14 IPFQVLSWM-------PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 66
+ F ++ W PR L N + ++CK I ++A L+ ST+++ + D +
Sbjct: 8 VSFLLIIWFFIPIYEKPRVL--KNVLSEDECKHIQDIASKKLQTSTVSMSR----DIDEK 61
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
IR S ++ A+ED +D + K +T P N E +L+YK G Y H D F
Sbjct: 62 IRKSETAWLKASED--PVVDKLIRKCVSMTDRPLHNCEDLQVLKYKPGGFYKPHQDCF-- 117
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
K++R+ +F++ L D EGGET FP N Y +K GD L
Sbjct: 118 ---KNDKNKRMYTFIIALNDEYEGGETEFP-----NIKRRYRLEK----------GDALF 159
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
F +L ++HG PV GEKWV WIR
Sbjct: 160 FNTLNNYECTTKQALHGGAPVKSGEKWVCNLWIR 193
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 102/225 (45%), Gaps = 38/225 (16%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA---------------LRKGETVDNTQGI 67
PR + + T ++ + I ++K LR +T++ R+ D G
Sbjct: 301 PRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGK 360
Query: 68 RTSSGVFISA----AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
T++ +S A E +D I ++I +T L E + Y +G +Y H+D
Sbjct: 361 LTTAQYRVSKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDF 420
Query: 124 FDPQEYGPQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 179
E K R+A++L Y++D+ GG T+FP +G VKP
Sbjct: 421 GRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPE---------------VGAAVKP 465
Query: 180 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G + +Y+L P+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 466 LKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 510
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 106/219 (48%), Gaps = 25/219 (11%)
Query: 11 VTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTS 70
++ + Q ++ P + + + + ++ +II+++K + S + + V T RTS
Sbjct: 304 ISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDDHEKAVSKT---RTS 360
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQ 127
S ++ + + ++ +T L E + Y IG Y HYD A + +
Sbjct: 361 SNAWLDDVMHP--VVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYDYAVAEEGK 418
Query: 128 EYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 185
E P K R+A+ + YL+D+ GG T+FP +GL V P++G +
Sbjct: 419 EVYPSIGKGNRIATVMYYLSDVAIGGATVFP---------------QLGLGVFPQKGSAI 463
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L NGT+D ++HG+CPV G KWV KWI ++ Q
Sbjct: 464 FWYNLHANGTVDHRTLHGACPVFVGSKWVGNKWIHERGQ 502
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 90.1 bits (222), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 80/149 (53%), Gaps = 18/149 (12%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---QKSQ 135
+++ + + ++I+ VT L E I Y +G +Y H+D ++G +
Sbjct: 45 DEDHPVIKRVCQRISDVTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEVGN 104
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+A+FL Y++++E+GG T+F G+ V+P +G + +Y+LLP+G
Sbjct: 105 RIATFLTYMSNVEQGGSTVFLHP---------------GIAVRPIKGSAVFWYNLLPSGA 149
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KWI +++Q
Sbjct: 150 GDERTRHAACPVLTGVKWVSNKWIHERDQ 178
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 110/241 (45%), Gaps = 43/241 (17%)
Query: 12 TNIPF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKG-----ETV 61
T IP+ +V+++ PR F + +P + + ++A ST+ L T
Sbjct: 351 TVIPYYKAKEEVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENTGPDGHVTY 410
Query: 62 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP------RINGEAFNILRYKIGQ 115
+R S ++ DE L +E +I T L R + E F +L Y +G
Sbjct: 411 GKLDNVRVSQTSWLGT--DEYPELSRLENRIKLTTGLSAEYKSVRSHSEKFQVLNYGVGG 468
Query: 116 KYNSHYDAF--------DPQEYGPQKS--QRVASFLVYLTDLEEGGETMFPFENGMNADG 165
Y HYD +P + ++ +R+A+++ YL D++ GG T+FP
Sbjct: 469 MYTVHYDYTGYMLGIPSNPLDSDDIRTSGERMATWMFYLNDVKAGGATVFP--------- 519
Query: 166 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
+ ++ +G +Y++ P+G DP ++HG CPV+ G KWV+ KWIR++ Q
Sbjct: 520 ------EVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKWVSNKWIREEGQM 573
Query: 226 D 226
D
Sbjct: 574 D 574
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 67/225 (29%), Positives = 104/225 (46%), Gaps = 34/225 (15%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL--ALRKGETVDNTQGIRTSS 71
I + S+ P F + + E+ ++I +AK L S + L G V N +RTS
Sbjct: 310 IKIEQHSFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVSN---VRTSK 366
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLP----RINGEAFNILRYKIGQKYNSHYDAF--D 125
++ E L+ + +I +T L R E + Y IG Y+ H+D D
Sbjct: 367 TAWLP--EGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMKD 424
Query: 126 PQEYG------PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 179
++ Q R+A+F+ YL D+E GG T FP G+ VKP
Sbjct: 425 KADFEYMHHRELQAGDRIATFMFYLNDVERGGSTAFP---------------RAGVAVKP 469
Query: 180 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G +++L +G DP ++HG+CPV+ G KWV+ KWIR+ Q
Sbjct: 470 VKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWIRETAQ 514
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 18/140 (12%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVASFLVYL 144
+ +IA +T + E + Y +G Y+ HYD F+ R+A+ L YL
Sbjct: 357 LNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTAANLTQMNGDRIATVLFYL 416
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
TD+E+GG T+FP I V P++G +++Y+L +G +P ++H +
Sbjct: 417 TDVEQGGATVFP---------------NIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAA 461
Query: 205 CPVVKGEKWVATKWIRDQEQ 224
CPV+ G KWV KWIR++ Q
Sbjct: 462 CPVLVGSKWVCNKWIRERAQ 481
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V PR+G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + N ++ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 208 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 264
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 265 KHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 369
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 87/168 (51%), Gaps = 26/168 (15%)
Query: 61 VDNTQGIRTSSGVFISAAEDESGT--LDLIEEKIAKVTML--PRINGEAFNILRYKIGQK 116
+D RTS+ VF+ +E+G L+ I ++ A +T L I+ E ++ Y +G +
Sbjct: 361 IDQADVDRTSNSVFM----EETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQ 416
Query: 117 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLK 176
Y H D FD + R+A+ L YLTD+++GG T+FPF + L
Sbjct: 417 YTPHCDYFDE---NAENGDRLATVLFYLTDVQQGGATVFPF---------------LRLS 458
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
P++G L+F +L + D S H +CPV+ G KWVATKWI +Q
Sbjct: 459 YFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIYHFDQ 506
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + N ++ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 330 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 386
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 387 KHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 446
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 447 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 491
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 492 KTRHAACPVLTGSKWVANKWLHERGQ 517
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 89.7 bits (221), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ ++LS P + F + + + SI N+ K L + + G ++ RT+ G
Sbjct: 310 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 367
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
++ + + + + + +T + + F +L Y IG Y H+D + E
Sbjct: 368 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 423
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
S R+A+ + YL+D+ +GG T+FP +GL V P++G LL+Y+L
Sbjct: 424 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 468
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G D + H +CP V G +WV TKWI ++EQ
Sbjct: 469 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 499
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 88/191 (46%), Gaps = 20/191 (10%)
Query: 37 CKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVT 96
+S+ A+ ++ ST+ G RTS G + + + L+ + +
Sbjct: 340 AESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQGASFNYSRN--AATKLLSHHVGDFS 397
Query: 97 MLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEY--GPQKSQRVASFLVYLTDLEEGGET 153
L E + Y IG Y H+D+F D Y G R+A+ + YL+D+E GG T
Sbjct: 398 GLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQEGDLHGNRIATAIYYLSDVEAGGGT 457
Query: 154 MFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKW 213
FPF + L V P +G L +Y+L P+G D + H +CPV++G KW
Sbjct: 458 AFPF---------------LPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Query: 214 VATKWIRDQEQ 224
+A WIR++ Q
Sbjct: 503 IANVWIRERNQ 513
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + +F + E+ + I +AK L +T+ K V T R S ++ ED
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKLARATVRDPKS-GVLTTASYRVSKSAWLEGEEDP- 396
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + ++I +T L E + Y +G +Y H+D E K RVA
Sbjct: 397 -IIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVA 455
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+FL Y++D+E GG T+FP D+ G + PR+G + +Y+L +G D
Sbjct: 456 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFKSGEGDY 500
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 501 RTRHAACPVLVGNKWVSNKWIHERGQ 526
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+VLS P + + +F + + + I A+L LR S +A + T R S ++
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVATGDKQA---TAEYRISKSAWLK 392
Query: 77 AAEDESGTLDLIEEKIAKVTML--PRINGEAFNILRYKIGQKYNSHYD-AFDPQE--YGP 131
+ T+ +++KI+ +T L +GE ++ Y IG Y H+D A P +
Sbjct: 393 GSAH--STVSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 450
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ RVA+F++YL+ +E GG T F + N V + + +++L
Sbjct: 451 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 495
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NG D ++H CPV+ G+KWVA KWI + Q
Sbjct: 496 RNGEGDADTLHAGCPVLIGDKWVANKWIHEYGQ 528
>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 490
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 108/214 (50%), Gaps = 32/214 (14%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINM---AKLNLRPSTLALRKGETVDN--TQGIRTSSGV 73
+S P + F NF T E+C +I + AK ++ + D+ ++G RTS
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKG-RTSENA 342
Query: 74 FISAAED--ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE--- 128
+ S + + T LI ++I+ VT +P + E F IL+Y+ GQ Y SH+D + QE
Sbjct: 343 WCSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRR 402
Query: 129 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
GP RV +F +YL+D+EEGG+T FP + + VKP++G +L+
Sbjct: 403 CGP----RVLTFFLYLSDVEEGGDTNFPK---------------LSIAVKPKKGSAVLWP 443
Query: 189 SLLPN--GTIDPTSIHGSCPVVKGEKWVATKWIR 220
S+L + DP + H + VV G K+ A W+
Sbjct: 444 SVLDSNPSMKDPRTDHEAQEVVNGTKFGANAWLH 477
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 102/213 (47%), Gaps = 21/213 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I ++LS P + + +P++ I + +K + PS + V RTS V
Sbjct: 322 IKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFPSETVNAANDFV--VSKFRTSKSV 379
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYGP 131
++ +E+ T+ L + ++A T L + E F ++ Y IG + SH+D D +
Sbjct: 380 WLDRDANEA-TVKLTQ-RLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVG 437
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R+A+ L YL D+ +GG T FP G+N + V PR G L +Y+L
Sbjct: 438 GFIDRIATTLFYLNDVPQGGATHFP---GLN------------ITVFPRLGAALFWYNLD 482
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 483 TQGMLQVRTMHTGCPVIVGSKWVVSKWIDDKGQ 515
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 94/203 (46%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + +P+Q + MA +++ ST+ G + R S ++ D
Sbjct: 332 PLVVSYHDMLSPQQIIELRQMAVPHMKRSTVNPLPGRQSKKS-AFRVSKNAWLEY--DTH 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
+ + ++ T L E + Y +G Y H+D F D Q Y ++ R+A+ +
Sbjct: 389 PMMGRMLRDLSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAI 448
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+D+E+GG T FPF N V+P+ G+ L +Y+L + +D +
Sbjct: 449 FYLSDVEQGGATAFPFLN---------------FAVRPQLGNILFWYNLHRSLDMDYRTK 493
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+A WI + Q
Sbjct: 494 HAGCPVLKGSKWIANIWIHEATQ 516
>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
Length = 422
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/201 (30%), Positives = 88/201 (43%), Gaps = 38/201 (18%)
Query: 35 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAK 94
++C+ ++ +A+ +LR S + + T IRTS G TLD I E A
Sbjct: 244 DECRLLMLLARPHLRASQVVDPNDASTHRTP-IRTSRG----------ATLDPILEDFAA 292
Query: 95 VTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVASFLV 142
LP + EA ++L Y G+ Y +H D P P R+ + V
Sbjct: 293 RAAQARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACV 352
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D++ GGET FP G++V+PR G + F +L +G DP S+H
Sbjct: 353 YLNDVDAGGETEFPVA---------------GIRVQPRAGSVVCFDNLQADGCPDPDSLH 397
Query: 203 GSCPVVKGEKWVATKWIRDQE 223
PV G KW+ T W R Q
Sbjct: 398 AGLPVTTGSKWLGTLWFRQQR 418
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 89/191 (46%), Gaps = 22/191 (11%)
Query: 38 KSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTM 97
+S+ A+ ++ ST+ G RTS G + + S L+ + +
Sbjct: 341 ESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQGASFNYSR--SAATKLLSHHVGDFSG 398
Query: 98 LPRINGEAFNILRYKIGQKYNSHYDAFDPQEY----GPQKSQRVASFLVYLTDLEEGGET 153
L E + Y IG Y H+D+F P+ + G R+A+ + YL+D+E GG T
Sbjct: 399 LNMEYAEDLQVANYGIGGHYEPHWDSF-PENHVYQEGDLHGNRIATGIYYLSDVEAGGGT 457
Query: 154 MFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKW 213
FPF + L V P +G L +Y+L P+G D + H +CPV++G KW
Sbjct: 458 AFPF---------------LPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502
Query: 214 VATKWIRDQEQ 224
+A WIR++ Q
Sbjct: 503 IANVWIRERNQ 513
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 102/213 (47%), Gaps = 25/213 (11%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+ LS P + + +F + + + I + A+L LR S +A R + T R S ++
Sbjct: 81 ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQV---TAEYRISKSAWLK 137
Query: 77 AAEDESGTLDLIEEKIAKVTML--PRINGEAFNILRYKIGQKYNSHYD-AFDPQE--YGP 131
+ + + ++++I+ +T L +GE ++ Y IG Y H+D A P +
Sbjct: 138 GSAQSA--VSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 195
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
+ RVA+ ++YL+ +E GG T F + N V + + +++L
Sbjct: 196 KTGNRVATVMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 240
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NG DP ++H CPV+ G+KWVA KWI + Q
Sbjct: 241 RNGRGDPDTLHAGCPVLIGDKWVANKWIHEYGQ 273
>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 99/211 (46%), Gaps = 35/211 (16%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LSW PR + F + E+ +I+ LR T + G+ TQ +G
Sbjct: 61 LSWQPRVFLYRGFLSEEESDHLIS-----LRKDTSEVTSGDADGKTQLDPVVAG------ 109
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
IEEKI+ T LPR NG + + Y +K D F + + +A
Sbjct: 110 ---------IEEKISAWTFLPRENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLRESLLA 159
Query: 139 SFLVYLTDLEEGGETMFPF-----ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
+ ++YL++ +GGE +FP + + DG+ ++P +G+ +LF+S L N
Sbjct: 160 TVVLYLSNTTQGGELLFPNSEVKPKKSCSEDGNI---------LRPVKGNAVLFFSRLLN 210
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
++D TS H CPVVKGE VATK I ++Q
Sbjct: 211 ASLDETSTHLICPVVKGELLVATKLIYAKKQ 241
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 76/164 (46%), Gaps = 24/164 (14%)
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
RTS + + +D IE +IA++ P +GE +LRY G +Y HYD
Sbjct: 2 VHAARTSDSMCLRVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDY 59
Query: 124 FDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK 178
FDP G QRVAS ++YL E GG T FP + L V
Sbjct: 60 FDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVA 104
Query: 179 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+G+ + F P+ S+H PV+ GEKWVATKW+R++
Sbjct: 105 AVKGNAVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLRER 146
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 106/217 (48%), Gaps = 27/217 (12%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTS 70
+ +++ P + + + + + + MA +L+ +T+ +L K E V RTS
Sbjct: 318 LKMEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKT----RTS 373
Query: 71 SGVFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 129
++ D +L L + +I +T E ++ Y +G Y+ HYD F+ E
Sbjct: 374 K---VAWFPDSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEK 430
Query: 130 GPQKS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
+ R+A+ L Y++D+E+GG T+FP I V P++G +++Y
Sbjct: 431 SSSLTGDRIATVLFYMSDVEQGGATVFP---------------NIYKTVYPQRGTAVMWY 475
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
+L +G D ++H +CPV+ G KWV KWIR++ Q+
Sbjct: 476 NLKDDGQPDEQTLHAACPVLVGSKWVCNKWIRERAQF 512
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 109/247 (44%), Gaps = 40/247 (16%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI---RTS 70
+ + LS PR NF E+ SII A L + L++ T + I RTS
Sbjct: 205 VVLETLSHSPRVFSLYNFMDMEEADSIIEDA-LGMTQEAYRLKRSSTGTKGKAISKTRTS 263
Query: 71 SGVFISAAEDESGTLDLIEEKIAKVTMLPRIN---GEAFNILRYKIGQKYNSHYDAFDPQ 127
F++ + T ++ +I ++ + + + +LRY Q Y +H+D +
Sbjct: 264 DNAFVT----HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLESA 319
Query: 128 EYGPQKSQ-----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--------- 173
E KS+ R A+ ++Y D+ EGGET+F G++ D + +
Sbjct: 320 EGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLENLDL 379
Query: 174 ---------------GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 218
+ V P++G +LFY+ P+G D +S HG+CPV+ G+KW A W
Sbjct: 380 PRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAANLW 439
Query: 219 IRDQEQY 225
+ + +Y
Sbjct: 440 VWNGPRY 446
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + E+ +++ +AK L +T+ + + T R S ++S E+
Sbjct: 343 PRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKL-TTAHYRVSKSAWLSGYENP- 400
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 401 -IVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 459
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V PR+G + +Y+L P+G D
Sbjct: 460 TWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPSGEGDY 504
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 505 STRHAACPVLVGNKWVSNKWIHERGQ 530
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 103/216 (47%), Gaps = 25/216 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ +++ P + + + +P + + MAK L+ + R + NT + +
Sbjct: 279 LKMELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRA----RVYNSTKNTDQLSKTRTA 334
Query: 74 FISAAEDESGTL-DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
++ D L + + ++I +T E ++ Y +G Y H+D F+ + GP
Sbjct: 335 KLAWFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTK-GPH 393
Query: 133 KSQ----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
+Q R+A+ L YL D+E+GG T+FP I V P++G +++Y
Sbjct: 394 ITQINGDRIATVLFYLNDVEQGGATVFP---------------EIKKAVFPKRGSAIMWY 438
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L +G + ++H CPV+ G KWV KWIR++EQ
Sbjct: 439 NLKDDGEGNRDTLHAGCPVIVGSKWVCNKWIREREQ 474
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V PR+G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVFNKWLHERGQ 524
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 290 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 346
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 347 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 406
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 407 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 451
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 452 STRHAACPVLVGNKWVSNKWLHERGQ 477
>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
Length = 446
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 83/158 (52%), Gaps = 21/158 (13%)
Query: 68 RTSSGVFISAAEDESGTL-DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
R+ VFI E E G L IE ++ ++ L + +++ Y IG Y H+D+F
Sbjct: 296 RSGKNVFI---ELEKGELVKTIEMRVTDMSGLSMEGSDDLSLINYGIGGHYIPHHDSFSE 352
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+E + R+A+ L YL+D+E GG T FP N L + P +G +L
Sbjct: 353 EE--NKTEDRIATALFYLSDVELGGATTFPLLN---------------LTISPEKGTAVL 395
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+++L +GT P ++H +CPV+ G K+V TKWI + +Q
Sbjct: 396 WHNLKDSGTPHPKTVHAACPVIVGSKYVMTKWIYNMDQ 433
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 95/214 (44%), Gaps = 26/214 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LS P + + + + +I + L+ +T+ V N +RTS F+
Sbjct: 298 LSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATITSTNESVVSN---VRTSQFTFLPVT 354
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQ 132
ED+ L I+ ++A +T E Y IG Y H D F + P+
Sbjct: 355 EDK--VLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGLVSSPE 412
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+A+ L YL+D+ +GG T FP + + +KP++ +Y+L
Sbjct: 413 MGNRIATVLFYLSDVTQGGGTAFPH---------------LRVLLKPKKYAAAFWYNLHA 457
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+G DP + HG+CP++ G KWV +WIR+ Q D
Sbjct: 458 SGVGDPRTQHGACPIISGSKWVQNRWIREFIQSD 491
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 317 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 373
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 374 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 433
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 434 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 478
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 479 STRHAACPVLVGNKWVSNKWLHERGQ 504
>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 188
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 97/208 (46%), Gaps = 29/208 (13%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
VL+ P NF TP +C+ +I+MA+ + P+ + G+ RTSS ++S
Sbjct: 1 VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVV---GKGAGEVSPSRTSSTCYLSR 57
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEYGPQ 132
+ L + K++ +T P + E + RY Q+Y HYDAFD +
Sbjct: 58 ED-----LPDLMRKVSSLTGKPIEHCELPQVGRYFPSQQYLQHYDAFDLGTEDGLRFAAN 112
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
QR + L+YL D+ GG T FP + L V+PRQG L+F+
Sbjct: 113 GGQRTITVLLYLNDVARGGATRFP---------------ALNLDVQPRQGMALVFFPATI 157
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+G +D ++H + P V K+V+ WIR
Sbjct: 158 DGMLDRMALHAAMPAVD-TKYVSQVWIR 184
>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
Length = 190
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 98/209 (46%), Gaps = 27/209 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + +C II + +R S + G+ T RTS ++ + S
Sbjct: 1 PRVFLVREMLSEFECDHIIELGTKVVRKSMV----GQGGGFTSKTRTSENGWLRRSA--S 54
Query: 83 GTLDLIEEKIAKVTMLPR------INGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 136
L+ I ++ V + N E ++RY Q+Y H+D D + PQ QR
Sbjct: 55 PILENIYKRFGDVLGIDHDLLRSGKNAEELQVVRYDRSQEYAPHHDFGD--DGTPQ--QR 110
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+ L+Y+ EEGG T FP N DG +G++V P +GD +LFYS+LP+G
Sbjct: 111 FLTLLLYIQLPEEGGATSFPKAN----DG-------MGVQVVPARGDAVLFYSMLPDGNA 159
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
D ++H PV KG+KWV W+ D ++
Sbjct: 160 DDLALHAGMPVRKGQKWVCNLWVWDPHRH 188
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 96/216 (44%), Gaps = 24/216 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF++ L P + +S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 130 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWY 477
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
Length = 496
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 94/197 (47%), Gaps = 28/197 (14%)
Query: 25 ALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGT 84
ALY + EQ ++ L S L ++G D IRT + S A + + T
Sbjct: 312 ALYHEVVSAAEQRHLML------LSESQLQRQRGHQYDK---IRTFASA--SVAANATPT 360
Query: 85 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ--KSQRVASFLV 142
++ + ++ +T L E IL Y IG +Y H D PQ + K R+A+ L+
Sbjct: 361 VEQLHRRLEDITGLDLAESEPLRILNYGIGGQYYIHVDCEQPQTHVEPYPKEYRLATVLL 420
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL+D+ GG T FP +GL ++P +G L++++ G D ++H
Sbjct: 421 YLSDVRLGGFTSFP---------------ALGLGIRPNRGSALVWHNANNAGNCDYRALH 465
Query: 203 GSCPVVKGEKWVATKWI 219
+CPV+ G +WVA+KWI
Sbjct: 466 AACPVLLGTRWVASKWI 482
>gi|156406532|ref|XP_001641099.1| predicted protein [Nematostella vectensis]
gi|156228236|gb|EDO49036.1| predicted protein [Nematostella vectensis]
Length = 410
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 92/208 (44%), Gaps = 30/208 (14%)
Query: 24 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAED--- 80
R L + TP C ++ + NL+ LA + E D T G + F D
Sbjct: 160 RRLEILDKMTPIMCFDGVDTLRKNLKELKLAYKVSER-DFTMGTTCLNETFSGKLRDHFK 218
Query: 81 ----------ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 130
E+ + ++I T L NG F I Y IG Y SH D E
Sbjct: 219 WSHSMAFYTGENKFSTMYAKRIQAATGLREENGGKFQITGYPIGVGYKSHTDCV-VYEGE 277
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
P+K R A+ LVYL D+EEGGET FP +G+KVKP++G L++ S+
Sbjct: 278 PEKRDRYATILVYLQDVEEGGETDFPL---------------LGIKVKPKKGLALVWNSM 322
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKW 218
G DP S+H + V KG K++ +W
Sbjct: 323 DARGNCDPLSLHDAKQVTKGHKYIIQRW 350
>gi|85708137|ref|ZP_01039203.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
gi|85689671|gb|EAQ29674.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
Length = 211
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 31 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 90
F +PE C +I + L RPST+A G+ RTS + A E + +E
Sbjct: 34 FISPELCAELIRLIDLGRRPSTIADANGDDY-----FRTSETCDLDANET---AVKELEA 85
Query: 91 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYLT 145
+ + + GE RY +GQ++ +H D F P +++ + QR +F++YL
Sbjct: 86 RFFSLNGIDPKYGEPVQGQRYDVGQEFKAHTDYFTPGGADFEKFCAESGQRTWTFMIYLN 145
Query: 146 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 205
++E GG T F K I +P G + + + P+G+++P ++H
Sbjct: 146 EVEAGGATRF---------------KVIKKSFQPETGKLVCWNNARPDGSVNPATLHHGM 190
Query: 206 PVVKGEKWVATKWIRDQE 223
V KG K+V TKW R++E
Sbjct: 191 KVRKGVKYVITKWYRERE 208
>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
ATCC 15444]
Length = 298
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 99/209 (47%), Gaps = 33/209 (15%)
Query: 23 PRA-LY-FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG--IRTSSGVFISAA 78
P+A LY +PNF PE C ++I + LR ST T D IRTS I
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRAST-------TTDAFADPKIRTSRSSDIGTM 152
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQK 133
+ ++E IA+ + +A RY + Q+Y +HYD F P Q +
Sbjct: 153 G--HNLVMQLDELIAEALGIHWSYSDATQTQRYDVNQEYKAHYDYFTPGTRDYQVHCQFT 210
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
QR +F++YL D+EEGG T F + + + P +G +++ +L P+
Sbjct: 211 GQRTWTFMIYLNDVEEGGGTRF---------------RRLEKTIMPEKGKAVIWNNLNPD 255
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
G+++P +IH V G K+V TKW R++
Sbjct: 256 GSVNPYTIHHGMKVRSGAKYVITKWFRER 284
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 98/207 (47%), Gaps = 25/207 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA--AED 80
P+ N T + + I +A+ LR + + E+ +G S + SA +
Sbjct: 322 PKLWVLHNILTDPEMEVIKKLAQPRLRRARV-----ESPTTGEGELASYRISKSAWLYDW 376
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYG--PQKSQRV 137
E + + +++ VT L E ++ Y IG Y H+D A +E+ P + R+
Sbjct: 377 EHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALDPNEGDRI 436
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ L Y++D+E GG T+FP +G +V P +G G +Y+LL +G D
Sbjct: 437 ATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNLLKSGEGD 481
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H CPV+ G KWV+ KWI ++ Q
Sbjct: 482 MLTEHAGCPVLVGSKWVSNKWIHERGQ 508
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 101/206 (49%), Gaps = 24/206 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 81
P + + N + ++ + I +AK L +T+ R +T V T R S ++ +D
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRLARATV--RDPKTGVLTTAPYRVSKSAWLEGEDDP 392
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ---KSQRVA 138
+D + ++I +T L E + Y +G +Y H+D F + + R+A
Sbjct: 393 --VIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRPFDSNLKVDGNRLA 449
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+FL Y++D+E GG T+FP D+ G + PR+G + +Y+L +G D
Sbjct: 450 TFLNYMSDVEAGGATVFP-----------DF----GASIWPRKGTAVFWYNLFRSGEGDY 494
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 RTRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 91/207 (43%), Gaps = 20/207 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + ++C+ ++ +A+ +LR S + + + +RTS G + ++
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVAS 139
+ ++A LP + E ++L Y G++Y +H D P P R +
Sbjct: 287 AA-RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRT 345
Query: 140 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 199
VYL D+ GGET FP G++V+PR G + F +L +G D
Sbjct: 346 VCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDAD 390
Query: 200 SIHGSCPVVKGEKWVATKWIRDQEQYD 226
S+H PV G KW+ T W R Q D
Sbjct: 391 SLHAGLPVTAGSKWLGTLWFRQQRYRD 417
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 74/143 (51%), Gaps = 17/143 (11%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK--SQR 136
++ES + + ++A +T L E ++ Y IG Y+ H+D +E +K R
Sbjct: 75 DEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAFEKFNGNR 134
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+ L Y++D+ +GG T+F +GL V PR+G + + +L P+G
Sbjct: 135 IATVLFYMSDVAQGGATVF---------------TELGLSVFPRRGSAVFWLNLHPSGEG 179
Query: 197 DPTSIHGSCPVVKGEKWVATKWI 219
D + H +CPV++G KWV KWI
Sbjct: 180 DLATRHAACPVLRGSKWVCNKWI 202
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 91/207 (43%), Gaps = 20/207 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + ++C+ ++ +A+ +LR S + + + +RTS G + ++
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVAS 139
+ ++A LP + E ++L Y G++Y +H D P P R +
Sbjct: 287 AA-RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRT 345
Query: 140 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 199
VYL D+ GGET FP G++V+PR G + F +L +G D
Sbjct: 346 VCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDAD 390
Query: 200 SIHGSCPVVKGEKWVATKWIRDQEQYD 226
S+H PV G KW+ T W R Q D
Sbjct: 391 SLHAGLPVTAGSKWLGTLWFRQQRYRD 417
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 17/146 (11%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+ E L ++ +++A +T L E F ++ Y IG Y H+D P R+
Sbjct: 380 DQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF--QSTVDPAIGSRIE 437
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L YL+D+E+GG T+FP I + V P++G +++++L P+G D
Sbjct: 438 TVLFYLSDVEQGGATVFP---------------EIQVSVWPQKGSAVVWFNLHPSGDGDQ 482
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H CPV+ G KW+ATKWI ++ Q
Sbjct: 483 RTKHAGCPVLIGSKWIATKWIHERGQ 508
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 17/146 (11%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+ E L ++ +++A +T L E F ++ Y IG Y H+D P R+
Sbjct: 374 DQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF--QSTVDPAIGSRIE 431
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L YL+D+E+GG T+FP I + V P++G +++++L P+G D
Sbjct: 432 TVLFYLSDVEQGGATVFP---------------EIQVSVWPQKGSAVVWFNLHPSGDGDQ 476
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H CPV+ G KW+ATKWI ++ Q
Sbjct: 477 RTKHAGCPVLIGSKWIATKWIHERGQ 502
>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
Length = 195
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 95/215 (44%), Gaps = 33/215 (15%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
VL P + TP +C+ +I ++K +++P+ GE + F
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGERKSD----------FTWL 56
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQ 132
G + + E IA LP + E RY++G K+++H D + D + Q
Sbjct: 57 PHYSHGLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
QR+ + ++YL + GGET FP + L V P +G LL +
Sbjct: 117 GGQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGK-LLVFENCK 160
Query: 193 NGTIDP--TSIHGSCPVVKGEKWVATKWIRDQEQY 225
GT +P S+H C V +GEKW+AT W R++ QY
Sbjct: 161 RGTNEPHPLSLHEGCAVHEGEKWIATLWFREKPQY 195
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 76/145 (52%), Gaps = 20/145 (13%)
Query: 85 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVAS 139
+D ++ +I VT L + +A + Y IG Y+ HYD D E + R+A+
Sbjct: 394 VDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIAT 453
Query: 140 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 199
FL+Y+TD++ GG T+FP I ++V P++G + +Y+L +G
Sbjct: 454 FLLYMTDVDAGGATVFPI---------------IDVRVLPKKGTAVFWYNLRRSGKGIME 498
Query: 200 SIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWIR + Q
Sbjct: 499 TRHAACPVLVGTKWVSNKWIRTRGQ 523
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 79/161 (49%), Gaps = 21/161 (13%)
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDP 126
R + F+ + E + + ++ +T L E + Y IG Y H+D A
Sbjct: 373 RIAKAAFLK--DSEHNLIVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDYARQG 430
Query: 127 QEYGPQK---SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
+ +GP+ R+A++L Y++D+E GG T+FP +G + P++G
Sbjct: 431 EIHGPRDLDWGNRIATWLFYMSDVEAGGATVFP---------------AVGAALWPQKGS 475
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L PNG D ++H CPV+ G KWV+ KWI ++ Q
Sbjct: 476 AAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIHERSQ 516
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 100/207 (48%), Gaps = 24/207 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 81
P + + N + + + I +AK L +T+ R +T V T R S ++ ED
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRLARATV--RDPKTGVLTTANYRVSKSAWLEGEEDP 396
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRV 137
++ + ++I +T L E I Y +G +Y H+D E K+ RV
Sbjct: 397 --VIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRV 454
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G D
Sbjct: 455 ATFLNYMSDVEAGGATVFP-----------DF----GAAIYPKKGTAVFWYNLFRSGEGD 499
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 YRTRHAACPVLVGCKWVSNKWIHERGQ 526
>gi|395492951|ref|ZP_10424530.1| putative oxygenase [Sphingomonas sp. PAMC 26617]
Length = 226
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 29 PNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 88
P+F C + M N R ST+ D Q RTS + + S +
Sbjct: 46 PDFLDAATCAKFVEMIDANRRRSTVLAD-----DAVQAFRTSESCDM---DRWSPDVRPT 97
Query: 89 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASFLVY 143
+E IA + + ++GE RY +GQ + +H D F + Q Y P+ QR + ++Y
Sbjct: 98 DEAIAALLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
L D+EEGG T FP +G+++ P++G + + ++ +G+ +P ++H
Sbjct: 158 LNDVEEGGATWFPL---------------VGVRIAPKRGLLIAWNNMRADGSPNPDTLHE 202
Query: 204 SCPVVKGEKWVATKWIRD 221
PV G K++ TKW R+
Sbjct: 203 GMPVTAGTKYIITKWFRE 220
>gi|334140935|ref|YP_004534141.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
gi|333938965|emb|CCA92323.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
Length = 209
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 99/198 (50%), Gaps = 28/198 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
+F QC ++I + + RPST+A G+ V RTSS +S + + +
Sbjct: 32 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSP---DVPAVAALA 83
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
K+ ++ + + E RY++GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 84 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 143
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
D++ GG T F K I ++P +G + + + P+G+++P ++H +
Sbjct: 144 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 188
Query: 205 CPVVKGEKWVATKWIRDQ 222
V +G K+V T+W R++
Sbjct: 189 MKVRQGRKYVVTQWFRER 206
>gi|404253277|ref|ZP_10957245.1| putative oxygenase [Sphingomonas sp. PAMC 26621]
Length = 226
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 29 PNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 88
P+F C + M N R ST+ D Q RTS + + S +
Sbjct: 46 PDFLDAATCAKFVEMIDANRRRSTVLAD-----DAVQAFRTSESCDM---DRWSPDVRPT 97
Query: 89 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASFLVY 143
+E IA + + ++GE RY +GQ + +H D F + Q Y P+ QR + ++Y
Sbjct: 98 DEAIATLLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 203
L D+EEGG T FP +G+++ P++G + + ++ +G+ +P ++H
Sbjct: 158 LNDVEEGGATWFPL---------------VGVRIAPKRGLLIAWNNMRADGSPNPDTLHE 202
Query: 204 SCPVVKGEKWVATKWIRD 221
PV G K++ TKW R+
Sbjct: 203 GMPVTAGTKYIITKWFRE 220
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/204 (30%), Positives = 95/204 (46%), Gaps = 34/204 (16%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDE 81
+ + + +C+ +I L+PS + VD G +RTS I +
Sbjct: 181 YESILSEYECRYLIAKFSALLKPSMV-------VDPVTGRGKIDSVRTSYVAVIEPTHCD 233
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF----DPQEYGPQKSQRV 137
T L ++ I+++T R NGEA N+LRY GQ+Y HYD D + K QR+
Sbjct: 234 WITRKL-DKIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDALMFKDGK-QRI 291
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
+ LVYL + EGGET+FP + +++ P+ G ++F + NG +
Sbjct: 292 KTALVYLNTINEGGETLFPK---------------LDIRIAPKSGTMVVFSNSDENGKLL 336
Query: 198 PTSIHGSCPVVKGEKWVATKWIRD 221
S H P V KW+ TKWIR+
Sbjct: 337 LNSYHAGAPTVSENKWLVTKWIRE 360
>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
Length = 173
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/144 (37%), Positives = 75/144 (52%), Gaps = 28/144 (19%)
Query: 85 LDLIEEKIAKVTMLPRINGE--AFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 142
L IE++I+ + +P NGE FN+ R QRVA+ L+
Sbjct: 55 LQAIEKRISVYSQVPVENGELIQFNLKR----------------------GGQRVATMLI 92
Query: 143 YLTDLEEGGETMFPFE-NGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+D EGGET FP +G G + GL V P +G+ +LF+S+ +G DP SI
Sbjct: 93 YLSDNVEGGETYFPMAGSGFCRCGGKSVR---GLSVAPVKGNAVLFWSMGLDGQSDPNSI 149
Query: 202 HGSCPVVKGEKWVATKWIRDQEQY 225
HG C V+ GEKW ATKW+R + +
Sbjct: 150 HGGCEVLAGEKWSATKWMRQRSTH 173
>gi|393725345|ref|ZP_10345272.1| putative oxygenase [Sphingomonas sp. PAMC 26605]
Length = 226
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 95/200 (47%), Gaps = 28/200 (14%)
Query: 27 YFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLD 86
+ P+F C +++ M N R ST+ +N Q RTS + + S +
Sbjct: 44 HHPDFLDAATCDTLVAMIDANKRRSTVLAE-----ENVQEFRTSESCDM---DRWSPDVR 95
Query: 87 LIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASFL 141
+E IA + + + GE RY +GQ + +H+D F + Q Y PQ QR + +
Sbjct: 96 PTDEAIAHLLGIDPVYGETMQGQRYAVGQHFRAHFDYFNEKQAYWPQMIETGGQRTWTAM 155
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL + EGG T FP IG++V P++G L + ++ +G+ + ++
Sbjct: 156 IYLNHVAEGGATWFP---------------QIGIRVAPKKGLLLAWNNMNADGSRNTETL 200
Query: 202 HGSCPVVKGEKWVATKWIRD 221
H PVV G K++ TKW R+
Sbjct: 201 HEGMPVVSGTKYIVTKWFRE 220
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 108/205 (52%), Gaps = 13/205 (6%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSG-VFI 75
LSW PRA + F + E+C +I++A + LA G++ V + +++S G ++I
Sbjct: 60 LSWQPRAFLYRGFLSDEECDHLISLALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYI 117
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 135
+DE IE++I+ T LP+ N E +++Y+ + Y+ F +
Sbjct: 118 ---DDEVAAR--IEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEP 171
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
+A+ L++L+++ GGE FP + S + GL+ P +G+ +LF+++ PN +
Sbjct: 172 LMATVLLHLSNVTRGGELFFPESESKSGILSDCTESSSGLR--PVKGNAILFFNVHPNAS 229
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIR 220
D +S + CPV++GE W ATK+
Sbjct: 230 PDKSSSYARCPVLEGEMWCATKFFH 254
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 83/165 (50%), Gaps = 25/165 (15%)
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
RT+S +I D S + + ++ + + +P N E F ++ Y G +Y H+DAFD
Sbjct: 40 RTNSYAWIQ--HDASEIIHEVSKRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDKS 97
Query: 128 -EYGPQK----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 182
E G QR+ + L YL D+E+GG T FP I + VKP +G
Sbjct: 98 TEEGRNNWFPGGQRMVTALAYLNDVEDGGATDFP---------------DIHVSVKPNKG 142
Query: 183 DGLLFYSLLPNGT--IDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
D ++F++ +GT I+P S+HG PV+ GEKW W R + Y
Sbjct: 143 DVVVFHNC-KDGTSDINPNSLHGGSPVISGEKWAVNLWFRQEAIY 186
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 76/152 (50%), Gaps = 23/152 (15%)
Query: 80 DESGTLDLIEEKIAKVTML-PRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQ 132
+ S + D + +++ T L E F + Y IG Y H+D F +Y Q
Sbjct: 6 ENSASADKLSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFD-FSKVKYFTNPVLNEQ 64
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+A+F++YL D+E GG T+FP N L ++P + + +++LL
Sbjct: 65 MGDRIATFMIYLNDVEAGGRTVFPRLN---------------LVIEPIKNSAVFWHNLLD 109
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G D +IHG+CPVV G KWVA KWI + Q
Sbjct: 110 DGQQDDRTIHGACPVVLGRKWVANKWIHEYGQ 141
>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
Length = 311
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 106/209 (50%), Gaps = 23/209 (11%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 77
+SW PR + F + E+C +I++A + PS + G TV + + SSGV ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITV--STELLNSSGVILNT 116
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D + IE ++A T+LP+ + F I++Y+ G++ Y + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 138 ASFLVYLTDLEEGGETMFP-------FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
A+ ++YL+D GGE +FP F +G ++ ++P +G+ +LF+S+
Sbjct: 173 ATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNF---------LRPVKGNAILFFSV 223
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
N + D +S H P+ GE WVATK++
Sbjct: 224 HLNASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 71
PF+V L + P A++F + T E+ I +A LR +T+ GE T RTS
Sbjct: 322 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGEL--ETASYRTSK 379
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
++ E E + I +I +T L + E + Y IG Y+ H+D +E
Sbjct: 380 SAWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 437
Query: 132 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 438 FQSLNTGNRLATLLFYMTQPESGGATVFT---------------EVKTTVMPSKNDALFW 482
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 YNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 519
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 71
PF+V L + P A++F + T E+ I +A LR +T+ GE T RTS
Sbjct: 323 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGEL--ETASYRTSK 380
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
++ E E + I +I +T L + E + Y IG Y+ H+D +E
Sbjct: 381 SAWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 438
Query: 132 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 439 FQSLNTGNRLATLLFYMTQPESGGATVFT---------------EVKTTVMPSKNDALFW 483
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 YNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 520
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 99/213 (46%), Gaps = 26/213 (12%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ ++L P + + + + + I+ MA+ + ++ + T T RT+ G
Sbjct: 310 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTSSPT---RTAMGA 366
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYGP 131
++ + + I ++ ++ L E ++ Y IG Y H D F P+ G
Sbjct: 367 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQHPEVMG- 423
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R+A+ L YLTD+E+GG TMF KV PR+G L +Y+L
Sbjct: 424 ---NRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLH 465
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G D ++ H +CP++ G KWV T+WIR++ Q
Sbjct: 466 TDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 498
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + ++AK LR +T++ V T R S ++S ED
Sbjct: 338 PRIVRFHEIISDAEIEIVKDLAKPRLRRATIS-NPITGVLETAHYRISKSAWLSGYEDP- 395
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 396 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 454
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 455 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 499
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 STRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + ++ +++ +AK LR +T++ V T R S +++ E
Sbjct: 349 PYIVRYIDIISDKEIETVKKLAKPRLRRATIS-NPITGVLETASYRISKSAWLTGYE--H 405
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+++I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 406 PVIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 465
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L NG D
Sbjct: 466 TWLFYMSDVAAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFANGEGDY 510
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 511 STRHAACPVLVGNKWVSNKWIHERGQ 536
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 102/212 (48%), Gaps = 34/212 (16%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI--RTSSGVFISAAED 80
PR + F + + + + + ++AK LR +T++ N + + R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATIS---NPITGNLETVHYRISKSAWLSGYED 391
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQ 132
+ I +I +T L E + Y +G +Y H+D AF QE G
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF--QELGT- 446
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+A++L Y++D+ GG T+FP +G V P++G + +Y+L
Sbjct: 447 -GNRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 98/207 (47%), Gaps = 24/207 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 81
PR + + + ++ ++I MA+ + +T+ + GE R S ++ E E
Sbjct: 349 PRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGEL--EIANYRISKSAWLQ--EHE 404
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRV 137
+ + +++ +T + E ++ Y IG Y H+D +E KS R+
Sbjct: 405 HKHVRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRI 464
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 465 ATVLYYMSDVEQGGGTVFT---------------KINISLWPKKGSAAFWYNLKPNGEGD 509
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 510 YKTRHAACPVLTGSKWVANKWLHERGQ 536
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 97/216 (44%), Gaps = 24/216 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF++ L P + +S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
+ +++ L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSKN--AATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 130 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWY 477
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 102/212 (48%), Gaps = 34/212 (16%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI--RTSSGVFISAAED 80
PR + F + + + + + ++AK LR +T++ N + + R S ++S ED
Sbjct: 308 PRIIRFHDIISDAEIEIVKDLAKPRLRRATIS---NPITGNLETVHYRISKSAWLSGYED 364
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQ 132
+ I +I +T L E + Y +G +Y H+D AF QE G
Sbjct: 365 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF--QELGT- 419
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+A++L Y++D+ GG T+FP +G V P++G + +Y+L
Sbjct: 420 -GNRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 463
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 464 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 495
>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
Length = 212
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 91/200 (45%), Gaps = 23/200 (11%)
Query: 26 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 85
++F +PE+C +I + +PS + + T G R++ S + D+ +
Sbjct: 15 VHFSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRSTVA---SPSADKYPII 71
Query: 86 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ---KSQRVASFLV 142
+ +I+ + N E +L Y G +Y+ HYD+F E PQ R+ + L+
Sbjct: 72 KAVRRRISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSF--LEGSPQLENGGNRMLTVLL 129
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D+E+GG T FP I + P G G+LF + S+H
Sbjct: 130 YLNDVEQGGWTQFPH---------------IMANIVPNVGTGILFRNTDAQNLQLRESLH 174
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
PV+ GEKW+A+ WIR++
Sbjct: 175 AGLPVIDGEKWIASIWIREK 194
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 95/216 (43%), Gaps = 24/216 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF++ L P + + S+ A+ ++ ST+ G RTS G
Sbjct: 190 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 249
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 250 ASFNYSRN--AATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 306
Query: 130 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
G R+A+ + YL D+E GG T FPF + L V P +G L +Y
Sbjct: 307 EGDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWY 351
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 352 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 387
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 107/229 (46%), Gaps = 37/229 (16%)
Query: 15 PF--QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF +VLS PR + FP+F P +C+ ++++ L + + L G + +R ++
Sbjct: 48 PFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYL--GGPPEGGFSLRRTNK 105
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH--YDAFDPQEYG 130
V ++D L + +IA T L + E + + Y +G Y H Y F +
Sbjct: 106 V-AWMSDDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQGD 164
Query: 131 PQKS--QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
KS R+A+ L+YL D+ GG T F +N + L VKP G L +Y
Sbjct: 165 IYKSSGNRLATMLIYLADVAGGGATAF-----IN----------MRLAVKPTLGTALFWY 209
Query: 189 SLLP-NGTI------------DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L P +G I DP + H CPV+ G KW+ TKWI ++EQ
Sbjct: 210 NLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKWIVTKWIHEREQ 258
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 100/216 (46%), Gaps = 24/216 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF+V P F + + +I MA+ + +T+ ++ Q R S
Sbjct: 322 PFKVEEAHHRPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQ-YRISKS 380
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
++ E+E + + ++++ +T L E ++ Y IG Y H+D E
Sbjct: 381 AWLK--EEEHKHIADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAF 438
Query: 133 KS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
KS R+A+ L Y++D+E+GG T+FP I + + P++G +Y
Sbjct: 439 KSLGTGNRIATVLFYMSDVEQGGATVFP---------------SIQVSLWPQKGSAAFWY 483
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L P+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQ 519
>gi|363539943|ref|YP_004894760.1| mg709 gene product [Megavirus chiliensis]
gi|448825700|ref|YP_007418631.1| putative prolyl 4-hydroxylase [Megavirus lba]
gi|350611108|gb|AEQ32552.1| putative prolyl 4-hydroxylase [Megavirus chiliensis]
gi|371944083|gb|AEX61911.1| putative prolyl4-hydroxylase [Megavirus courdo7]
gi|425701637|gb|AFX92799.1| putative prolyl 4-hydroxylase [Megavirus courdo11]
gi|444236885|gb|AGD92655.1| putative prolyl 4-hydroxylase [Megavirus lba]
Length = 240
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 89/200 (44%), Gaps = 30/200 (15%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
NF P +C+ I+ + L S + K + N+Q +++ L++ E
Sbjct: 62 NFIEPSKCQEIMKNCRNKLFDSEVISGKNSKIRNSQQCWI--------PKNDPMVLNMFE 113
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYL 144
I+K +P N E ++RY GQ YN H+DA +E+ + QR + L+YL
Sbjct: 114 N-ISKQFGIPFENAEDLQVVRYLPGQYYNEHHDACCDDTDKCREFISRGGQRKLTVLIYL 172
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN-GTIDPTSIHG 203
+ EGG T F K + L+ KP GD L+FY L N P S+H
Sbjct: 173 NNEFEGGCTYF---------------KNLELRAKPSTGDALVFYPLAKNVNKCHPLSLHA 217
Query: 204 SCPVVKGEKWVATKWIRDQE 223
PV GEKW+A W R+
Sbjct: 218 GMPVTSGEKWIANIWFRENR 237
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 99/213 (46%), Gaps = 26/213 (12%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ ++L P + + + + + I+ MA+ + ++ + T T RT+ G
Sbjct: 335 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTSSPT---RTALGA 391
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYGP 131
++ + + I ++ ++ L E ++ Y IG Y H D F P+ G
Sbjct: 392 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQHPEVMG- 448
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
R+A+ L YLTD+E+GG TMF KV PR+G L +Y+L
Sbjct: 449 ---NRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLH 490
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G D ++ H +CP++ G KWV T+WIR++ Q
Sbjct: 491 TDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 523
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 95/211 (45%), Gaps = 20/211 (9%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+V+ P + N T ++ I ++K L S + G Q RTS +I
Sbjct: 333 EVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRTSKSAWIE 392
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQK 133
++E + + E+ + +T L E F ++ Y IG Y H+D P E + P+
Sbjct: 393 --DEEHPMIRRVSERTSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIATFDPEV 450
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+ + + Y+ E GG T+FP +G+K+ P +G ++++L+ N
Sbjct: 451 GNRIITVIFYVAAPEAGGATVFP---------------DLGVKLWPEKGSCAVWWNLMRN 495
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G D + H CP + G KW+A KW ++ Q
Sbjct: 496 GEGDYRTKHAGCPTITGSKWIANKWYHERGQ 526
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 96/216 (44%), Gaps = 24/216 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF++ L P + + S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 130 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRMATGIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWY 477
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 96/215 (44%), Gaps = 21/215 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
+ +VL P + T ++ K II AK LR + + + G+ + R S
Sbjct: 324 LKMEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLI--YADYRVSKN 381
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQEY 129
+I AED I ++ VT L E + Y I +Y H+D P+ +
Sbjct: 382 TWI--AEDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRPKHF 439
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
R+A+ L+YL+D++ GG T+F G+ P +G G+ +Y+
Sbjct: 440 DRWGGNRIATMLLYLSDVDWGGRTVFT-------------NTAPGVGTDPIKGAGVFWYN 486
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
LL NG +P + H CPVV G+KWVA WI + Q
Sbjct: 487 LLRNGKSNPKTQHAGCPVVLGQKWVANLWIHEHGQ 521
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 112/211 (53%), Gaps = 20/211 (9%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSG-VFI 75
LSW PRA + F + E+C +I++A + LA G++ V + +++S G ++I
Sbjct: 60 LSWQPRAFLYRGFLSDEECDHLISLALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYI 117
Query: 76 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI---GQKYNSHYDAFDPQEYGPQ 132
+DE IE++I+ T LP+ N E +++Y+ QKYN ++ ++G
Sbjct: 118 ---DDEVAAR--IEKRISAWTFLPKENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFG-- 169
Query: 133 KSQRVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
+A+ L++L+++ GGE FP +N + G ++P +G+ +LF++
Sbjct: 170 -EPLMATVLLHLSNVTRGGELFFPESELKNSQSKSGILSDCTESSSGLRPVKGNAILFFN 228
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+ PN + D +S + CPV++GE W ATK+
Sbjct: 229 VHPNASPDKSSSYARCPVLEGEMWCATKFFH 259
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 23/206 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSGVFISAAEDE 81
P+ N + + + I +A+ LRP+ T G V ++ I ++ ++ E
Sbjct: 321 PKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLYYW----E 376
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYG--PQKSQRVA 138
++ +++++ T L E ++ Y IG Y H+D A +E+ P + R+A
Sbjct: 377 HRLINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDRIA 436
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+E GG T+FP +G +V P +G G +Y+LL +G D
Sbjct: 437 TMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNLLKSGEGDM 481
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H CPV+ G KWV+ WI ++ Q
Sbjct: 482 LTEHAGCPVLVGSKWVSNMWIHERGQ 507
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + ++ +++ MAK + +T+ K ++ R S ++ + E +
Sbjct: 336 PMIVVYHDVMSDDEIETVKKMAKPRFKRATIRNSKTGELE-PANYRISKSAWLKSEEHDH 394
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 134
+ + ++ +T L E ++ Y IG Y H+D AF +G
Sbjct: 395 --ILKVTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTEAFKELGWG---- 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
R+A++L Y++D+E GG T+FP G V PR+G +Y+L PNG
Sbjct: 449 NRIATWLFYMSDVEAGGATVFP---------------PTGAAVWPRKGSAAFWYNLYPNG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ + H +CPV+ G KWV+ +WI + Q
Sbjct: 494 KGNELTRHAACPVLSGSKWVSNRWIHEHRQ 523
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 86.7 bits (213), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 95/216 (43%), Gaps = 24/216 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF++ L P + + S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 130 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
G R+A+ + YL D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWY 477
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 86.7 bits (213), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 71
PF+V L + P A+ F + T E+ I +A LR +T+ GE T RTS
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGEL--ETASYRTSK 379
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
++ E E + I ++I +T L + E + Y IG Y+ H+D +E
Sbjct: 380 SAWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 437
Query: 132 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 438 FQSLNTGNRLATLLFYMTQPESGGATVFT---------------EVKTTVMPSKNDALFW 482
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 YNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 86.7 bits (213), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 71
PF+V L + P A+ F + T E+ I +A LR +T+ GE T RTS
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGEL--ETASYRTSK 379
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 131
++ E E + I ++I +T L + E + Y IG Y+ H+D +E
Sbjct: 380 SAWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 437
Query: 132 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 438 FQSLNTGNRLATLLFYMTQPESGGATVFT---------------EVKTTVMPSKNDALFW 482
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 YNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 86.7 bits (213), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
Length = 210
Score = 86.7 bits (213), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 95/211 (45%), Gaps = 27/211 (12%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+LS P Y N ++C II + L+P AL G + RT + ++S
Sbjct: 4 HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKP---ALVSGNSRGFLSTGRTGTNCWLS 60
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYG---- 130
DE T + I KI + P N E F +L Y QKY HYDAF D E
Sbjct: 61 HKNDEI-TFN-IALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAFPIDNSEKAKRCL 118
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
+ QR+ + L+YL ++ +GGET F K + +K+ P+ G L+F +
Sbjct: 119 KKGGQRLLTALIYLNNVTKGGETEF---------------KNLNIKITPKIGRILVFENT 163
Query: 191 LPNG-TIDPTSIHGSCPVVKGEKWVATKWIR 220
L N P S+H V++GEK+V W R
Sbjct: 164 LQNSLNKHPDSLHSGKQVIEGEKYVINLWFR 194
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + + + + I MA+ L+ +T+ K ++ R S ++ E E
Sbjct: 44 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLK--EHED 100
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +++ +T L E ++ Y +G Y+ HYD +E KS R+A
Sbjct: 101 VVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 160
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+ +GG T+FP+ +G+ ++P +G ++++L P+G D
Sbjct: 161 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 205
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV++G KWV KW+ + Q
Sbjct: 206 RTRHAACPVLQGSKWVCNKWLHEAGQ 231
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + + +++ +AK LR +T+ + + T R S ++ A E
Sbjct: 332 PHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQL-TTAPYRVSKSAWLGAFE--H 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+D I ++I +T L E + Y +G +Y HYD E K R+A
Sbjct: 389 PVVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTGNRIA 448
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L+Y+++++ GG T+F IG V P++G + +Y+L P+G D
Sbjct: 449 TWLLYMSEVQAGGATVFT---------------DIGASVSPKKGSAVFWYNLHPSGDGDY 493
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 RTRHAACPVLLGNKWVSNKWIHERGQ 519
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 75/147 (51%), Gaps = 19/147 (12%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE----YGPQKS 134
+D+ + I E+I+ +T L E + Y +G +Y H+D E Q
Sbjct: 372 DDDGPEVAKITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQDG 431
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
+R+A+FL+YL+D+E GG T F +NA G+ KP +G + +Y++ P+G
Sbjct: 432 ERIATFLIYLSDVEVGGRTAF-----VNA----------GVSAKPIKGSAVFWYNVFPSG 476
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRD 221
D + HG+CPV G KW KWIR+
Sbjct: 477 EPDLRTYHGACPVAFGNKWAGNKWIRE 503
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 86.3 bits (212), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 86.3 bits (212), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + +P + + MA + ST+ G + R S ++ A D
Sbjct: 328 PFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQ-NKKSSFRVSKNAWL--AYDSH 384
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
T+ + ++ T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 385 PTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAI 444
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+D+E+GG T FPF N VKP+ G+ L +Y++ + +D +
Sbjct: 445 FYLSDVEQGGATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTK 489
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 490 HAGCPVLKGSKWIGNVWIHEATQ 512
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 86.3 bits (212), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 86.3 bits (212), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + +P + + MA + ST+ G + R S ++ A D
Sbjct: 328 PFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQ-NKKSSFRVSKNAWL--AYDSH 384
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
T+ + ++ T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 385 PTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAI 444
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+D+E+GG T FPF N VKP+ G+ L +Y++ + +D +
Sbjct: 445 FYLSDVEQGGATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTK 489
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 490 HAGCPVLKGSKWIGNVWIHEATQ 512
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 86.3 bits (212), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|228999322|ref|ZP_04158902.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
gi|229006877|ref|ZP_04164509.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228754370|gb|EEM03783.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228760519|gb|EEM09485.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
Length = 195
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 94/215 (43%), Gaps = 33/215 (15%)
Query: 18 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 77
VL P + TP +C+ +I ++K +++P+ GE + F
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGERKSD----------FTWL 56
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQ 132
G + + E IA LP + E RY++G K+++H D + D + Q
Sbjct: 57 PHYSHGLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
QR+ + ++YL + GGET FP + L V P +G LL +
Sbjct: 117 GGQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGK-LLVFENCK 160
Query: 193 NGTIDP--TSIHGSCPVVKGEKWVATKWIRDQEQY 225
GT +P S+H C V +GEKW+ T W R++ QY
Sbjct: 161 RGTNEPHPLSLHEGCAVHEGEKWIVTLWFREKPQY 195
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + + + I + A+ R +T+ K ++ R S ++ AEDE
Sbjct: 349 PYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFA-NYRISKSAWLKDAEDE- 406
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +++ +T L E ++ Y IG Y H+D +E KS R+A
Sbjct: 407 -MIRTISQRVEDMTGLTMETAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIA 465
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+ +GG T+FP + L + PR+G +++L +G D
Sbjct: 466 TVLFYMSDVTQGGATVFP---------------SLNLALWPRKGTAAFWFNLHASGRGDY 510
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 511 ATRHAACPVLTGTKWVSNKWIHERGQ 536
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 76/147 (51%), Gaps = 28/147 (19%)
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY-------GPQ 132
+++ T I ++I +T E N+ Y +G + HYD + P+ Y GP
Sbjct: 385 EQTTTRARIYQRITDITGFQLFVQEELNVANYGLGTIFGPHYD-YTPENYDIGWFMGGP- 442
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
+ + L Y++DL++GG T+FP I + V PR+G LL+++L
Sbjct: 443 ----LGTILFYVSDLQQGGATIFP---------------SINITVSPRKGSALLWFNLYD 483
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWI 219
+G DP ++H SCPV++G++W TKW+
Sbjct: 484 DGEPDPRTLHSSCPVIEGDRWTLTKWV 510
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + + + + I MA+ L+ +T+ K ++ R S ++ ED
Sbjct: 329 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHED-- 385
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +++ +T L E ++ Y +G Y+ HYD +E KS R+A
Sbjct: 386 VVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 445
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+ +GG T+FP+ +G+ ++P +G ++++L P+G D
Sbjct: 446 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 490
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV++G KWV KW+ + Q
Sbjct: 491 RTRHAACPVLQGSKWVCNKWLHEAGQ 516
>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 535
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 109/259 (42%), Gaps = 54/259 (20%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMA------KLNLRPSTLALRKGETVDNTQGI 67
+ + +S PR NF + E+ +I L+ ST+ +
Sbjct: 179 VIIESISESPRTFRLHNFFSGEEADKLIKRTLEIDDPSNKLQQSTVGANDNKNKKKKSKH 238
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGE---AFNILRYKIGQKYNSHYD-- 122
RTS F + +E +D I +++ V L + +LRY+ Q Y +H D
Sbjct: 239 RTSENAFDTVSE---AAVD-IRKRVFDVLSLGEFQADMADGLQLLRYQQKQAYIAHEDYF 294
Query: 123 --------AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPF---------ENGMNADG 165
FDP + G S R A+ +YL+D+ GG+T+FP E +
Sbjct: 295 PVGAAKDFNFDPHKGG---SNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNS 351
Query: 166 SYDYQ------------------KC-IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 206
+ DY+ KC L P +G +LFYS PNG +DP S+HG CP
Sbjct: 352 AQDYEAIGAELFEPGSWEMDMVRKCSTKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCP 411
Query: 207 VVKGEKWVATKWIRDQEQY 225
V++G KW A W+ ++ ++
Sbjct: 412 VLEGTKWGANLWVWNRRRH 430
>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
Length = 535
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 84/179 (46%), Gaps = 25/179 (13%)
Query: 54 ALRKGETVDNTQG-----IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNI 108
L++ E T G RTS G DE ++ + + + ++ L E I
Sbjct: 354 VLQRSEVYSPTNGSTAATFRTSQGTVFEY--DEHPIIEKLSQHMTLISGLDMGFAEPLQI 411
Query: 109 LRYKIGQKYNSHYDAF-DPQEYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGMNADG 165
Y IG Y H D+F + +Y Q K+ R+A+ + YL+++E GG T FPF
Sbjct: 412 ANYGIGGHYEPHMDSFPESFDYSLQRFKTNRIATGIFYLSNVEAGGATAFPF-------- 463
Query: 166 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ L VKP QG L +Y+L +G D + H CPV++G KW+A WIR Q
Sbjct: 464 -------LPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKWIANVWIRLSHQ 515
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 79/159 (49%), Gaps = 21/159 (13%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRI--NGEAFNILRYKIGQKYNSHYDAF 124
IR S ++ +DE + + ++I +T L + E +L Y +G +Y H+D
Sbjct: 125 IRISQQAWLHDKDDE--IVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYM 182
Query: 125 DPQE--YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 182
+E +G R+A+FL+YL+D+ GG T+FP N + V +
Sbjct: 183 TAEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVAN---------------VTVPVVKN 227
Query: 183 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
GLLF LL +G D S+H CPVV G KW+A KWI +
Sbjct: 228 AGLLFMDLLRSGRGDVNSLHAGCPVVIGSKWIANKWIHE 266
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + + + + I MA+ L+ +T+ K ++ R S ++ ED
Sbjct: 347 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHED-- 403
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +++ +T L E ++ Y +G Y+ HYD +E KS R+A
Sbjct: 404 VVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 463
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y++D+ +GG T+FP+ +G+ ++P +G ++++L P+G D
Sbjct: 464 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 508
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV++G KWV KW+ + Q
Sbjct: 509 RTRHAACPVLQGSKWVCNKWLHEAGQ 534
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + + I +AK LR +T++ V T R S +++A ED
Sbjct: 351 PYIVRYIDIISEAEMDKIKQLAKPRLRRATIS-NPVTGVLETAPYRISKSAWLTAYEDP- 408
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
++ I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 409 -VVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 467
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 468 TWLFYMSDVSAGGATVFP---------------DVGASVGPQKGTAVFWYNLFASGEGDY 512
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 513 STRHAACPVLVGNKWVSNKWIHERGQ 538
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 80/162 (49%), Gaps = 20/162 (12%)
Query: 64 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 123
T R S ++S E +D +E +IA +T L E F + Y + +Y+ H+D
Sbjct: 335 TAHYRISKNCWLSGRE-HGEVIDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDF 393
Query: 124 FDPQEYGPQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 179
S R+A+ LV+++ +E GG T+FP+ +G ++ P
Sbjct: 394 SRDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPY---------------VGARILP 438
Query: 180 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
++GD + +++LL +G D + H CPV+ G KWVA KWI +
Sbjct: 439 QKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWIHE 480
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 79/150 (52%), Gaps = 20/150 (13%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQEYGPQKSQ 135
++E + I E+ + +T L E ++ Y IG +Y H+D +P + +
Sbjct: 389 DEEDPLIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFDFSRRSEPTAFEKWRGN 448
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+ + + Y+TD+E GG T+F ++A G+KV P +G ++++LLP+G
Sbjct: 449 RILTVIYYMTDVEAGGATVF-----LDA----------GVKVYPEKGSAAVWHNLLPSGE 493
Query: 196 IDPTSIHGSCPVVKGEKWVATKWI--RDQE 223
D + H +CPV+ G KWVA KW RDQE
Sbjct: 494 GDMRTRHAACPVLTGSKWVANKWFHERDQE 523
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 96/208 (46%), Gaps = 25/208 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALR---KGETVDNTQGIRTSSGVFISAAE 79
P + N + + + +A+ N++ S + + ETV RTS G E
Sbjct: 329 PFVVQVHNIVSQKDMNLLQKIARPNIQRSQVYAQDHNANETV--AAAYRTSKGATFEYFE 386
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGP--QKSQR 136
S ++L+ +A ++ L + E I Y IG Y H+D F D Y P + R
Sbjct: 387 HRS--MELLSRHVADLSGLDMNSAELLQIANYGIGGHYEPHWDCFPDHHVYLPDDRDGNR 444
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+ + YL+++E GG T FPF + L V P +G + +Y+L +G
Sbjct: 445 IATGIYYLSEVEAGGGTAFPF---------------LPLLVTPERGSLVFWYNLHRSGDQ 489
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV++G KW+A WIR Q
Sbjct: 490 DYRTKHAACPVLQGSKWIANVWIRQSNQ 517
>gi|307108823|gb|EFN57062.1| hypothetical protein CHLNCDRAFT_143806 [Chlorella variabilis]
Length = 514
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 102/237 (43%), Gaps = 47/237 (19%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P L F +PE+C + + +++ S + + + +RTS G F++ +
Sbjct: 287 PCLLLADAFLSPEECGEVRALGAPHMKRSKV------SAGDETPLRTSWGTFLTGPLAQQ 340
Query: 83 GTLDLIEEKIAKVTMLPRIN--------GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
+E ++ ++ L GEA I+RY GQ Y H D S
Sbjct: 341 PVAARLEGRVRQLAALACEAEGRRALQLGEATQIVRYDPGQFYALHLD-----NRAGDSS 395
Query: 135 QRVASFLVYLTDLEEGGETMFPFENG--------MNADGSYDY----------------- 169
+R A+ ++Y++D+E GG T FP G A G+ +
Sbjct: 396 RRAATVMIYISDVEAGGATHFPRSCGYPLERALEACAAGARNEPAPPPGACPPAGHSPRA 455
Query: 170 ---QKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
Q GL V+PR+G ++F+S LP G D SIH + V G KW+ T+W R+Q+
Sbjct: 456 GQPQHPPGLWVQPREGRAVIFWSRLPGGGEDKASIHEAERVEAGTKWICTRWCREQD 512
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 94/203 (46%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + +P + + MA ++R ST+ G + R S ++ A +
Sbjct: 330 PFVVTYHDMLSPNKIAQLREMAVPHMRRSTVNPLPGGQ-NKKSSFRVSKNAWL--AYETH 386
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
T+ + ++ T L E + Y +G Y H+D F +P Y ++ R+A+ +
Sbjct: 387 PTMGKMLRDLSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAI 446
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF N V+P+ G+ L +Y+L + +D +
Sbjct: 447 YYLSEVEQGGATAFPFLN---------------FAVRPQLGNVLFWYNLHRSSDMDYRTK 491
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 492 HAGCPVLKGSKWIGNVWIHEVTQ 514
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 67/123 (54%), Gaps = 17/123 (13%)
Query: 104 EAFNILRYKIGQKYNSHYDAFDPQEYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGM 161
E+ + Y IG Y HYD + P+ R+A+ + YL+D+EEGG T+FP
Sbjct: 397 ESLQVNNYGIGGHYLPHYDWSREENPYPELNTGNRIATLMFYLSDVEEGGATVFPH---- 452
Query: 162 NADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
+G+ V P++G + +Y+L +G D ++HG+CPV+ G KWVA KWI +
Sbjct: 453 -----------LGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSKWVANKWIHE 501
Query: 222 QEQ 224
+ Q
Sbjct: 502 RHQ 504
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 68
I ++LS P + + +P + I + +K + PS ETV+ R
Sbjct: 301 IKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPS-------ETVNAANEFEVAKFR 353
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 126
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 354 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 411
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 412 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 456
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 457 WYNLHTEGLLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 494
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 99/217 (45%), Gaps = 33/217 (15%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+VLS P + + NF T + + I A+ LR S +A GE T R S ++
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA--SGEN-QATVEYRISKSAWLK 369
Query: 77 AAED-ESGTLDLIEEKIAKVTMLPRIN-----GEAFNILRYKIGQKYNSHYD-AFDPQE- 128
+E G LD +++ML +N E ++ Y IG Y H+D A P
Sbjct: 370 GSESCIVGKLD------QRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSP 423
Query: 129 -YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ + RVA+F++YL+ +E GG T F + N V + + +
Sbjct: 424 VFKLKTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVLKKAAIFW 468
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
++L NG D ++H CPV+ G+KWVA KW+ + Q
Sbjct: 469 WNLHRNGRGDAETLHAGCPVLIGDKWVANKWVHEYGQ 505
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 78/161 (48%), Gaps = 20/161 (12%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS G + ++ T + + +A ++ L E I Y IG Y H+D+F
Sbjct: 371 FRTSQGASFNYSQ--YATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPE 428
Query: 127 QEYGPQKS---QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
P+ R+A+ + YL+D+ GG T FPF + L V P +G
Sbjct: 429 HHEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGS 473
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 474 LLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 75/149 (50%), Gaps = 6/149 (4%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I + +SW PRA + NF + +C + ++ + S L + IRTS G
Sbjct: 6 IRIETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRS-LVVDSKTGQSKLDDIRTSYGA 64
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGP- 131
ED + +EE+IA+ T LP GE ILRY GQKY++H+D F DP +
Sbjct: 65 AFGRGEDP--VIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAY 122
Query: 132 -QKSQRVASFLVYLTDLEEGGETMFPFEN 159
+ R A+ L+YL+ +E GGET P +
Sbjct: 123 LHEGNRYATVLLYLSGVEGGGETNLPLAD 151
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 78/161 (48%), Gaps = 20/161 (12%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
RTS G + ++ T + + +A ++ L E I Y IG Y H+D+F
Sbjct: 371 FRTSQGASFNYSQ--YATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPE 428
Query: 127 QEYGPQKS---QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
P+ R+A+ + YL+D+ GG T FPF + L V P +G
Sbjct: 429 HHEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGS 473
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 474 LLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 134
+ I +I +T L E + Y +G +Y H+D AF QE G
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF--QELGT--G 447
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
R+A++L Y++D+ GG T+FP +G V P++G + +Y+L +G
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 492
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 97/208 (46%), Gaps = 26/208 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ--GIRTSSGVFISAAED 80
P + + N ++ +++ MA+ + +T+ + N + R S ++ + E
Sbjct: 344 PLIVIYHNVINDDEIETVKKMAQPRFKRATV---QNSVTGNLEPANYRISKSAWLKSEEH 400
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QR 136
+ + + ++ VT L E ++ Y IG Y H+D +E K R
Sbjct: 401 DH--VFKVTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNR 458
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
VA++L Y++++E GG T+FP + L + P++G +Y+L PNG
Sbjct: 459 VATWLFYMSEVEAGGATVFP---------------KLNLALWPQKGSAAFWYNLHPNGEG 503
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 504 NELTRHAACPVLTGSKWVSNKWIHERNQ 531
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 94/214 (43%), Gaps = 23/214 (10%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+VLS P + + N T + + +A L+ + + + + R S ++
Sbjct: 346 EVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTAWLD 405
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ------EYG 130
+++ + I I + L E I Y IG Y H D + + EY
Sbjct: 406 --KEDHPAVKRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALSEYT 463
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
+ R+A+ L+YL+++E GG T+FP G++V+PRQG +Y++
Sbjct: 464 SRIGNRIATVLIYLSNVEAGGATVFP---------------KAGVRVEPRQGSAAFWYNM 508
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NG + S+H +CPV+ G KW A W R+ Q
Sbjct: 509 HRNGEGNKLSVHAACPVLIGSKWAANLWFREVGQ 542
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 101/210 (48%), Gaps = 27/210 (12%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ + LS P + + + + + I+ +AK +LR + + G ++Q ++G
Sbjct: 296 LKLEQLSLDPYMVLYHDVVQANEREHIMQLAKPHLRRALV----GAARAHSQRFAMNAGF 351
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ- 132
+ ++S + +++ ++ N +L Y IG +Y HYD + Q+ Q
Sbjct: 352 ----SYNDSRQGQRLRQRLEDMSGFDLTNSGQLAVLNYGIGGQYYMHYDCWFSQDDAAQV 407
Query: 133 ---KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
K R+A+ L+YLTD++ GG T FP +GL V+P G L++++
Sbjct: 408 ASIKDNRIATILLYLTDVQLGGLTSFP---------------ALGLAVQPSPGSALIWHN 452
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+ D ++H +CP++ G +WVAT+WI
Sbjct: 453 MNNAAECDRRTLHAACPLLLGTRWVATQWI 482
>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
gigas]
Length = 885
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 112/244 (45%), Gaps = 47/244 (19%)
Query: 12 TNIPF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK----GE--- 59
T IP+ +V+++ PR F + + + + ++A L ST+ L G+
Sbjct: 643 TVIPYYKAKEEVVNYEPRIAIFHDVISSTSIEHLKSIASKGLTRSTVFLENTGPNGQVTI 702
Query: 60 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP------RINGEAFNILRYKI 113
T IR S +I DE L +E +I +T L R + E F ++ Y +
Sbjct: 703 TYGKQDNIRVSQTCWIRT--DEYPELLRLENRIQLITGLSAEYKPVRSHSEKFQVVNYGV 760
Query: 114 GQKYNSHYDAFDPQEYG----PQKSQ-------RVASFLVYLTDLEEGGETMFPFENGMN 162
G Y +H+D + + G P S+ R+A+++ Y+ D + GG T+FP
Sbjct: 761 GGMYTAHHD-YTGYKLGIISNPMDSEDISTSGDRMATWMFYMNDAKAGGATVFP------ 813
Query: 163 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ ++ +G +++L P+G DP ++HG CPV+ G KWV KWIR++
Sbjct: 814 ---------EVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKWVTNKWIREE 864
Query: 223 EQYD 226
Q D
Sbjct: 865 GQMD 868
>gi|103487007|ref|YP_616568.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
gi|98977084|gb|ABF53235.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
Length = 218
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 90/200 (45%), Gaps = 28/200 (14%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 87
P F E C ++I++ + RPST+A G+ RTSS + + +
Sbjct: 32 LPRFLDAETCAALIDLIDSDARPSTIADANGDAA-----FRTSSTCDL---DHRLPIVIA 83
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-----EYGPQKSQRVASFLV 142
+ K+ +T +P GE RY IG+++ +H D FDP Y QR + ++
Sbjct: 84 VNNKLHDLTGIPLAYGEPLQGQRYDIGEEFKAHTDYFDPHGADWDTYCAVPGQRSWTLMI 143
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL GG T F M+ +P G L + ++ P+GTI+P ++H
Sbjct: 144 YLNQPAAGGATRFLATGKMH---------------QPEVGKLLAWNNVRPDGTINPDTLH 188
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
V KG K++ TKW R++
Sbjct: 189 HGMKVRKGRKYIITKWFRER 208
>gi|149186836|ref|ZP_01865146.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
gi|148829503|gb|EDL47944.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
Length = 211
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 30/208 (14%)
Query: 23 PRALYFP--NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAED 80
P+A F +FA+P C+ ++ + + + RPSTLA N RTS + E
Sbjct: 24 PKAELFQLRDFASPAMCEQLVALIEKDRRPSTLA-----DAGNDHYFRTSETCDLDPDEP 78
Query: 81 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQ 135
+ IE +A + + GE RY +GQ++ H D F D ++Y Q
Sbjct: 79 ---VVCEIEALLAALNGIDPKFGEPLQGQRYDVGQEFKPHCDYFNRGGQDWEKYCSVAGQ 135
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R +F+VYL +E GG T F K +G +P G + + ++ P G
Sbjct: 136 RTWTFMVYLNAVEAGGATRF---------------KAVGKTFQPEPGKLVCWNNMRPEGR 180
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+P +IH V KG K+V TKW R++E
Sbjct: 181 ENPNTIHHGMKVRKGVKYVITKWYREKE 208
>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
Length = 347
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 98/213 (46%), Gaps = 25/213 (11%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
++L++ P + + T Q I +++ L S + K + RTS +
Sbjct: 143 EMLNFDPAIYVYHDVLTDSQNAIIKEVSRPKLHRSGV-FSKTDADTGLSNFRTSQ----T 197
Query: 77 AAEDESG--TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF---DPQEYGP 131
A D+S + + +K + ++ L E +L Y IG Y H+D + E+
Sbjct: 198 AWHDDSTHPLIARLSQKASAISNLTLETVEHLQVLNYGIGGLYEPHWDFVQGEERNEFSE 257
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
RVA+F+ YL++LE GG T++P +G V PR+ L+Y+L+
Sbjct: 258 SDRNRVATFICYLSELEAGGYTVYP---------------TVGAAVVPRKNSCALWYNLM 302
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NGT D + H +CP++ G KWVA KW + Q
Sbjct: 303 RNGTGDYRTYHAACPILYGYKWVANKWFHEGGQ 335
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 89/186 (47%), Gaps = 21/186 (11%)
Query: 42 NMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRI 101
++A+ L+ S + R G + RTS G + + + +A+++ L
Sbjct: 350 HLARPELQRSQVYSRTGHE-HISANFRTSQGTTFEYTDHP--IMQKMSHHVAEISGLDMR 406
Query: 102 NGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQ--KSQRVASFLVYLTDLEEGGETMFPFE 158
+ E I Y IG Y H D+F D +Y K+ R+A+ + YL+++E GG T FPF
Sbjct: 407 SAEPLQIANYGIGGHYEPHMDSFPDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFPF- 465
Query: 159 NGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 218
+ L V P +G L +Y+L P+G D + H +CPV++G KW+A W
Sbjct: 466 --------------LPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKWIANVW 511
Query: 219 IRDQEQ 224
IR Q
Sbjct: 512 IRLSNQ 517
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 99/212 (46%), Gaps = 23/212 (10%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I +V+S P L + N + +++ +A L+ +T+ + ++ R S
Sbjct: 325 INVEVISLQPYILIYHNLLNDLEVEALKTLAAPMLQRATVHNKDTGKLEYAT-YRISKSA 383
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE----Y 129
+++ +D+ + I I VT L + EA I Y IG Y H+D D + +
Sbjct: 384 WLN--DDDHPLVRRISTLIEDVTGLTMESAEALQIANYGIGGHYEPHFDHADVRSGTDVF 441
Query: 130 GPQKS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
K R+A+ L+YL+ +E GG T+F G++++PRQG +Y
Sbjct: 442 KTWKGGNRIATMLIYLSSVELGGATVFS---------------SAGVRIEPRQGSAAFWY 486
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
+L NG + + H +CPV+ G KW+A KWI
Sbjct: 487 NLHRNGNGNNLTRHAACPVLIGSKWIANKWIH 518
>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
Length = 530
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 97/210 (46%), Gaps = 25/210 (11%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
++L++ P + + T Q I +++ L S + K + RTS +
Sbjct: 326 EMLNFDPAIYVYHDVLTDSQNAIIKEVSRPKLHRSGV-FSKTDADTGLSNFRTSQ----T 380
Query: 77 AAEDESG--TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF---DPQEYGP 131
A D+S + + +K + ++ L E +L Y IG Y H+D + E+
Sbjct: 381 AWHDDSTHPLIARLSQKASAISNLTLETVEHLQVLNYGIGGLYEPHWDFVQGEERNEFSE 440
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
RVA+F+ YL++LE GG T++P +G V PR+ L+Y+L+
Sbjct: 441 SDRNRVATFICYLSELEAGGYTVYP---------------TVGAAVVPRKNSCALWYNLM 485
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
NGT D + H +CP++ G KWVA KW +
Sbjct: 486 RNGTGDYRTYHAACPILYGYKWVANKWFHE 515
>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 291
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 98/208 (47%), Gaps = 29/208 (13%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LSW+PR + F + E+C +I+ LR T + + TQ +G
Sbjct: 62 LSWLPRVFLYRGFLSEEECDHLIS-----LRKETTEVYSVDADGKTQLDPVVAG------ 110
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
IEEK++ T LP NG + + Y +K D F + +A
Sbjct: 111 ---------IEEKVSAWTFLPGENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLHESLLA 160
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTI 196
+ ++YL++ +GGE +FP + M S C+ G ++P +G+ +LF++ L N ++
Sbjct: 161 TVVLYLSNTTQGGELLFP-NSEMKPKNS-----CLEGGNILRPVKGNAILFFTRLLNASL 214
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D S H CPVVKGE VATK I ++Q
Sbjct: 215 DGKSTHLRCPVVKGELLVATKLIYAKKQ 242
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 96/206 (46%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 186 PRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 243
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 244 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 302
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 303 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 347
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 348 STRHAACPVLVGNKWVSNKWIHERGQ 373
>gi|402824057|ref|ZP_10873445.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
gi|402262407|gb|EJU12382.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
Length = 210
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 99/197 (50%), Gaps = 28/197 (14%)
Query: 31 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 90
F + E+C ++++ + N RPST+A G+ RTSS +S + +D + +
Sbjct: 34 FLSAERCAQLVDLIETNNRPSTIADYNGD-----DAFRTSSTCDLSR---DYPVVDELAQ 85
Query: 91 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYLT 145
+++++ + + E RY++GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 86 ALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPDSADFEKYCKVPGQRTWTFMIYLN 145
Query: 146 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 205
D+E GG T F K I ++P G + + + +G+ + ++H +
Sbjct: 146 DVEAGGATRF---------------KVIDKMIQPETGKLIGWNNRRADGSCNAATLHHAM 190
Query: 206 PVVKGEKWVATKWIRDQ 222
V KG K+V T+W R++
Sbjct: 191 KVRKGRKYVITQWYRER 207
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 100/214 (46%), Gaps = 21/214 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSG 72
I ++LS P L F + + ++ I + +K ++ PS T + + D+ RTS
Sbjct: 328 IKTEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKS 387
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYG 130
V+ S+ ++ T I E++ T L E F ++ Y +G + +H D D +
Sbjct: 388 VWYSSTSND--TTKRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFN 445
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
+ R+A+ L YL ++ +GG T FP N L V P+ G L +Y+L
Sbjct: 446 GTR-DRLATTLFYLNEVRQGGGTHFPRLN---------------LTVFPQPGSALFWYNL 489
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G +++H CPV+ G KWV +KW+ D Q
Sbjct: 490 DTRGNDHTSTLHTGCPVIVGSKWVMSKWVEDAGQ 523
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 76/146 (52%), Gaps = 16/146 (10%)
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVA 138
+++ + + ++ +T L I +A ++ Y +G Y+ HYD+ + E R+A
Sbjct: 351 NDTAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHNYSEANRLILGDRIA 410
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y+ +++ GG T FP+ I + V P++G +L+Y+L G ++P
Sbjct: 411 TVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSAVLWYNLDNAGQMNP 455
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+IH CPV+ G K+V TKWI + Q
Sbjct: 456 KAIHAGCPVIVGSKYVLTKWINEIPQ 481
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|301607015|ref|XP_002933103.1| PREDICTED: transmembrane prolyl 4-hydroxylase-like [Xenopus
(Silurana) tropicalis]
Length = 469
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 102/239 (42%), Gaps = 41/239 (17%)
Query: 21 WMP----RALYFPNFATPEQ--CKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
WM R +Y A P+ S+ KLNLR L K E V + +R S +
Sbjct: 189 WMTPENIREMYSALRADPDGNGVLSLDEFKKLNLRDFHKYLGKQE-VTMSDLVRNSHHTW 247
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRI---NGEAFNILRYKIGQKYNSHYDA-------- 123
+ E L I +++ K+T LP N E ++RY G Y++H D+
Sbjct: 248 LYQGEGAHHVLRSIRQRVIKLTHLPLDIVENSEPLQVVRYDTGGHYHAHMDSGPVFPETA 307
Query: 124 -----FDPQEYGP-QKSQRVASFLVYLTDLEEGGETMFP------------FENGMNADG 165
E P + S R + L YL ++ GGET FP +N ++
Sbjct: 308 CTHTKLTTNETAPFETSCRYVTVLFYLNNVTGGGETTFPVADNRTYEELSLIQNDVDLRD 367
Query: 166 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 219
+ + L++KPRQG + +Y+ L +G +D ++HG C V G KW+A WI
Sbjct: 368 TRKHCDKGNLRIKPRQGTAVFWYNYLSDGKGWVGDVDEFALHGGCLVTAGTKWIANNWI 426
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
[Cucumis sativus]
Length = 311
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 77
+SW PR + F + E+C +I++A + PS + G TV + + SSGV ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITV--STELLNSSGVILNT 116
Query: 78 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 137
+D + IE ++A T+LP+ + F I++Y+ G++ Y + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 138 ASFLVYLTDLEEGGETMFP-------FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
A+ ++YL+D GGE +FP F +G ++ ++P +G+ +L +S+
Sbjct: 173 ATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNF---------LRPVKGNAILXFSV 223
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
N + D +S H P+ GE WVATK++
Sbjct: 224 HLNASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 98/215 (45%), Gaps = 22/215 (10%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I ++L + P A+ F N + + + I +A L+ +T+ K +++ R S
Sbjct: 316 IKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRATVQNSKTGELEHAT-YRISKSA 374
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
++ D +D + +I T L + E + Y +G Y+ H+D +E K
Sbjct: 375 WLKG--DLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFK 432
Query: 134 S----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
+ R+A+ L Y++ E GG T+F +G V P + D L +Y+
Sbjct: 433 TLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDALFWYN 477
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 478 LRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQ 512
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 393
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 394 EDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 453
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
VA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 454 VATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 498
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 DYRTRHAACPVLVGCKWVSNKWFHERGQ 526
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 65/123 (52%), Gaps = 17/123 (13%)
Query: 104 EAFNILRYKIGQKYNSHYDAFDPQEYGPQKS--QRVASFLVYLTDLEEGGETMFPFENGM 161
E + Y +G Y+ HYD P K R+A+ + YL+D++EGG T+FP N
Sbjct: 401 ELLQVNNYGLGGFYSIHYDWSTSANPFPNKGMGNRIATLMFYLSDVQEGGSTVFPRLN-- 458
Query: 162 NADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
L V+PR+G + +Y+L NG + ++H +CPV+ G KWVA KWI +
Sbjct: 459 -------------LAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLIGSKWVANKWIHE 505
Query: 222 QEQ 224
+ Q
Sbjct: 506 RHQ 508
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 90/203 (44%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P F + +P + + MA ++ ST+ R G + R S ++ A +
Sbjct: 331 PYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKS-AFRVSKNAWL--AYEAH 387
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
T+ + + T L E + Y +G Y H+D F DP Y + R+A+ +
Sbjct: 388 PTMAGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGNRIATAI 447
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDFAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|323453493|gb|EGB09364.1| hypothetical protein AURANDRAFT_15704, partial [Aureococcus
anophagefferens]
Length = 148
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 82/157 (52%), Gaps = 18/157 (11%)
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--- 124
RTS + + A + + + +I +VT +P+ N E+F +LRY GQ+Y +H+D
Sbjct: 4 RTSENAWCTGACESNRATRAVMARIEEVTGVPKENYESFQVLRYTHGQQYRAHHDMSRGD 63
Query: 125 DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDG 184
+ GP R+ +F +Y +D+E+GGET FP + +K+ P++G
Sbjct: 64 NALACGP----RIYTFFMYFSDVEKGGETEFPMVKRPSGK---------TVKIAPKRGSA 110
Query: 185 LLFYSLLPNGTI--DPTSIHGSCPVVKGEKWVATKWI 219
LL+ S+ + DP + H + PVV+G K+ A WI
Sbjct: 111 LLWPSVTSDDPTAQDPRTRHAALPVVEGTKFAANAWI 147
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|357010489|ref|ZP_09075488.1| response regulator receiver domain-containing protein
[Paenibacillus elgii B69]
Length = 397
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 100/219 (45%), Gaps = 28/219 (12%)
Query: 15 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 74
P V P + + +C+ +I +A+ L P+ + GE R S F
Sbjct: 4 PSVVFHEEPLVASYEQVVSGPECRQLIELARHQLEPAKVI---GEKEVVASEFRKSE--F 58
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEY 129
D + + E++A + P E+ I RY +G ++ +H+D +D + +
Sbjct: 59 AWFHHDSHPLVREVSERLAALAGRPLHYAESLQIARYVVGGRFGAHFDTYDLNTVDGKRF 118
Query: 130 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
Q QR+ + L+YL ++ GGET FP N L + P +G+ L+ +
Sbjct: 119 YDQGGQRLYTALLYLNTVDAGGETYFPELN---------------LDIAPSEGN-LIVFE 162
Query: 190 LLPNGTID--PTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
GT + P S+HGS + +GEKW+AT W R++ QY+
Sbjct: 163 TCKWGTNERHPLSLHGSRELREGEKWIATLWFRERPQYE 201
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 393
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 394 EDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTG 453
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 454 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 498
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 528
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 102/212 (48%), Gaps = 29/212 (13%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSI--INMAKLNLRPSTLALRKGETVDNTQGIRTSS 71
IP + +S P + + + Q ++I I+ +K N P+ L G + TQ
Sbjct: 290 IPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQ-----V 344
Query: 72 GVFISAAEDESGTLDLIEEKIAKVTM-LPRIN---GEAFNILRYKIGQKYNSHYDAFDPQ 127
+F + E D + EKI+++T L ++ E + Y IG Y HYD+
Sbjct: 345 SIFCCSTWLEDA-YDPVVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIA 403
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
P QR+A+ + YL+++E GG T+FP +G+ V+P++G L +
Sbjct: 404 PEDPL--QRLATMMFYLSNVEIGGATIFPR---------------LGVAVRPQKGSALFW 446
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 219
+L NG + ++H +CPVV G KW+A KWI
Sbjct: 447 INLKRNGLTNRQTLHAACPVVIGSKWIANKWI 478
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 93/207 (44%), Gaps = 23/207 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + E+ +++ +A + +T+ + T R S F+ E
Sbjct: 347 PLLVIYHDVIFDEEIETVKKLAHPRFKRTTV-MNSATGKLETAKYRISKAAFLKNKEHHH 405
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQKS----QRV 137
+ + ++ +T L E + Y IG Y H+D A + G K R+
Sbjct: 406 --VLKMSRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNRI 463
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A++L Y++D+E GG T+FP + + + P++G +Y+L PNG +
Sbjct: 464 ATWLFYMSDVEAGGATVFP---------------ALNVALWPQKGSAAFWYNLFPNGEGN 508
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWVA KWI ++ Q
Sbjct: 509 ELTRHAACPVLTGSKWVANKWIHEKNQ 535
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 96/206 (46%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 338 PRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 395
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 396 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 454
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 455 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 499
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 STRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + + + +++ ++AK L +T+ + + Q R S ++S ED
Sbjct: 294 PRIVRYHEIISDAEIETVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 351
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 352 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 410
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 411 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 455
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 456 STRHAACPVLVGNKWVSNKWIHERGQ 481
>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
Length = 543
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 96/214 (44%), Gaps = 21/214 (9%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
I ++LS P L + ++ I +K +L S + + DN RTS
Sbjct: 324 IKTEILSLDPFVLLLHDMVRQKESTLIRASSKEHLLQSEITNTDASSSEDNVAIFRTSKS 383
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYG 130
V+ S+ D + T I E++A T L E F ++ Y +G + +H D D +
Sbjct: 384 VWYSS--DFNDTTKKITERLADATGLDMHFTEYFQVINYGLGGFFATHLDMLLSDKTRFN 441
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
S R+A+ + YL + +GG T FP N L V P+ G L +Y+L
Sbjct: 442 G-TSDRIATTVFYLNGVRQGGATHFPLLN---------------LTVFPQPGSALFWYNL 485
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G +++H CPV+ G KWV TKW+ DQ Q
Sbjct: 486 DTKGNDQRSTMHTGCPVIVGSKWVMTKWVGDQGQ 519
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 93/203 (45%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + +P + + MA +R ST+ G + R S ++ A E
Sbjct: 331 PYVATYHDMLSPRKISQLREMAVPRMRRSTVNPLPGGQHKKS-AFRVSKNAWL-AYESHP 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
+ ++ + + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAI 447
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 102/217 (47%), Gaps = 33/217 (15%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+++S P + + +F T + + I ++A LR S +A + + T R S ++
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVAAGEKQA---TADYRISKSAWLK 369
Query: 77 -AAEDESGTLDLIEEKIAKVTMLPRIN-----GEAFNILRYKIGQKYNSHYD-AFDPQE- 128
+A+ G LD ++++L +N GE ++ Y IG Y H+D A P
Sbjct: 370 GSAQSIVGKLD------QRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSP 423
Query: 129 -YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ + RVA+F++YL+ +E GG T F + N V + + +
Sbjct: 424 VFKLKTGNRVATFMIYLSPVEAGGSTAFIYAN---------------FSVPVVEKAAIFW 468
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
++L NG D ++H CPV+ G+KWVA KWI + Q
Sbjct: 469 WNLHRNGEGDDDTLHAGCPVLIGDKWVANKWIHEYGQ 505
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 96/207 (46%), Gaps = 24/207 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 81
P + + + + + I +AK R +T+ + GE R S ++ ++E
Sbjct: 342 PYIVIYHEVMSDAEIEVIKRLAKPRFRRATVQNYKTGEL--EVANYRISKSAWLK--DEE 397
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRV 137
+ + +++ +T L E ++ Y IG Y H+D +E KS R+
Sbjct: 398 HSVVRTVGQRVEDMTGLTMTTAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRI 457
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ L Y++D+ +GG T+FP I + ++P++G +Y+L +G D
Sbjct: 458 ATVLFYMSDVSQGGATVFP---------------SIRVALRPKKGTAAFWYNLHASGHGD 502
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 503 YATRHAACPVLTGTKWVSNKWIHERGQ 529
>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
Length = 229
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 76/146 (52%), Gaps = 16/146 (10%)
Query: 80 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVA 138
+++ + + ++ +T L I + ++ Y +G Y+ HYD+ + E R+A
Sbjct: 82 NDTAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYDVHYDSHNYSEANRLILGDRIA 141
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+ L Y+ +++ GG T FP+ I + V P++G +L+Y+L +G ++P
Sbjct: 142 TVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSAVLWYNLDNSGQMNP 186
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+IH CPV+ G K+V TKWI + Q
Sbjct: 187 KAIHAGCPVIVGSKYVLTKWINEIPQ 212
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 389
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 390 EDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTG 449
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 450 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 494
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 495 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 524
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 331 PYVATFHDMLSPRKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 447
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 97/215 (45%), Gaps = 22/215 (10%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
I ++L + P A+ F N + + I +A L+ +T+ K +++ R S
Sbjct: 316 IKVEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHAT-YRISKSA 374
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
++ D +D + +I T L + E + Y +G Y+ H+D +E K
Sbjct: 375 WLKG--DLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFK 432
Query: 134 S----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 189
+ R+A+ L Y++ E GG T+F +G V P + D L +Y+
Sbjct: 433 TLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDALFWYN 477
Query: 190 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 478 LRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQ 512
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 16/142 (11%)
Query: 84 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFLV 142
T+ + ++ T L E + Y +G Y H+D F D + Y + R+A+ +
Sbjct: 386 TMGRMLRDVSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIF 445
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL+D+E+GG T FPF N V+P+ G+ L +Y+L + +D + H
Sbjct: 446 YLSDVEQGGATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDMDFRTKH 490
Query: 203 GSCPVVKGEKWVATKWIRDQEQ 224
CPV+KG KW+A WI + Q
Sbjct: 491 AGCPVLKGSKWIANIWIHEATQ 512
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 94/204 (46%), Gaps = 21/204 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL-ALRKGETVDNTQGIRTSSGVFISAAEDE 81
P + + + +P+Q + MA +++ ST+ L G+ + + R S ++ +
Sbjct: 327 PLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKS--AFRVSKNAWLPYSTHP 384
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASF 140
+ + + T L E + Y +G Y H+D F D + Y + R+A+
Sbjct: 385 --MMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATA 442
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 200
+ YL+D+E+GG T FPF N V+P+ G+ L +Y+L + D +
Sbjct: 443 IFYLSDVEQGGATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDEDYRT 487
Query: 201 IHGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+A WI + Q
Sbjct: 488 KHAGCPVLKGSKWIANIWIHEATQ 511
>gi|357542083|gb|AET84843.1| hypothetical protein MPXG_00045 [Micromonas pusilla virus SP1]
Length = 196
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 92/196 (46%), Gaps = 29/196 (14%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 87
F NF T + +I AK NL PST++ +D + +R S ++S + +
Sbjct: 24 FRNFITSGERAHVIEEAKKNLTPSTVSTE--HKLD--ESVRKSETAWLSF---DDPIIRG 76
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 147
I EK + T P IN E +LRY+ G Y H D K+QR+ +F++ L D
Sbjct: 77 IAEKCIRYTDRPLINCEKLQVLRYEEGGHYIPHQDILRNA-----KNQRMYTFILALNDD 131
Query: 148 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT-SIHGSCP 206
EGGET+FP N SY K R GD L F+ L N D + ++HG P
Sbjct: 132 YEGGETVFP-----NLRKSY----------KLRAGDAL-FFDTLDNYEYDTSRALHGGKP 175
Query: 207 VVKGEKWVATKWIRDQ 222
V GEKW+ W+R
Sbjct: 176 VKSGEKWICNLWVRKH 191
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTG 450
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 495
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 255 PRIIRFHDIISDAENEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 312
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 313 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 371
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 372 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 416
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 417 STRHAACPVLVGNKWVSNKWLHERGQ 442
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 96/210 (45%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + N + E+ I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYNVMSDEEIDRIKELAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQYITGLTVQTAELLQVANYGMGGQYEPHFDFSRNHERDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + I ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 359
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 360 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTG 419
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 420 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 464
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 465 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|219116348|ref|XP_002178969.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409736|gb|EEC49667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 302
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 99/217 (45%), Gaps = 30/217 (13%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSGVF 74
V+S P + +F + CK++I+ A ST + + T T IRTS+ V+
Sbjct: 90 HVVSSEPPLVLIHDFLSTSMCKNLIDTAT-----STDKMIRSTTGSEQETSTIRTSTTVW 144
Query: 75 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 134
++ E T +I EKI+ ++ P + E ++RY+ GQ + H D D K
Sbjct: 145 LND-EQVPETSRIIAEKISSISGFPANHMENLQVVRYETGQSFKLHTDTIDAYNEM-DKR 202
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+ L+YL GGET+FP ++AD K I +++ PRQG + F++
Sbjct: 203 GRVATCLIYLAAPTIGGETLFP---DVHAD------KAIQIRIAPRQGSAIFFWNTHEKP 253
Query: 195 ------------TIDPTSIHGSCPVVKGEKWVATKWI 219
D H PV GEKWV +W+
Sbjct: 254 GSPAYDGADMFLNTDLRMRHAGMPVDGGEKWVCNRWV 290
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAFKHLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 45 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 102
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 103 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 161
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 162 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 206
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 207 STRHAACPVLVGNKWVSNKWLHERGQ 232
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|224006261|ref|XP_002292091.1| hypothetical protein THAPSDRAFT_263436 [Thalassiosira pseudonana
CCMP1335]
gi|220972610|gb|EED90942.1| hypothetical protein THAPSDRAFT_263436 [Thalassiosira pseudonana
CCMP1335]
Length = 232
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 114/236 (48%), Gaps = 39/236 (16%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAK-----LNLRPSTLALR--KGETVDNTQGIRT 69
+VLS PR L F +P + + +I++A + ++ ST+ +G T +T R+
Sbjct: 1 KVLSCAPRVLEVKKFLSPVEVQHLIDLASGAKGDVAMQRSTVLASNIRGATKTDT---RS 57
Query: 70 SSGVFISAAEDESGTLDLIEEKIAKVTML----------PRING----EAFNILRYKIGQ 115
SSG +I +D +D I +IA + + P + G EA +LRY+ G+
Sbjct: 58 SSGGWIHREQDV--IVDTIFRRIADLLKIDKNLMRDQRPPHLIGAHVVEAMQLLRYEPGE 115
Query: 116 KYNSHYDAFDPQEYGPQKSQRVASFLVYLT---DLEEGGETMFPFENGMNADG----SYD 168
+YN H+D P + +R + L+YLT D+ + G + P + DG
Sbjct: 116 EYNPHHDFTYPSIDNRYQPKRYVTILLYLTGEGDVIQDGIRLSPKNTNTDVDGLQGGETT 175
Query: 169 YQKCI------GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 218
+ + I G+KV P+ G ++FY++LP+G +D S H V KG K++A W
Sbjct: 176 FPRAITTEYHDGIKVAPQSGKAVVFYNILPDGNMDDLSQHSGGKVEKGVKYLANVW 231
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 359
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 360 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTG 419
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 420 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 464
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 465 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 97/226 (42%), Gaps = 41/226 (18%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPST---------------LALRKGETVDNTQG- 66
P L F NF T + K I +A L+ +T ++ R+ G
Sbjct: 310 PEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHPVTGK 369
Query: 67 -----IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 121
R S ++ EDE + I ++ + L E ++ Y IG Y HY
Sbjct: 370 LEFANYRISKSGWLRDEEDE--LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHY 427
Query: 122 D-AFDPQEYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK 178
D A D ++ R+A+FL YL+D+E GG T+F +G V
Sbjct: 428 DFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFT---------------RVGATVW 472
Query: 179 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
P++GD +Y+L +G D ++ H +CPV+ G KWVA KWI + Q
Sbjct: 473 PQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIHEVGQ 518
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|383640592|ref|ZP_09952998.1| 2OG-Fe(II) oxygenase [Sphingomonas elodea ATCC 31461]
Length = 221
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 95/198 (47%), Gaps = 28/198 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
NF QC +++ + RPSTLA + RTS + AAED +DL +
Sbjct: 43 NFLDSGQCAALMARIDEHRRPSTLA-----NAGDDYAFRTSETCDL-AAEDPLA-IDL-K 94
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
+I ++ L + E RY +GQ++ +H D FDP +Y QR + ++YL
Sbjct: 95 ARILELIGLDPDHAEPMQGQRYAVGQEFKAHTDYFDPGSVDFDKYCSVAGQRTWTVMLYL 154
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
++E GG T F K I V+P G L + +L P+GT++P +IH +
Sbjct: 155 NEVEAGGATRF---------------KAIDKIVQPEAGKLLAWNNLRPDGTVNPATIHHA 199
Query: 205 CPVVKGEKWVATKWIRDQ 222
V G K+V T+W R++
Sbjct: 200 MKVRAGCKYVITQWFRER 217
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + + + + +AK LR +T++ V T R S +++ +D
Sbjct: 416 PYIVRYLDIISDAEIERVKQLAKPRLRRATIS-NPITGVLETASYRISKSAWLTEYDDP- 473
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
++ I ++I VT L E + Y +G +Y H+D E K R+A
Sbjct: 474 -MIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 532
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 533 TWLFYMSDVSAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFASGEGDY 577
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 578 STRHAACPVLVGNKWVSNKWIHERGQ 603
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 411
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 412 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 471
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 472 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 516
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 517 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 546
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 450
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 495
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 303 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 360
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 361 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 419
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 420 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 464
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 465 STRHAACPVLVGNKWVSNKWLHERGQ 490
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 23/166 (13%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 126
+RTS FI A+ + L I++++A +T L E Y IG Y H D F
Sbjct: 371 VRTSQFTFIPASAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQ 428
Query: 127 QEY------GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 180
+ P+ R+A+ L YL+D+ +GG T FP + +KP+
Sbjct: 429 TTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTL---------------LKPK 473
Query: 181 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+ +++L +G D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 474 KYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIREFDQSD 519
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 289 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 346
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 347 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 405
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 406 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 450
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 451 STRHAACPVLVGNKWVSNKWLHERGQ 476
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 343
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 344 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 403
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 404 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 448
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 449 DYRTRHAACPVLVGCKWVSNKWFHERGQ 476
>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
Length = 490
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 80/159 (50%), Gaps = 23/159 (14%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD- 125
+RTS +I A+ + E++ +T + F+++ Y +G Y HYD +
Sbjct: 336 VRTSKDSYIVDAKT-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEY 388
Query: 126 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 185
P++ R+A+ L YL +++ GG T+FP I + V P++G +
Sbjct: 389 TNTTRPKQGDRIATVLFYLGEVDSGGATIFPM---------------INITVTPKKGSAV 433
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L +G ++ S+H +CPV+ G K+V TKWI + Q
Sbjct: 434 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 472
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 261 PYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 318
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 319 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 377
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 378 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 422
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 423 HAGCPVLKGSKWIGNVWIHEVTQ 445
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 80 PYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 137
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 138 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 196
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 197 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 241
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 242 HAGCPVLKGSKWIGNVWIHEVTQ 264
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 103 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 160
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 161 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 219
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 220 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 264
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 265 STRHAACPVLVGNKWVSNKWLHERGQ 290
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 420
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 421 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 480
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 481 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 525
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 526 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 555
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTFKHLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 92/214 (42%), Gaps = 26/214 (12%)
Query: 19 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 78
LS P + + + + I + + + + L TV N +RTS FI+
Sbjct: 324 LSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVTLTNQSTVSN---VRTSQITFIAKT 380
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQ 132
E E L I+ ++A +T L E Y IG Y H D F + +
Sbjct: 381 EHE--VLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTE 438
Query: 133 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 192
R+A+ L YL+D+ +GG T FP+ + ++P++ +++L
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPY---------------LKQHLRPKKYAAAFWHNLHA 483
Query: 193 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
G D + HG+CP++ G KWV +WIR+ Q D
Sbjct: 484 AGRGDARTQHGACPIIAGSKWVLNRWIREFVQSD 517
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ + +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 180 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 237
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 238 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 296
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 297 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 341
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 342 STRHAACPVLVGNKWVSNKWLHERGQ 367
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 430
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 431 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 490
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 491 NRVATFLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSG 535
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 536 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 565
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 450
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 495
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 DYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + I ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|219113719|ref|XP_002186443.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583293|gb|ACI65913.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 230
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 107/239 (44%), Gaps = 44/239 (18%)
Query: 13 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKG------ETVDNTQ 65
N+ +VLS PRA NF + ++ + I+ +A ++L+ S+ G E +++
Sbjct: 2 NLTLKVLSCAPRAFEIENFLSRQEVEHIVQLASGVDLKLSSTGDITGHKETPKELQTDSR 61
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTML-------------------PRINGEAF 106
RTS ++ ++S +D I + A V + + E
Sbjct: 62 RTRTSYNSWV--PREKSPIIDAIYRRAADVMRIDEALLRHRSDHTEWTNLTSTKPLAEQL 119
Query: 107 NILRYKIGQKYNSHYD----AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN 162
++ Y GQ+Y +H+D D Q G R + L+YL + GGET FP N
Sbjct: 120 QLVHYGPGQEYTAHHDFGFSRIDDQFQGA----RFGTLLLYLNEGMTGGETSFP--RWSN 173
Query: 163 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
A+ ++ L +KP G +LFYS LP+G +D S H + PV GEKW+ W D
Sbjct: 174 AETFHE------LSIKPEVGKAVLFYSQLPDGNLDDLSHHAAKPVTDGEKWLINLWTWD 226
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 96/211 (45%), Gaps = 32/211 (15%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 73
+ +VL P + F + +P + + +A L+ +T+ + +G RTS G+
Sbjct: 295 LKMEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGI 354
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 133
++S + + I +I+ +T + ++ Y + Y H D F+ E
Sbjct: 355 WLSRSHN--NLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYALHTDYFNTAE----- 407
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
L+D+E+GG+T+FP I KP +G LL+Y+L N
Sbjct: 408 ----------LSDVEQGGDTVFPR---------------IEQAFKPERGKALLWYNLHRN 442
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
GT D + HG+CPV+ G KW+ T+WI ++ Q
Sbjct: 443 GTGDKRTEHGACPVLVGSKWIMTQWINERPQ 473
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 411
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 412 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 471
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 472 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 516
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 517 DYRTRHAACPVLVGCKWVSNKWFHERGQ 544
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 91/203 (44%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P F + P + + MA + ST+ G + + R S ++ A E
Sbjct: 331 PYVATFHDMLNPRKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 447
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 359
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 360 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 419
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 420 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 464
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 465 DYRTRHAACPVLVGCKWVSNKWFHERGQ 492
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 420
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 421 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 480
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 481 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 525
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 526 DYRTRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 420
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 421 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 480
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 481 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 525
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 526 DYRTRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|397644356|gb|EJK76358.1| hypothetical protein THAOC_01879, partial [Thalassiosira oceanica]
Length = 539
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 96/206 (46%), Gaps = 29/206 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKL-NLRPST---LALRKGETVDNTQGIRTSSGVFISAA 78
P + F NF T E+ ++ +L ST A GE RTSS +
Sbjct: 336 PWVVVFDNFLTDEEVADLVKGGELEGYERSTDQGAANAYGEQEKVVSRTRTSSNAWCMHK 395
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
+ + +KI VT +P++N E+F +L+Y GQ Y SH+D+ + R+
Sbjct: 396 CERLPGVRSASKKIEAVTGIPQVNYESFQLLKYDGGQFYRSHHDSSSVDD--SPAGHRIL 453
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+F +YL+D+EEGGET F +G+ VKP++G L++ S+L DP
Sbjct: 454 TFFLYLSDVEEGGETYFSK---------------LGIAVKPKKGRALVWPSVLDE---DP 495
Query: 199 T-----SIHGSCPVVKGEKWVATKWI 219
T H + V+KGEK A WI
Sbjct: 496 TYWDKRMYHEAKDVIKGEKKAANHWI 521
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 430
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 431 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 490
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 491 LATFLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEG 535
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 536 DYRTRHAACPVLVGCKWVSNKWFHERGQ 563
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 96/206 (46%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + +AK L +T+ + + Q R S ++S ED
Sbjct: 327 PRIIRFHDIISDAEIEIVKYLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 384
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 385 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 443
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 444 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 488
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 489 STRHAACPVLVGNKWVSNKWLHERGQ 514
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 102/210 (48%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + A+ ++ +++ +AK LR +T+ + + Q R S ++ + E
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKLTTAQ-YRVSKSAWLGS--HEH 380
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 134
+D I ++I +T L E + Y +G +Y H+D AF+ G
Sbjct: 381 PIVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTG---- 436
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
R+A++L+Y++D++ GG T+F IG V P++G + +Y+L +G
Sbjct: 437 NRIATWLLYMSDVQAGGNTVFTD---------------IGAVVWPKKGTAVFWYNLHRSG 481
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 482 EGDYRTRHAACPVLVGNKWVSNKWIHERGQ 511
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 331 PYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 141
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 447
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 448
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 449 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 508
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 509 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 553
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 554 DYRTRHAACPVLVGCKWVSNKWFHERGQ 581
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 450
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 495
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 DYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 80/159 (50%), Gaps = 23/159 (14%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD- 125
+RTS +I +E + E++ +T + F+++ Y +G Y HYD +
Sbjct: 318 VRTSKDSYIVDSES-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEY 370
Query: 126 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 185
P++ R+A+ L YL +++ GG T+FP I + V P++G +
Sbjct: 371 TNTTRPKQGDRIATVLFYLGEVDSGGATIFP---------------KINIAVTPKKGSAV 415
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L +G ++ S+H +CPV+ G K+V TKWI + Q
Sbjct: 416 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 454
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 68
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 318 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 370
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 126
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 371 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 428
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 429 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 473
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 474 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 511
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 68
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 373
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 126
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 374 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 431
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 432 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 476
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 68
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 373
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 126
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 374 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 431
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 432 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 476
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
Length = 542
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 98/213 (46%), Gaps = 19/213 (8%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSG 72
I ++LS P L + + ++ I N +K ++ PS T +T RTS
Sbjct: 323 IKTEILSVDPFVLLLHDMISQKESTLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKS 382
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
V+ S+ D + T I E++ T L E + ++ Y +G + +H D ++
Sbjct: 383 VWYSS--DFNDTTKKITERLGDATGLDTNFTEFYQVINYGLGGFFETHLDMLLSEKNRFN 440
Query: 133 KSQ-RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
++ R+A+ L YL ++ +GG T FP I L V P+ G L +Y+L
Sbjct: 441 GTRDRIATTLFYLNEVRQGGGTYFPR---------------INLTVFPQPGSALFWYNLD 485
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
NG S+H CPV+ G KWV +KWI D Q
Sbjct: 486 TNGNDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 518
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 68
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 373
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 126
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 374 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 431
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 432 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 476
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 68
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 319 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 371
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 126
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 372 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 429
Query: 127 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 430 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 474
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 475 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 512
>gi|311977988|ref|YP_003987108.1| putative prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|81999799|sp|Q5UP57.1|P4H_MIMIV RecName: Full=Putative prolyl 4-hydroxylase; Short=4-PH; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
gi|55417206|gb|AAV50856.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|308204490|gb|ADO18291.1| putative prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|339061535|gb|AEJ34839.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|351737756|gb|AEQ60791.1| Prolyl 4-hydroxylase [Acanthamoeba castellanii mamavirus]
gi|398257408|gb|EJN41016.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga lentillevirus]
Length = 242
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 85/200 (42%), Gaps = 30/200 (15%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
N P +C+ I+ A L S + + + N+Q + S + + I
Sbjct: 65 NLINPTKCQEIMQFANGKLFDSQVLSGTDKNIRNSQQMWISKN---------NPMVKPIF 115
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYL 144
E I + +P N E ++RY Q YN H+D+ E+ + QR+ + L+YL
Sbjct: 116 ENICRQFNVPFDNAEDLQVVRYLPNQYYNEHHDSCCDSSKQCSEFIERGGQRILTVLIYL 175
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-IDPTSIHG 203
+ G T FP N K KP+ GD L+FY L N P S+H
Sbjct: 176 NNEFSDGHTYFPNLNQ---------------KFKPKTGDALVFYPLANNSNKCHPYSLHA 220
Query: 204 SCPVVKGEKWVATKWIRDQE 223
PV GEKW+A W R+++
Sbjct: 221 GMPVTSGEKWIANLWFRERK 240
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
Length = 538
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 95/217 (43%), Gaps = 33/217 (15%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 76
+++S P + F F T + K+I A LR S +A + T R S ++
Sbjct: 334 EIISLQPYVVLFHGFVTQAEAKNIRKYAMPGLRRSVVASGMNQA---TAEYRISKSAWLK 390
Query: 77 -AAEDESGTLDLIEEKIAKVTMLPRIN-----GEAFNILRYKIGQKYNSHYDAFDPQE-- 128
+A + G LD ++T++ +N E ++ Y IG Y H+D
Sbjct: 391 ESAHEVVGKLD------QRITLVTGLNVQPPYAEYLQVVNYGIGGHYEPHFDHATSDSSP 444
Query: 129 -YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
Y + RVA+ ++YL+ ++ GG T F + N V Q L +
Sbjct: 445 LYRLKTGNRVATIMIYLSPVQAGGSTAFIYAN---------------FSVPVVQNAALFW 489
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
++L NG + ++H CPV+ G KWVA KW+ + EQ
Sbjct: 490 WNLHKNGQGNVDTLHAGCPVIVGNKWVANKWVHEYEQ 526
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 70/146 (47%), Gaps = 15/146 (10%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 138
++E + I ++ + L E ++ Y IG Y HYD + R+A
Sbjct: 8 DEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIA 67
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
+FL YL+D+E GG T+F +G V P++GD +Y+L +G D
Sbjct: 68 TFLSYLSDVEAGGGTVFTR---------------VGATVWPQKGDAAFWYNLKRSGDGDS 112
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWVA KWI + Q
Sbjct: 113 STRHAACPVLVGSKWVANKWIHEVGQ 138
>gi|344276265|ref|XP_003409929.1| PREDICTED: transmembrane prolyl 4-hydroxylase [Loxodonta africana]
Length = 475
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 39 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
S+ + ++LR +R + ++ +R S ++ E + I +++ ++T L
Sbjct: 218 SLQEFSNMDLRDFHKYMRS-HKAETSELVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 276
Query: 99 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 141
P I E ++RY G Y++H D+ E GP + S R + L
Sbjct: 277 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESGPFETSCRYMTVL 336
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 189
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 337 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 396
Query: 190 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 219
LP+G +D S+HG C V +G KW+A WI
Sbjct: 397 YLPDGQGWVGDVDDYSLHGGCLVTRGTKWIANNWI 431
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 389
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 390 EDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 449
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 450 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 494
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 495 DYRTRHAACPVLVGCKWVSNKWFHERGQ 522
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 63/121 (52%), Gaps = 16/121 (13%)
Query: 104 EAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNA 163
E I Y +G Y H+D F + R+AS L YL+D+ +GG+T+F ++A
Sbjct: 5 EFLQIANYGLGGHYLPHFD-FTRDVATHKNGNRIASMLFYLSDVAKGGDTVF-----IDA 58
Query: 164 DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
G K+KP +G + +Y+L NG +D + H SCPV+ G KWVA W+ +
Sbjct: 59 ----------GAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKWVANMWMHEHG 108
Query: 224 Q 224
Q
Sbjct: 109 Q 109
>gi|397615311|gb|EJK63351.1| hypothetical protein THAOC_15991 [Thalassiosira oceanica]
Length = 463
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 69/122 (56%), Gaps = 10/122 (8%)
Query: 103 GEAFNILRYKIGQKYNSHYD-AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGM 161
EA ++ Y++GQ+Y +H+D + P + Q + R A+ L+YL + GGET FP
Sbjct: 349 AEALQLVHYEVGQEYTAHHDFGYAPFDRKDQPA-RFATLLLYLNEGMVGGETQFP--RWA 405
Query: 162 NADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
NA+ GL V+P+ G +LFYS LP+G +D S H + PV GEKW+ W+ D
Sbjct: 406 NAETR------AGLDVEPKIGKAVLFYSQLPDGNMDDLSQHAARPVKIGEKWLMNLWVWD 459
Query: 222 QE 223
E
Sbjct: 460 PE 461
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|341614920|ref|ZP_08701789.1| hypothetical protein CJLT1_08183 [Citromicrobium sp. JLT1363]
Length = 210
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 90/197 (45%), Gaps = 28/197 (14%)
Query: 31 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 90
F E C +I M + RPSTLA D + RTS + + S LD +
Sbjct: 33 FLDAEFCAELITMIEAKRRPSTLA-----DFDGDEFFRTSETCDLPMDDPRSQRLDAM-- 85
Query: 91 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYLT 145
+A ++ + GE RY +GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 86 -LADLSGIDPAYGEPLQGQRYAVGQEFKAHCDYFNPDGQDWEKYCSVAGQRTWTFMIYLN 144
Query: 146 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 205
+ E GG T F K I +P G + + + P+ +++P ++H
Sbjct: 145 EPEAGGVTRF---------------KTIKKSFQPETGKLICWNNRRPDQSVNPNTMHHGM 189
Query: 206 PVVKGEKWVATKWIRDQ 222
V KG K+V TKW R++
Sbjct: 190 KVRKGMKYVITKWYREK 206
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 73/140 (52%), Gaps = 20/140 (14%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF---DPQEYGPQKSQRVASFLVYL 144
IE++I +T L E F ++ Y IG Y HYD + +P + + +R+ + L YL
Sbjct: 324 IEKRIKDMTGLSMDLSEDFMLINYGIGGTYKMHYDFYVYSEPLRF--LRGERIVTVLFYL 381
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
D+E G T+FPF N + + P++G +++Y+L +G + + H +
Sbjct: 382 GDVELSGSTVFPFLN---------------ISITPKKGSAVMWYNLHNSGDVHQKTQHCA 426
Query: 205 CPVVKGEKWVATKWIRDQEQ 224
CPVV G K+V TKWI + Q
Sbjct: 427 CPVVVGSKYVLTKWINELHQ 446
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 101/216 (46%), Gaps = 24/216 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF+V L + P A+ F + + E+ K I +A L+ +T+ K ++ T R S
Sbjct: 319 PFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELE-TAAYRISKS 377
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
++ + E +D I +I +T L + E I Y +G Y+ H+D +E
Sbjct: 378 AWLKGGDHE--LIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAF 435
Query: 133 KS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
+S R+A+ L YLT+ E GG T+F + V P + L +Y
Sbjct: 436 ESLGTGNRLATVLFYLTEPEIGGGTVF---------------TELRTAVMPSKNGALFWY 480
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L +G D + H +CPV+ G KWVA KWI ++ Q
Sbjct: 481 NLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQ 516
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 99/214 (46%), Gaps = 25/214 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF+V LS P YF + + ++ + II K + S + TV + IRTS
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEIGQTGNSTVSD---IRTSQN 392
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE--YG 130
++ + + L I++++ +T L E ++ Y IG +Y H+D D E +G
Sbjct: 393 TWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFG 450
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
K R+ + L YL D+ GG T FPF + L V P +G L++Y+L
Sbjct: 451 -WKGNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYNL 494
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ D + H CPV+KG KW+ +W + Q
Sbjct: 495 HRSLHKDFRTKHAGCPVLKGSKWICNQWFHEAAQ 528
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 106/232 (45%), Gaps = 36/232 (15%)
Query: 14 IPF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 68
+PF ++L+ P + F + + + K++ A L + +A + + + R
Sbjct: 302 LPFVRYKEEMLNRKPHIVLFHDVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKR 361
Query: 69 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPR--ING----EAFNILRYKIGQKYNSHYD 122
S ++ + + T+ + ++A +T L ++G E F IL Y IG +Y H D
Sbjct: 362 ISQVSWL-WDDHANKTIHQLSRRVADITGLQTGVVSGLHSAEPFQILNYGIGGQYEPHVD 420
Query: 123 AFDPQ-------EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 175
F E+ R+A+F+ YL D+ GG T+FP + +
Sbjct: 421 YFAGNHSHSSLPEHVRASGNRLATFMFYLNDVHAGGATVFP---------------KLKV 465
Query: 176 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD--QEQY 225
+ P + +Y++ NG +DP + H CPV+ G+KWVA KWI + Q+QY
Sbjct: 466 GIPPTKNGAAFWYNIGLNGDVDPLTEHAGCPVLLGQKWVANKWIHEHGQDQY 517
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 138
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|313768324|ref|YP_004062004.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
gi|312599020|gb|ADQ91044.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
Length = 219
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 95/198 (47%), Gaps = 29/198 (14%)
Query: 24 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESG 83
R F +F TP++ K I+ MA L+PST++ + + N + +R S ++ E
Sbjct: 45 RPRVFHDFITPQERKHIMEMASKELKPSTVS---TDRILN-ESVRKSETAWLGR---EDP 97
Query: 84 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVY 143
+D + + K P N E +LRYK G YN H D+ + + R+ +F++
Sbjct: 98 VVDAVIHRCLKYIDRPIKNCEKLQVLRYKPGGYYNPHQDSINDGS-----NPRLYTFILA 152
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT-SIH 202
L D EGGET FP +G + K GD L F+ +L N ++ + ++H
Sbjct: 153 LNDEYEGGETEFP---------------KLGNEYKLTAGDAL-FFDILDNYELETSKALH 196
Query: 203 GSCPVVKGEKWVATKWIR 220
G PV GEKW+ W+R
Sbjct: 197 GGKPVKSGEKWICNLWVR 214
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + I + + L +T+ V N +RTS FI +
Sbjct: 333 PLLVLYHDVIYQSEIDVIRKLTENRLMRATITSHNESVVSN---VRTSQFTFIPVTAHK- 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQKSQR 136
L I++++A +T L E Y IG Y H D F + P+ R
Sbjct: 389 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNR 447
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+ L YL+D+ +GG T FP + +KP++ +++L +G
Sbjct: 448 IAAVLFYLSDVAQGGGTAFPQLRTL---------------LKPKKYAAAFWHNLHASGVG 492
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 493 DVRTQHGACPIIAGSKWVQNRWIRENDQSD 522
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + I + + L +T+ V N +RTS FI +
Sbjct: 333 PLLVLYHDVIYQSEIDVIRKLTENRLMRATITSHNESVVSN---VRTSQFTFIPVTAHK- 388
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQKSQR 136
L I++++A +T L E Y IG Y H D F + P+ R
Sbjct: 389 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNR 447
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+ L YL+D+ +GG T FP + +KP++ +++L +G
Sbjct: 448 IATVLFYLSDVAQGGGTAFPQLRTL---------------LKPKKYAAAFWHNLHASGVG 492
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 493 DVRTQHGACPIIAGSKWVQNRWIRENDQSD 522
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 98/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
E++ + + ++ ++T L E + Y +G +Y H+D E K
Sbjct: 389 EEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHERDAFKRLGTG 448
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSG 493
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
Length = 490
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 80/159 (50%), Gaps = 23/159 (14%)
Query: 67 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD- 125
+RTS +I A+ + E++ +T + F+++ Y +G Y HYD +
Sbjct: 336 VRTSKDSYIVDAKS-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEY 388
Query: 126 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 185
P++ R+A+ L YL +++ GG T+FP I + V P++G +
Sbjct: 389 TNTTRPKQGDRIATVLFYLGEVDSGGATIFP---------------KINIAVTPKKGSAV 433
Query: 186 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L +G ++ S+H +CPV+ G K+V TKWI + Q
Sbjct: 434 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 472
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + ++ +T L E + Y +G +Y H+D + K
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTG 450
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 495
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 95/211 (45%), Gaps = 33/211 (15%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFIS 76
PR + + + P + + I +A LR +T+ + GI RTS ++
Sbjct: 379 PRIVMWYDLIFPSEIEKIKELATPRLRRATVK-------NPVTGILEIAFYRTSKSAWLP 431
Query: 77 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQK 133
+ E D I ++I VT L E + Y +G Y H+D +E + +
Sbjct: 432 HSMSE--ITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHYAPHFDFGRKREKDAFEVKN 489
Query: 134 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 193
R+A+ + YL+D++ GG T+F IG +V P++G +++LLPN
Sbjct: 490 GNRIATIIFYLSDVQAGGATVF---------------NRIGTRVVPKKGAAGFWFNLLPN 534
Query: 194 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
G D + H +CPV+ G KWV W ++ Q
Sbjct: 535 GEGDLRTRHAACPVLAGSKWVMNLWFHERGQ 565
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 94/207 (45%), Gaps = 24/207 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL-ALRKGETVDNTQGIRTSSGVFISAAEDE 81
P + + + E+ + +A+ L+ S + +L E + + R S G F E
Sbjct: 331 PYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHI--STNFRISQGTFFEY--HE 386
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQ---KSQRV 137
+ + + + ++ L + E + Y IG Y H D+F + YG + RV
Sbjct: 387 HPIMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRV 446
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ + YL+++E GG T FPF + L V+P +G L +Y+L +G +D
Sbjct: 447 ATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGSLLFWYNLHRSGDLD 491
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H CPV+ G KW+A WIR Q
Sbjct: 492 YRTKHAGCPVLMGSKWIANVWIRLSNQ 518
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 100/216 (46%), Gaps = 24/216 (11%)
Query: 14 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 72
I ++L + P + F + + + I +A L+ +T+ R G+ R S
Sbjct: 316 IKVEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDL--EYANYRISKS 373
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 132
++ + + +D I ++I +T L + E Y IG Y+ H+D ++
Sbjct: 374 AWLKGTDHPA--IDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFDFARKEDINAF 431
Query: 133 KS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 188
K+ R+A+ L+Y++D+E GG T+F +G V P + D L +Y
Sbjct: 432 KTLNTGNRIATILIYMSDVESGGATVFNH---------------LGNAVFPSKYDALFWY 476
Query: 189 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+L +G D + H +CPV+ G KWV+ KWI D+ Q
Sbjct: 477 NLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQ 512
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 68/127 (53%), Gaps = 18/127 (14%)
Query: 96 TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY---GPQKSQRVASFLVYLTDLEEGGE 152
T+ R E + + Y IG Y H+D F+ + G + RVA+ ++Y++D+EEGG
Sbjct: 398 TLFSRDEAEKYQLANYGIGGHYVPHHDYFEEFQTPSKGNRFGNRVATLMIYMSDVEEGGA 457
Query: 153 TMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEK 212
T+FP +G++V P++GD + +++++ + + + H CPV+ G K
Sbjct: 458 TVFP---------------SLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLYGSK 502
Query: 213 WVATKWI 219
W+A KW
Sbjct: 503 WIANKWF 509
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 64/119 (53%), Gaps = 18/119 (15%)
Query: 108 ILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADG 165
+L Y +Y +H D F P EY Q+ R+A+ L YL D+E+GG+T+FP
Sbjct: 336 VLNYATAAQYLTHSDYFGPAYSEY-IQRGDRIATVLFYLNDVEQGGKTVFP--------- 385
Query: 166 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G+ P +G ++FY+L + DP + HG CPV+ G KW ATKWI EQ
Sbjct: 386 ------RLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWAATKWIYSAEQ 438
>gi|410447164|ref|ZP_11301266.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
gi|409980151|gb|EKO36903.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
Length = 214
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 99/214 (46%), Gaps = 31/214 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P NF + +C + IN A+ L+ ST+ G + G RTS +I D +
Sbjct: 21 PIVYLVKNFLSDLECDAFINEAEGRLQDSTVI---GANDEIKLGARTSQNCWIE--HDAN 75
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRV 137
+ + ++++ + +P N E + + Y+ ++Y +D+FD +K+ QR+
Sbjct: 76 ELVHEVSKRLSILAQIPIRNAEQYQLACYEKDEEYKPRFDSFDFDTLEGKKNWEPGGQRM 135
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-- 195
+ +VYL D++ GG T FP +G + P++GD ++ + + +
Sbjct: 136 LTIIVYLNDVQSGGGTDFP---------------KLGFTIPPKKGDVVVLNNTCDDDSQN 180
Query: 196 ----IDPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
I P S+H PV+ G+KW+ T W R +Y
Sbjct: 181 GHPNIHPNSLHAGMPVLSGKKWIVTLWFRQNLRY 214
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 100/206 (48%), Gaps = 24/206 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 81
PR + + + + E+ + I +AK L +T+ R +T V R S ++ +D
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRLARATV--RDPKTGVLTVANYRVSKSAWLEEYDDP 393
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ---KSQRVA 138
+ + ++ +T L + E + Y +G +Y H+D F + + + R+A
Sbjct: 394 --VIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTEGNRLA 450
Query: 139 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 198
++L Y++D+E GG T+FP D+ G + PR+G + +Y+L +G D
Sbjct: 451 TYLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFRSGEGDY 495
Query: 199 TSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 RTRHAACPVLVGSKWVSNKWFHERGQ 521
>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
Length = 263
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 94/200 (47%), Gaps = 27/200 (13%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 87
+ +F + ++C I+ + K L PS LA + D+ IRTSS ++ ++ +
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA--GAASADD---IRTSSTCELAFLGNK--LVKD 137
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLV 142
++ +I L GE Y +G+ Y HYD F P + + + QR + ++
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQYKAHCLSRGQRTWTCMI 197
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D +GG T F + + VKP++G L + +LLP+G + SIH
Sbjct: 198 YLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNSIH 242
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
+ PV +G K V TKW R +
Sbjct: 243 FAEPVTRGHKTVITKWFRTK 262
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 99/213 (46%), Gaps = 23/213 (10%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF+V L+ P YF + + II ++ S + + T T IRTS+
Sbjct: 329 PFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQSQNAT---TSEIRTSAN 385
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGP 131
++ +E+ L I++++ +T L + E ++ Y IG +Y H+D + PQ+
Sbjct: 386 TWLWY--NENPWLSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEPHFDFVEEPQKVFG 443
Query: 132 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 191
K R+ + L Y+ D+ GG T FPF + L V P +G L++Y+L
Sbjct: 444 WKGNRMLTALFYINDVALGGATAFPF---------------LQLAVPPVKGSLLVWYNLH 488
Query: 192 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ D + H CPV+KG KW+ +W + Q
Sbjct: 489 RSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQ 521
>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 294
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 94/196 (47%), Gaps = 32/196 (16%)
Query: 31 FATPEQCKSIINMA-KLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
F+ PE C ++I +A + + GE ++ RTSS F+ A ED L +
Sbjct: 110 FSGPE-CDALIALAGNYMIVSPVVGAGAGEVSES----RTSSSCFL-ARED----LPTVC 159
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEYGPQKSQRVASFLVYL 144
K+ +T P + E + RY QKY +H+DAFD + + QRV + LVYL
Sbjct: 160 HKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAFDLNTEDGRRFAQNGGQRVCTVLVYL 219
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
D+ GG T FP +G+KV+PR+G ++F+ +G +D +H +
Sbjct: 220 NDVPSGGCTAFPQ---------------LGMKVQPRKGMAVVFFPATLDGVLDSRLLHAA 264
Query: 205 CPVVKGEKWVATKWIR 220
P + KWV+ WIR
Sbjct: 265 EPAID-TKWVSQIWIR 279
>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
Length = 263
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 94/200 (47%), Gaps = 27/200 (13%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 87
+ +F + ++C I+ + K L PS LA + D+ IRTSS ++ ++ +
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA--GAASADD---IRTSSTCELAFLGNK--LVKD 137
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLV 142
++ +I L GE Y +G+ Y HYD F P + + + QR + ++
Sbjct: 138 VDNRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQYKAHCLSRGQRTWTCMI 197
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D +GG T F + + VKP++G L + +LLP+G + SIH
Sbjct: 198 YLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNSIH 242
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
+ PV +G K V TKW R +
Sbjct: 243 FAEPVTRGHKTVITKWFRTK 262
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 342
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 343 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 402
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 403 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 447
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 448 DYRTRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|90023340|ref|YP_529167.1| response regulator receiver domain-containing protein
[Saccharophagus degradans 2-40]
gi|89952940|gb|ABD82955.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 269
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 102/209 (48%), Gaps = 26/209 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P +F T + II A ++ + ++ K E +++ RT S ++ A D +
Sbjct: 63 PSVTICEDFLTQAEVFQIIKAAGDKMQRARVSSGK-EGIESAG--RTGSNCWV--AHDHN 117
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYG----PQKSQRV 137
+ ++I+K+ + N E+F ++ Y + Q+Y+SH+DA++ E G + QR+
Sbjct: 118 KVTHALAKRISKLVGISLQNAESFQVIHYGVSQEYSSHFDAWEFNTERGERCMARGGQRL 177
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI- 196
+ L+YL D+ GG T FP + L+V+ ++G ++F++ P
Sbjct: 178 VTCLIYLNDVPAGGGTGFPE---------------LDLEVQAKKGRMVIFHNCYPGTNYR 222
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQY 225
P S+HG PV +GEKW W R+ + +
Sbjct: 223 HPHSLHGGLPVEEGEKWAVNLWFREADYW 251
>gi|94495931|ref|ZP_01302510.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
gi|94424623|gb|EAT09645.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
Length = 229
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
+F +P++C + + N +PSTL D RTS +S + ++ I
Sbjct: 52 DFLSPDECAELRRLIDANAQPSTL-FSGSANAD----YRTSHSGNLSPRDP---LVERIT 103
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
++I +T LP INGE RY GQ+Y H D F Q QR + ++YL
Sbjct: 104 QRICALTGLPAINGETLQGQRYTPGQEYKVHCDYFPATADYWQRMRGTGGQRTWTAMIYL 163
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
+ +E GGET FP +C V P +G L++ ++ +G + S+H +
Sbjct: 164 SAVEAGGETHFP--------------QC-EFMVPPVEGMILIWNNMDRDGAPNRFSLHAA 208
Query: 205 CPVVKGEKWVATKWIRDQ 222
PV +G K+V TKW R++
Sbjct: 209 LPVERGTKYVVTKWFRER 226
>gi|426249581|ref|XP_004018528.1| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
[Ovis aries]
Length = 484
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 98/219 (44%), Gaps = 35/219 (15%)
Query: 39 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
S+ + ++LR +R +++Q +R S ++ E + I +++ ++T L
Sbjct: 245 SLQEFSNMDLRDFHKYMRS-HRAESSQLVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 303
Query: 99 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 141
P I E ++RY G Y++H D+ E P + S R + L
Sbjct: 304 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESVPFETSCRYMTVL 363
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 189
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 364 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 423
Query: 190 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWIRDQE 223
LP+G +D S+HG C V +G KW+A WI +E
Sbjct: 424 YLPDGQGWVGDVDDYSLHGGCLVTRGTKWIANNWINARE 462
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 391 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 450
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 495
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 DYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 342
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 343 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 402
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 403 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 447
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 448 DYRTRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|381173085|ref|ZP_09882194.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380686458|emb|CCG38681.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 418
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRG---------- 276
Query: 83 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 130
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 277 ATLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAGGDTEFPIA---------------GVRVRPRPGTLVCFDNL 381
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+G D S+H PV G KW+ T W R Q
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 414
>gi|418521653|ref|ZP_13087695.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702188|gb|EKQ60697.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 418
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRGA--------- 277
Query: 83 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 130
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 278 -TLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAGGDTEFPIA---------------GVRVRPRPGTLVCFDNL 381
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+G D S+H PV G KW+ T W R Q
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 414
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 71/143 (49%), Gaps = 23/143 (16%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVASFLVY 143
+ +++ +T L E+ ++ Y IG Y H+D +E +S R+A+ L Y
Sbjct: 396 VSQRVEDITGLNMATAESLQVVNYGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATILFY 455
Query: 144 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK--PRQGDGLLFYSLLPNGTIDPTSI 201
++D+ +GG T+FP G+KV P++G +Y+L NG D +
Sbjct: 456 MSDVSQGGATVFP-----------------GIKVSLWPKKGTAAFWYNLRKNGEGDYLTR 498
Query: 202 HGSCPVVKGEKWVATKWIRDQEQ 224
H +CPV+ G KWV KWI ++ Q
Sbjct: 499 HAACPVLTGSKWVCNKWIHERGQ 521
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 94/207 (45%), Gaps = 24/207 (11%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTL-ALRKGETVDNTQGIRTSSGVFISAAEDE 81
P + + + E+ + +A+ L+ S + +L E + + R S G F E
Sbjct: 33 PYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHI--STNFRISQGTFFEYHEHP 90
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQ---KSQRV 137
+ + + + ++ L + E + Y IG Y H D+F + YG + RV
Sbjct: 91 --IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRV 148
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A+ + YL+++E GG T FPF + L V+P +G L +Y+L +G +D
Sbjct: 149 ATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGSLLFWYNLHRSGDLD 193
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H CPV+ G KW+A WIR Q
Sbjct: 194 YRTKHAGCPVLMGSKWIANVWIRLSNQ 220
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 98/214 (45%), Gaps = 25/214 (11%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
PF+V LS P YF + + ++ + II K + S + TV IRTS
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEIGQTGNSTVSE---IRTSQN 392
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE--YG 130
++ + + L I++++ +T L E ++ Y IG +Y H+D D E +G
Sbjct: 393 TWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFG 450
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
K R+ + L YL D+ GG T FPF + L V P +G L++Y+L
Sbjct: 451 -WKGNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYNL 494
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ D + H CPV+KG KW+ +W + Q
Sbjct: 495 HRSLHKDFRTKHAGCPVLKGSKWICNEWFHEAAQ 528
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 97/207 (46%), Gaps = 26/207 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISAAEDE 81
PR + + + + E+ ++I +A+ +T+ ++ GE + I S+ + +E
Sbjct: 346 PRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWL----KHEE 401
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP----QEYGPQKSQRV 137
+ I ++ +T L E + Y IG Y HYD Q++G R+
Sbjct: 402 HDYVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQQDFGW--GGRI 459
Query: 138 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 197
A++L Y++D+E GG T+FP N L + P++G +++L PNG +
Sbjct: 460 ATWLFYMSDVEAGGATVFPKLN---------------LSLWPQKGSAAFWFNLYPNGEGN 504
Query: 198 PTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H CPV+ G KWVA WI ++ Q
Sbjct: 505 EMTQHAGCPVLTGSKWVANYWIHERGQ 531
>gi|77748579|ref|NP_641686.2| hypothetical protein XAC1351 [Xanthomonas axonopodis pv. citri str.
306]
Length = 418
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRGA--------- 277
Query: 83 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 130
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 278 -TLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAGGDTEFPIA---------------GVRVRPRPGTLVCFDNL 381
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+G D S+H PV G KW+ T W R Q
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 414
>gi|167524906|ref|XP_001746788.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774568|gb|EDQ88195.1| predicted protein [Monosiga brevicollis MX1]
Length = 321
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 68/133 (51%), Gaps = 16/133 (12%)
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 147
+EE+I K+ LP +N E F +LRY Q Y H D D ++Y RV + +YL D+
Sbjct: 191 LEERIGKLVGLPVVNQEHFQVLRYNNNQYYRVHNDLID-EQYDMPCGPRVLTLFIYLNDV 249
Query: 148 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV 207
GGET F +GL VKP++G +L+YS+ + + + H + PV
Sbjct: 250 PAGGETSF---------------TRLGLAVKPKKGKAVLWYSVTNDLEPEERTDHEARPV 294
Query: 208 VKGEKWVATKWIR 220
+G K+ A KWI
Sbjct: 295 KQGTKYAANKWIH 307
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/220 (25%), Positives = 98/220 (44%), Gaps = 28/220 (12%)
Query: 15 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 72
P +V L+ P + + + + I N+ + + +T+ KG V +RTS
Sbjct: 320 PLKVEELNHNPLLVLYHDVIYQSEIDVIRNLTENEISRATVIGAKGSEVSK---VRTSQF 376
Query: 73 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 129
FI + L I++++A ++ L E Y IG Y H D F +
Sbjct: 377 TFIPKTRHK--VLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFGQDAFDNE 434
Query: 130 ---GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
P+ R+A+ L YL+D+ +GG T FP + ++P++
Sbjct: 435 LVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQL---------------LQPKKYAAAF 479
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+++L +G D ++HG+CP++ G KWV +WIR+ Q D
Sbjct: 480 WHNLHASGVGDLRTLHGACPIIAGSKWVQNRWIREFIQAD 519
>gi|149185530|ref|ZP_01863846.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
gi|148830750|gb|EDL49185.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
Length = 241
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 84/198 (42%), Gaps = 28/198 (14%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 87
P F P +C +I + + NL PS L + G RTS + SA E E L
Sbjct: 53 MPGFLAPAECTRLIELIESNLLPSPLF-----SDPTGTGARTSQTHYFSAEEPEVAALG- 106
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-----KSQRVASFLV 142
K+ + L R E RY +GQ+Y H D F + Q QR S +V
Sbjct: 107 --AKLDDLLGLERRQAETVQGQRYDVGQEYRHHRDFFRVEREHWQLERRRGGQRTWSAMV 164
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL +E GGET FP + L V P G + + ++ +G +P ++H
Sbjct: 165 YLNAIEAGGETDFP---------------VLDLSVAPEPGLLIAWNNVDRHGHPNPATLH 209
Query: 203 GSCPVVKGEKWVATKWIR 220
PV G K+V T+W R
Sbjct: 210 AGMPVEAGRKYVVTQWYR 227
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 98/208 (47%), Gaps = 30/208 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 339 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 391
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + +++ ++T L E + Y +G +Y H+D E K
Sbjct: 392 EDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTG 451
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G
Sbjct: 452 NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSG 496
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
D + H +CPV+ G KWV+ KW ++
Sbjct: 497 EGDYRTRHAACPVLVGCKWVSNKWFHER 524
>gi|325929527|ref|ZP_08190641.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
gi|325540037|gb|EGD11665.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
Length = 418
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 89/216 (41%), Gaps = 38/216 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPIRTSHGA--------- 277
Query: 83 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 130
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 278 -TLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
P R + VYL D+ GET FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAAGETEFPVA---------------GVRVRPRPGTLVCFDNL 381
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
+G D S+H PV G KW+ T W R Q D
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQRYRD 417
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 98/208 (47%), Gaps = 30/208 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + N + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 355 PHIVRYYNVLSDEEIEKIKELAKPKLARATVR-------DPKTGVLTVANYRVSKSSWLE 407
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
E++ + + +++ +T L E + Y +G +Y H+D +E K
Sbjct: 408 EEDDLVVAKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTG 467
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G
Sbjct: 468 NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSG 512
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
D + H +CPV+ G KWV+ KW ++
Sbjct: 513 EGDYRTRHAACPVLVGCKWVSNKWFHER 540
>gi|431913403|gb|ELK15078.1| Protein ariadne-2 like protein [Pteropus alecto]
Length = 843
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 97/212 (45%), Gaps = 38/212 (17%)
Query: 39 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
S+ + ++LR +R+ + ++++ +R S ++ E + I +++ ++T L
Sbjct: 595 SLQEFSNMDLRDFHKYMRRHKA-ESSELVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 653
Query: 99 -PRIN--GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASF-----------LVYL 144
P I E ++RY G Y++H D+ GP + + S L YL
Sbjct: 654 SPEIVELSEPLQVVRYGEGGHYHAHVDS------GPVYPETICSHTKLVANDYMTVLFYL 707
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYSLLP 192
++ GGET+FP + D Q + L+VKPRQG + +Y+ LP
Sbjct: 708 NNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYNYLP 767
Query: 193 NGT-----IDPTSIHGSCPVVKGEKWVATKWI 219
+G +D S+HG C V +G KW+A WI
Sbjct: 768 DGQGWVGDVDDYSLHGGCLVTRGTKWIANNWI 799
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 96/196 (48%), Gaps = 28/196 (14%)
Query: 35 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA----EDESGTLDLIEE 90
E+ + I +AK L +T+ D G+ T + +S + ED+ + +
Sbjct: 4 EEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLEEDDDPVVARVNR 56
Query: 91 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQRVASFLVYLTDLE 148
++ +T L E + Y +G +Y H+D + P + G + + R+A+FL Y++D+E
Sbjct: 57 RMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 116
Query: 149 EGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVV 208
GG T+FP +G + P++G + +Y+LL +G D + H +CPV+
Sbjct: 117 AGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 161
Query: 209 KGEKWVATKWIRDQEQ 224
G KWV+ KW ++ Q
Sbjct: 162 VGCKWVSNKWFHERGQ 177
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 64/119 (53%), Gaps = 18/119 (15%)
Query: 108 ILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADG 165
+L Y +Y +H D F P EY Q+ R+A+ L YL D+E+GG+T+FP
Sbjct: 343 VLNYATAAQYLTHSDYFGPAYSEY-IQRGDRIATVLFYLNDVEQGGKTVFP--------- 392
Query: 166 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+G+ P +G ++FY++ + DP + HG CPV+ G KW ATKWI EQ
Sbjct: 393 ------RLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTKWAATKWIYSAEQ 445
>gi|345324764|ref|XP_001505668.2| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
[Ornithorhynchus anatinus]
Length = 495
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 95/215 (44%), Gaps = 35/215 (16%)
Query: 39 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
S+ +LNLR + + V + +R S ++ E + I++++ ++T L
Sbjct: 240 SLEEFKRLNLRDFHKYM-GSQKVKMSDLVRNSQHTWLYQGEGAHQVMRSIQQRVLRLTRL 298
Query: 99 PRI---NGEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 141
P+ + E ++RY G Y++H D+ F E P + S R + L
Sbjct: 299 PQEIVEHSEPLQVVRYDQGGHYHAHMDSGPVFPETACSHTKFITNETAPFETSCRYVTVL 358
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 189
YL ++ GGET FP + D Q I L+VKP+QG + +Y+
Sbjct: 359 FYLNNVTGGGETTFPVADNRTYDEMSLIQNDIDLRDTRKHCDKGNLRVKPKQGTAVFWYN 418
Query: 190 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 219
L +G +D S+HG C V +G KW+A WI
Sbjct: 419 YLSDGQGWVGDLDEYSLHGGCLVTQGTKWIANNWI 453
>gi|393774561|ref|ZP_10362923.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
gi|392720044|gb|EIZ77547.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
Length = 210
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 98/198 (49%), Gaps = 28/198 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
+F + +C+ ++ + + RPST+A G+ RTSS +SA + + +
Sbjct: 33 DFLSAPECEELVALIEAEHRPSTIADFTGD-----DAFRTSSTCDLSA---QVPAVADLA 84
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
K+A+++ + + E RY++GQ++ +H D F+P Q Y QR +F+VYL
Sbjct: 85 AKLARLSGIDPAHAEPLQGQRYEVGQQFKAHTDTFEPGTADYQRYCSASGQRTWTFMVYL 144
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
+++ GG T F + I ++P +G + + + P+ +P ++H +
Sbjct: 145 NEVDAGGATRF---------------REIDKLIQPERGKLVAWNNRKPDRQPNPATLHHA 189
Query: 205 CPVVKGEKWVATKWIRDQ 222
V +G K+V T+W R++
Sbjct: 190 MKVRRGRKYVITQWYRER 207
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 98/208 (47%), Gaps = 30/208 (14%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 340 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 392
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 134
ED+ + + +++ ++T L E + Y +G +Y H+D E K
Sbjct: 393 EDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTG 452
Query: 135 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 194
RVA+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G
Sbjct: 453 NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSG 497
Query: 195 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
D + H +CPV+ G KWV+ KW ++
Sbjct: 498 EGDYRTRHAACPVLVGCKWVSNKWFHER 525
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQKSQ 135
++E T+ I + + +T L E I Y IG Y H+D E + +
Sbjct: 390 DEEHPTVAKISNRCSALTNLSLSTVEELQIANYGIGGHYEPHFDYSRLAEVTSFDHWRGN 449
Query: 136 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 195
R+ + + YL+D+E GG T+F M A G K++P +G ++Y+L P+GT
Sbjct: 450 RILTVIFYLSDVEAGGGTVF-----MTA----------GTKLRPEKGAAAVWYNLHPDGT 494
Query: 196 IDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWVA KW ++ Q
Sbjct: 495 GDDETKHAACPVLTGNKWVANKWFHERGQ 523
>gi|21107513|gb|AAM36222.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 273
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 83 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRG---------- 131
Query: 83 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 130
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 132 ATLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 191
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 192 PTAGNRQRTVCVYLNDVGAGGDTEFPIA---------------GVRVRPRPGTLVCFDNL 236
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 223
+G D S+H PV G KW+ T W R Q
Sbjct: 237 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 269
>gi|355709034|gb|AES03459.1| prolyl 4-hydroxylase, transmembrane [Mustela putorius furo]
Length = 444
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 39 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
S+ +K++LR + KG +++ +R S ++ E + I +++ ++T L
Sbjct: 187 SLQEFSKMDLRDFHKYM-KGHKAASSELVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 245
Query: 99 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 141
P I E ++RY G Y++H D+ E P + S R + L
Sbjct: 246 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLIANESVPFETSCRYMTVL 305
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 189
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 306 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 365
Query: 190 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 219
LP+G +D S+HG C V G KW+A WI
Sbjct: 366 YLPDGQGWVGDVDDYSLHGGCLVTSGTKWIANNWI 400
>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
Length = 263
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 94/200 (47%), Gaps = 27/200 (13%)
Query: 28 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 87
+ +F + ++C I+ + K L PS LA + D+ IRTSS ++ ++ +
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA--GAASADD---IRTSSTCELAFLGNK--LVKD 137
Query: 88 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLV 142
++ +I L GE Y +G+ Y HYD F P + + + QR + ++
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQYKTHCLSRGQRTWTCMI 197
Query: 143 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 202
YL D +GG T F + + V+P++G L + +LLP+G + SIH
Sbjct: 198 YLNDECDGGHTRF---------------TKLDIAVRPKKGMALFWNNLLPSGDPNLNSIH 242
Query: 203 GSCPVVKGEKWVATKWIRDQ 222
+ PV +G K V TKW R +
Sbjct: 243 FAEPVTRGHKTVITKWFRTK 262
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 96/196 (48%), Gaps = 28/196 (14%)
Query: 35 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA----EDESGTLDLIEE 90
E+ + I +AK L +T+ D G+ T + +S + ED+ + +
Sbjct: 2 EEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLEEDDDPVVARVNR 54
Query: 91 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQRVASFLVYLTDLE 148
++ +T L E + Y +G +Y H+D + P + G + + R+A+FL Y++D+E
Sbjct: 55 RMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 114
Query: 149 EGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVV 208
GG T+FP +G + P++G + +Y+LL +G D + H +CPV+
Sbjct: 115 AGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 159
Query: 209 KGEKWVATKWIRDQEQ 224
G KWV+ KW ++ Q
Sbjct: 160 VGCKWVSNKWFHERGQ 175
>gi|223997846|ref|XP_002288596.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
CCMP1335]
gi|220975704|gb|EED94032.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
CCMP1335]
Length = 373
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 83/162 (51%), Gaps = 23/162 (14%)
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
RTSS + + + + ++I ++T +P+ N E+F IL+YK G+ Y SH+D+ D
Sbjct: 218 RTSSNAWCRKECENLTGVKGVSKRIEEMTGIPQNNYESFQILQYKPGEYYKSHHDSSDAN 277
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
+ RV +F +YL D+EEGGET F N + VKP++G L++
Sbjct: 278 K-DKVTGHRVLTFFLYLNDVEEGGETHFTKLN---------------ISVKPKRGRALVW 321
Query: 188 YSLL---PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
S+L PN T D H + V KG K+ A WI QYD
Sbjct: 322 PSVLNEDPNST-DNRMYHEAKSVEKGIKYAANHWIH---QYD 359
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 92/210 (43%), Gaps = 26/210 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 82
P + + + + I + + L+ +T+ V N +RTS FI + +
Sbjct: 302 PLLVLYHDVIYQSEIDVIRKLTENRLKRATVTGHNESVVSN---VRTSQFTFIPVSAHK- 357
Query: 83 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQKSQR 136
L I++++A +T L E Y IG Y H D F P+ R
Sbjct: 358 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNR 416
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+ L YL+D+ +GG T FP + +KP++ +++L +G
Sbjct: 417 IATVLFYLSDVSQGGGTAFPQLRTL---------------LKPKKYAAAFWHNLHASGVG 461
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 226
D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 462 DVRTQHGACPIIAGSKWVQNRWIREVDQSD 491
>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 169
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 68/111 (61%), Gaps = 7/111 (6%)
Query: 17 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 73
+V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG D +RTSSG+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD----VRTSSGM 113
Query: 74 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 124
F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+D F
Sbjct: 114 FVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYF 164
>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
Length = 478
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 67/262 (25%), Positives = 107/262 (40%), Gaps = 54/262 (20%)
Query: 1 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 60
+ + G D+ LS P+ F + + +I K ++PS + L G +
Sbjct: 135 LADAETGVDAGHRSVVTTLSMRPQVFRISQFMMGHETEKLIERNKPRIKPSEVGL-VGRS 193
Query: 61 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRING-----EAFNILRYKIGQ 115
D T RTS+ + +A+ + + I + L +I+ + +L Y+ Q
Sbjct: 194 GDKT---RTSTNAWDTASP-------VARDVIGRAFRLLKIDAHRKLEDGLQVLHYERPQ 243
Query: 116 KYNSHYDAFDPQEYGP----------------QKSQRVASFLVYLTDLEEGGETMFPF-- 157
Y H D F + G + R A+ +YL + GGET+FP
Sbjct: 244 WYKPHVDYFTSRNAGGGGASEDAFSNAIPTANNGTNRFATVFLYLNNAGSGGETVFPLST 303
Query: 158 -----------ENGMN--------ADGSYDYQ-KCIGLKVKPRQGDGLLFYSLLPNGTID 197
+ G N AD ++ K L+V PR GD +LFYS + ++D
Sbjct: 304 THEIYQGGRLTQAGTNRTPGFIRDADAAWVCDTKSEALRVTPRTGDSVLFYSQRGDASLD 363
Query: 198 PTSIHGSCPVVKGEKWVATKWI 219
S+HGSCP+ GEKW A W+
Sbjct: 364 GYSLHGSCPMGDGEKWAANLWV 385
>gi|296284739|ref|ZP_06862737.1| hypothetical protein CbatJ_14013 [Citromicrobium bathyomarinum
JL354]
Length = 210
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 93/202 (46%), Gaps = 28/202 (13%)
Query: 26 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 85
L F P+ C +I M + RPSTLA G+ RTS + + + L
Sbjct: 28 LQMRQFLDPDFCGELIAMIDADRRPSTLADHDGDMY-----FRTSETCDLPMDDPRTQRL 82
Query: 86 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASF 140
+ + +A+++ + +GE RY +GQ++ +H D F+P ++Y QR +F
Sbjct: 83 EAM---LAELSGIDPRHGEPLQGQRYAVGQEFKAHCDYFNPDGQDWEKYCSVAGQRTWTF 139
Query: 141 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 200
++YL + E GG T F K + +P G + + + P+ + +P +
Sbjct: 140 MIYLNEPEAGGATRF---------------KVLKKSFQPETGKLVCWNNRRPDQSTNPNT 184
Query: 201 IHGSCPVVKGEKWVATKWIRDQ 222
+H V KG K+V TKW R++
Sbjct: 185 MHHGMKVRKGTKYVITKWYREK 206
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/208 (25%), Positives = 102/208 (49%), Gaps = 28/208 (13%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 78
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 79 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 136
E++ + + ++ ++T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 137 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 196
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSGEG 493
Query: 197 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 102/218 (46%), Gaps = 39/218 (17%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLAL-RKGETVDNTQGIRTSSGVFISAAEDE 81
PR + + + + + + + +A+ LR +T+ R G+ T R S ++ A E
Sbjct: 307 PRIVRYHDVLSNREMEKVKELARPRLRRATVHDPRTGQLT--TAPYRVSKSAWLGAFE-- 362
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQK 133
+D I ++I +T L E + Y +G +Y H+D AF+ G
Sbjct: 363 HPIVDQINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGQKDEPDAFEELGTG--- 419
Query: 134 SQRVASFLVY-------LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 186
R+A++L+Y ++D++ GG T+F IG V P++G +
Sbjct: 420 -NRIATWLLYVSAAVLRMSDVQAGGATVFT---------------DIGASVLPQKGSAVF 463
Query: 187 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
+Y+L P+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 464 WYNLRPSGDGDYRTRHAACPVLLGNKWVSNKWIHERGQ 501
>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
pv. vasculorum NCPPB 702]
Length = 301
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 87/195 (44%), Gaps = 20/195 (10%)
Query: 35 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAK 94
++C+ ++ +A+ +LR S + + + +RTS G + ++ + + ++A
Sbjct: 123 DECRLLMLLARPHLRDSQV-IDPNDASTQRAPVRTSRGATLDPIIEDFAA-RVAQARLAA 180
Query: 95 VTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVASFLVYLTDLEEGG 151
L + E ++L Y G++Y +H D P P R + VYL ++ GG
Sbjct: 181 CAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNVVDAGG 240
Query: 152 ETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGE 211
ET FP G++V+PR G + F +L +G + S+H PV G
Sbjct: 241 ETEFPLA---------------GVRVQPRPGALVCFDNLHADGRPNADSLHAGLPVTAGS 285
Query: 212 KWVATKWIRDQEQYD 226
KW+ T W R Q D
Sbjct: 286 KWLGTLWFRQQRYRD 300
>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 77/157 (49%), Gaps = 20/157 (12%)
Query: 68 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 127
RT+ ++ + + + ++ E ++ + + E F +L Y IG Y H D F+
Sbjct: 363 RTTKAGWLDPSHNLIRRMGILTEDMSNLDL---ERSEDFQVLNYGIGGHYAVHPDFFEGS 419
Query: 128 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 187
P+ RVA+ L YL+D+ GG T+FP + L V P++G L++
Sbjct: 420 --NPELPDRVATLLFYLSDVPLGGATVFPL---------------LDLSVFPKKGAVLMW 462
Query: 188 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
Y+L G +IH +CPVV G +WV TKW+ Q Q
Sbjct: 463 YNLDHKGQGMEKTIHSACPVVVGSRWVMTKWVNQQPQ 499
>gi|359401514|ref|ZP_09194482.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
gi|357597189|gb|EHJ58939.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
Length = 232
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 66/221 (29%), Positives = 100/221 (45%), Gaps = 34/221 (15%)
Query: 6 AGDDSVTNIPFQVLSWMPRALYFPN-FATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 64
+G +V +P + + +Y N F C +I + PS L +G
Sbjct: 35 SGQPTVRRLPVEAVE-----IYTSNSFLNEADCAHLIALIDTCACPSRLLEEEG-----W 84
Query: 65 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 124
G RTS I + L+L +++ T + GE+ RY+ GQ +N H D F
Sbjct: 85 DGYRTSYSGDIDTHDRIVRDLEL---RLSDFTGIAPSCGESAQGQRYECGQYFNEHCDWF 141
Query: 125 DPQE-YGPQK----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 179
D + Y Q+ QR + ++YL +EEGG T F IGL + P
Sbjct: 142 DTEAGYWRQERRCGGQRSWTAMIYLNAVEEGGRTDFTH---------------IGLSIPP 186
Query: 180 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 220
G LL+ + LP+GT +P ++H + PVV+G K+V TKW R
Sbjct: 187 EPGCLLLWNNALPDGTPNPLTMHAARPVVRGVKYVVTKWFR 227
>gi|296474834|tpg|DAA16949.1| TPA: hypoxia-inducible factor prolyl 4-hydroxylase [Bos taurus]
Length = 494
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 39 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
S+ + ++LR +R +++Q +R S ++ E + I +++ ++T L
Sbjct: 237 SLQEFSNMDLRDFHKYMRS-HRAESSQLVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 295
Query: 99 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 141
P I E ++RY G Y++H D+ E P + S R + L
Sbjct: 296 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESVPFETSCRYMTVL 355
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 189
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 356 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 415
Query: 190 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 219
LP+G +D S+HG C V +G KW+A WI
Sbjct: 416 YLPDGQGWVGDVDDYSLHGGCLVTRGTKWIANNWI 450
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 63/122 (51%), Gaps = 16/122 (13%)
Query: 104 EAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN 162
E + Y +G Y H+D F DP Y ++ R+A+ + YL+++E+GG T FPF
Sbjct: 59 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPF----- 113
Query: 163 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
+ + VKP+ G+ L +Y+L + D + H CPV+KG KW+ WI +
Sbjct: 114 ----------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEV 163
Query: 223 EQ 224
Q
Sbjct: 164 TQ 165
>gi|294889729|ref|XP_002772943.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
ATCC 50983]
gi|239877523|gb|EER04759.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
ATCC 50983]
Length = 383
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 57/226 (25%), Positives = 105/226 (46%), Gaps = 39/226 (17%)
Query: 8 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLAL--------RKGE 59
++ VT + V+ P+ P+F TPE+C+ +I++A+ RPST+ + +
Sbjct: 160 EEGVTRLSAYVICRSPKVRLVPDFLTPEECEYMISLAEGKWRPSTVGRSSSSISDGKSDK 219
Query: 60 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 119
V+ RTSS + ++D+ + IE + A + P + E N+LRY+ G+ +
Sbjct: 220 YVNKRSKGRTSSSFMLLHSQDD--VVAEIERRAASLVGFPADHVERLNMLRYESGEFFGQ 277
Query: 120 HYD-AFDPQEYGPQKSQRVASFLVYLTDLEE--GGETMFPFENGMNADGSYDYQKCIGLK 176
H+D AF P + + L D+ GGET+FP +GLK
Sbjct: 278 HHDGAFRPW-----------TVFITLNDIPRGAGGETLFP---------------ALGLK 311
Query: 177 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 222
++P+ G L++ + L +G D +H + P K+ ++ ++
Sbjct: 312 IRPKAGTALVWPNCLEDGQADDRVVHEALPPTGVRKYAINCFVNEK 357
>gi|399057802|ref|ZP_10744231.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
gi|398041550|gb|EJL34606.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
Length = 210
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 96/199 (48%), Gaps = 28/199 (14%)
Query: 30 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 89
NF EQC ++ + + + RPST+A G+ RTSS +S + + +
Sbjct: 33 NFVAAEQCAELMALIEDSHRPSTIADYNGD-----DAFRTSSTCDLST---DVPVVANLA 84
Query: 90 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 144
+++++ + + E RY++GQ++ +H D F+P +Y QR +F++YL
Sbjct: 85 AALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPGNADYDKYCAVPGQRTWTFMIYL 144
Query: 145 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 204
++E GG T F + I ++P G + + + P+GT + ++H +
Sbjct: 145 NEVEAGGATRF---------------RVIDKMIQPEIGKLIAWNNRRPDGTPNAATLHHA 189
Query: 205 CPVVKGEKWVATKWIRDQE 223
V KG K+V T+W R++
Sbjct: 190 MKVRKGYKYVITQWYRERH 208
>gi|224012759|ref|XP_002295032.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
CCMP1335]
gi|220969471|gb|EED87812.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
CCMP1335]
Length = 194
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 94/201 (46%), Gaps = 26/201 (12%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA-AEDE 81
P + NF T E+ +I + K + E ++ G RTS + +
Sbjct: 2 PWLVSLENFLTDEEADYLIEVGKRQ------QYQLSEQRKDSLGTRTSYSAWCRRDCWKD 55
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 141
T+ + ++IAKVT + +LRY+ GQK+ H D G + QR+ +FL
Sbjct: 56 DATVSSVVDRIAKVTKVETKQLSNLQLLRYEEGQKFKQHTDFAAMLSRGRAQGQRLMTFL 115
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 201
+YL+D+EEGGET FP+ G+ ++PR+G +L+ +++ N D I
Sbjct: 116 IYLSDVEEGGETSFPYS---------------GVTIQPRKGHAVLWPNVM-NDDPDAKEI 159
Query: 202 ---HGSCPVVKGEKWVATKWI 219
H S PV+KG K + +I
Sbjct: 160 RADHMSLPVLKGVKHAVSIYI 180
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 23 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 81
P + + + + E+ + I +AK L +T+ R +T V T R S ++ +D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATV--RDPKTGVLTTASYRVSKSSWLEETDDP 393
Query: 82 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQRVAS 139
+ + ++ +T L E + Y +G +Y H+D + P + G + + R+A+
Sbjct: 394 --VVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 140 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 199
FL Y++D+E GG T+FP +G + P++G + +Y+LL +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 200 SIHGSCPVVKGEKWVATKWIRDQEQ 224
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 80/161 (49%), Gaps = 19/161 (11%)
Query: 66 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF- 124
RTS V++ +E+ TL L + ++ T L + E F ++ Y IG + SH+D
Sbjct: 379 SFRTSKSVWLDNDANEA-TLKLTQ-RLGDATGLDISHSEPFQVINYGIGGIFESHFDTSL 436
Query: 125 -DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 183
D + R+A+ L YL D+ +GG T FP G+N + V P+ G
Sbjct: 437 QDENRFLDGYMDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGT 481
Query: 184 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 224
L +Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 482 ALFWYNLDTKGLLRLRTMHTGCPVIVGSKWVVSKWIDDKGQ 522
>gi|115497762|ref|NP_001069583.1| transmembrane prolyl 4-hydroxylase [Bos taurus]
gi|92097562|gb|AAI14844.1| Prolyl 4-hydroxylase, transmembrane (endoplasmic reticulum) [Bos
taurus]
Length = 494
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 39 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 98
S+ + ++LR +R +++Q +R S ++ E + I +++ ++T L
Sbjct: 237 SLQEFSNMDLRDFHKYMRS-HRAESSQLVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 295
Query: 99 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 141
P I E ++RY G Y++H D+ E P + S R + L
Sbjct: 296 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESVPFETSCRYMTVL 355
Query: 142 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 189
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 356 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 415
Query: 190 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 219
LP+G +D S+HG C V +G KW+A WI
Sbjct: 416 YLPDGQGWVGDVDDYSLHGGCLVTRGTKWIADNWI 450
>gi|159474434|ref|XP_001695330.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275813|gb|EDP01588.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1887
Score = 79.7 bits (195), Expect = 7e-13, Method: Composition-based stats.
Identities = 57/211 (27%), Positives = 90/211 (42%), Gaps = 25/211 (11%)
Query: 20 SWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAE 79
S PR L F P C ++ +A L +R + R S F +
Sbjct: 1687 SLSPRVLVVDGFLPPGLCDALCAVAAPRL------IRSRVSTGAETPSRVSQSTFFTGDS 1740
Query: 80 DESGTLDLIEEKIAKVTMLPRING---------EAFNILRYKIGQKYNSHYDAFDPQEYG 130
+ +E ++ + P + EA ++ Y +G Y+ HYD + G
Sbjct: 1741 ARLPEVVAVEARLQALMERPEVTAGGRPTLVKSEALQVVSYDVGGFYSEHYD----NKTG 1796
Query: 131 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 190
S R A+ ++YL D + GG T FP N GL+V P +G L+F+S
Sbjct: 1797 GVIS-RAATIIIYLQDTQAGGSTHFP-----NQQLRLMRVARPGLRVYPAKGRALIFWSR 1850
Query: 191 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 221
LP+G+ D S+H + PV G KW+ T+W ++
Sbjct: 1851 LPDGSEDLASLHSAEPVRAGSKWICTRWFKE 1881
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.136 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,905,396,982
Number of Sequences: 23463169
Number of extensions: 162636039
Number of successful extensions: 296657
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1437
Number of HSP's successfully gapped in prelim test: 561
Number of HSP's that attempted gapping in prelim test: 291773
Number of HSP's gapped (non-prelim): 2166
length of query: 226
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 89
effective length of database: 9,144,741,214
effective search space: 813881968046
effective search space used: 813881968046
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 74 (33.1 bits)
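
The bit scores printed next to each raw score in this report can be reproduced from the statistics above with the standard Karlin-Altschul conversion, S' = (lambda * S - ln K) / ln 2. The sketch below assumes that the second Lambda/K pair (0.267, 0.0410) is the gapped parameter set used for the reported alignments; the function names are illustrative and not part of BLAST. The E-value estimate uses the plain formula E = (search space) * 2^(-S') and should only be expected to match the reported values in order of magnitude, since the report's E-values also reflect compositional adjustment.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Plain Karlin-Altschul estimate: E = search_space * 2^(-bits)."""
    return search_space * 2.0 ** (-bits)


# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267        # assumed to be the gapped Lambda
GAPPED_K = 0.0410            # assumed to be the gapped K
EFFECTIVE_QUERY_LEN = 89
EFFECTIVE_DB_LEN = 9_144_741_214

# Effective search space is the product of the two effective lengths.
search_space = EFFECTIVE_QUERY_LEN * EFFECTIVE_DB_LEN

# Raw score 196 is reported as 80.1 bits in several hits above.
bits_196 = bit_score(196, GAPPED_LAMBDA, GAPPED_K)
e_196 = expect(bits_196, search_space)

print(f"search space: {search_space}")
print(f"raw 196 -> {bits_196:.1f} bits, E ~ {e_196:.0e}")
```

The same conversion applied to the S2 cutoff (raw score 74) gives the 33.1 bits shown on the S2 line, which is a quick consistency check on the choice of parameters.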