BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022995
(289 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 212/289 (73%), Positives = 251/289 (86%), Gaps = 3/289 (1%)
Query: 4 GKSNKANWSLKSKIELP-FVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKD--E 60
GKS ++NWSL+SK+ LP +F+ CLF FLAG GS+LLSQDV RP AR+++S D E
Sbjct: 3 GKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDVTE 62
Query: 61 YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
+ M G+ GDDS+++IPFQVLSW PRALYFP FAT EQC+SI+N+AK LRPSTLALRK
Sbjct: 63 FDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRK 122
Query: 121 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
GET ++T+G+RTSSGVF SA+EDESGTL +IEEKIA+ TM+PR +GEA+NILRY+IGQKY
Sbjct: 123 GETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKY 182
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
NSHYDAF P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y++Q CIGLKV
Sbjct: 183 NSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKV 242
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
KPRQGDGLLFYS+ PNGTIDPTS+HGSCPV+KG+KWVATKWIRDQ Q D
Sbjct: 243 KPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 211/289 (73%), Positives = 250/289 (86%), Gaps = 3/289 (1%)
Query: 4 GKSNKANWSLKSKIELP-FVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKD--E 60
GKS ++NWSL+SK+ LP +F+ CLF FLAG GS+LLSQDV RP AR+++S D E
Sbjct: 3 GKSGRSNWSLRSKLGLPALIFVLCLFXFLAGFFGSTLLSQDVDDDRPRARLLQSASDVTE 62
Query: 61 YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
+ M G+ GDDS+++IPFQVLSW PRALYFP FAT EQC+SI+N+AK LRPSTLALRK
Sbjct: 63 FDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRK 122
Query: 121 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
GET ++T+G+RTSSGVF SA+EDESGTL +IEEK A+ TM+PR +GEA+NILRY+IGQKY
Sbjct: 123 GETAESTKGVRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQKY 182
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
NSHYDAF P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y++Q CIGLKV
Sbjct: 183 NSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGLKV 242
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
KPRQGDGLLFYS+ PNGTIDPTS+HGSCPV+KG+KWVATKWIRDQ Q D
Sbjct: 243 KPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 210/289 (72%), Positives = 248/289 (85%), Gaps = 4/289 (1%)
Query: 4 GKSNKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDE--- 60
K +K WS+KSK+ LP VFL+CLFFFLAGL S+L+SQ+V + + R ++ VK+E
Sbjct: 3 AKGSKGKWSIKSKLGLPVVFLSCLFFFLAGLFASNLISQNVNGDK-NRRQLQWVKEEIIE 61
Query: 61 YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
Y +P G GDD +T IPFQVLSW PRALYFPNFAT EQC+S+INMAK NL PSTLALRK
Sbjct: 62 YDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLALRK 121
Query: 121 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
GET +NT+GIRTSSG+F+SA+ED++G LD IEEKIA+ TMLPR NGEAFNILRY+IGQKY
Sbjct: 122 GETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKY 181
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
NSHYDAF+P EYGPQKSQRVASFL+YL+D+EEGGETMFPFEN ++ D SYD++KCIGL+V
Sbjct: 182 NSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIGLQV 241
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+PR+GDGLLFYSL PN TIDPTS+HGSCPV+KGEKWVATKWIRDQEQ D
Sbjct: 242 RPRRGDGLLFYSLFPNNTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 290
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 199/287 (69%), Positives = 245/287 (85%), Gaps = 6/287 (2%)
Query: 9 ANWSLKS-KIELPFVFLACLFFFLAGLLGSSLLSQ----DVTAARPSARVVESVKD-EYK 62
+NWSL++ K+ P+VFL C+FFFLAG GS+L S D RP R+++S K+ EY
Sbjct: 9 SNWSLRTNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYN 68
Query: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122
M G+ GDDS+T+IPFQVLSW PRALYFPNFAT EQC++I+++AK L+PS+LALRKGE
Sbjct: 69 LMTAGEFGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGE 128
Query: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 182
T +NT+GIRTSSGVF+SA+ D++ TL+ IEEKIA+ TM+PR +GEAFNILRY++GQ+YNS
Sbjct: 129 TTENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNS 188
Query: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242
HYDAF+P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ C+GL+VKP
Sbjct: 189 HYDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDCVGLRVKP 248
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
RQGDGLLFYSLLPNGTID TS+HGSCPV+KGEKWVATKWIR+ +Q D
Sbjct: 249 RQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQED 295
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 197/287 (68%), Positives = 243/287 (84%), Gaps = 6/287 (2%)
Query: 9 ANWSLKS-KIELPFVFLACLFFFLAGLLGSSLLSQ----DVTAARPSARVVESVKD-EYK 62
+NWSL++ K+ P+VFL C+FFFLAG GS+L S D RP R+++S K+ EY
Sbjct: 9 SNWSLRTNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYN 68
Query: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122
M G+ GDDS+T+IPFQVLSW PRALYFPNFAT EQC++I+++AK L+PS+LALRKGE
Sbjct: 69 LMTAGEFGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGE 128
Query: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 182
T +NT+GIRTSSGVF+SA+ D++ TL+ IEEKIA+ TM+PR +GEAFNILRY++GQ+Y S
Sbjct: 129 TTENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYS 188
Query: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242
HYDAF+P EYGPQKSQRVASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ +GL+VKP
Sbjct: 189 HYDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDRVGLRVKP 248
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
RQGDGLLFYSLLPNGTID TS+HGSCPV+KGEKWVATKWIR+ +Q D
Sbjct: 249 RQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQED 295
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/291 (70%), Positives = 247/291 (84%), Gaps = 5/291 (1%)
Query: 4 GKSNKANWSLKS-KIELPFVFLACLFFFLAGLLGSSLLSQ---DVTAARPSARVVESVKD 59
K+ K NWSL++ K+ P+VFL C+FFFLAG G +L S D RP R+++S +
Sbjct: 3 AKAVKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNE 62
Query: 60 -EYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLAL 118
EY MP G+ GDDS+T+IPFQVLSW PRA+YFPNFAT EQC+SII++AK L+PSTLAL
Sbjct: 63 AEYNLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLAL 122
Query: 119 RKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQ 178
R+GET DNT+GIRTSSGVF+SA+ED++ TLD+IEEKIA+ TM+PR +GEAFNILRY++ Q
Sbjct: 123 RQGETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQ 182
Query: 179 KYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 238
+YNSHYDAF+P EYGPQKSQR+ASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ CIGL
Sbjct: 183 RYNSHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGL 242
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
KVKPRQGDGLLFYSLL NGTIDPTS+HGSCPV+KGEKWVATKWIRDQE D
Sbjct: 243 KVKPRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELDD 293
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 420 bits (1079), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/291 (70%), Positives = 247/291 (84%), Gaps = 5/291 (1%)
Query: 4 GKSNKANWSLKS-KIELPFVFLACLFFFLAGLLGSSLLSQ---DVTAARPSARVVESVKD 59
K+ K NWSL++ K+ P+VFL C+FFFLAG G +L S D RP R+++S +
Sbjct: 3 AKAVKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNE 62
Query: 60 -EYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLAL 118
EY MP G+ GDDS+T+IPFQVLSW PRA+YFPNFAT EQC+SII++AK L+PSTLAL
Sbjct: 63 AEYNLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLAL 122
Query: 119 RKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQ 178
R+GET DNT+GIRTSSGVF+SA+ED++ TLD+IEEKIA+ TM+PR +GEAFNILRY++ Q
Sbjct: 123 RQGETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQ 182
Query: 179 KYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 238
+YNSHYDAF+P EYGPQKSQR+ASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ CIGL
Sbjct: 183 RYNSHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEGCIGL 242
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
KVKPRQGDGLLFYSLL NGTIDPTS+HGSCPV+KGEKWVATKWIRDQE D
Sbjct: 243 KVKPRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELDD 293
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 204/282 (72%), Positives = 240/282 (85%), Gaps = 2/282 (0%)
Query: 8 KANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHG 67
K W +K+ LP VFL CLFFFLAGL GS LL Q+ +++ P + E +Y + HG
Sbjct: 6 KGKWRFGTKLGLPVVFLFCLFFFLAGLFGSGLLPQEFSSSEPRRLIREET--DYDPLAHG 63
Query: 68 QAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
++G+DSVT+IPFQVLSW PRALYFPNFAT EQC+SIINMAK NL PST+ALR GE NT
Sbjct: 64 ESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVGEIRGNT 123
Query: 128 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 187
+GIRTSSGVFISA+ED++GTLDLIE+KIA+V M+PR +GEAFN+LRY+IGQ+YNSHYDAF
Sbjct: 124 EGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSHYDAF 183
Query: 188 DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDG 247
DP EYGPQKS R+A+FLVYL+D+EEGGETMFPFENG+N D YD+Q+CIGLKVKP QGDG
Sbjct: 184 DPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCIGLKVKPHQGDG 243
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
LLFYS+ PNGTIDPTS+HGSCPV+KGEKWVATKWIRDQEQ D
Sbjct: 244 LLFYSMFPNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 285
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 412 bits (1059), Expect = e-113, Method: Compositional matrix adjust.
Identities = 204/290 (70%), Positives = 244/290 (84%), Gaps = 5/290 (1%)
Query: 5 KSNKANWSLKS-KIELPFVFLACLFFFLAGLLGSSLLSQ---DVTAARPSARVVESVKD- 59
K+ K NWSL++ K+ P+VFL C+FFFLAG G +L S D RP R++E +
Sbjct: 4 KTVKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLEPGNEA 63
Query: 60 EYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR 119
EY MP GDDS+T+IPFQVLSW PRALYFPNFAT EQC++II++AK L+PSTLALR
Sbjct: 64 EYNLMPVRDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALR 123
Query: 120 KGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 179
+GET +NT+GIRTSSGVF+SA+ D++GTL +IEEKIA+ TM+PR +GEAFNILRY++ Q+
Sbjct: 124 QGETEENTKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQR 183
Query: 180 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLK 239
YNSHYDAF+P EYGPQKSQR+ASFL+YLTD+EEGGETMFPFENG+N DG+Y Y+ CIGLK
Sbjct: 184 YNSHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLK 243
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
VKPRQGDGLLFYSLL NGTIDPTS+HGSCPV+KGEKWVATKWIRDQEQ D
Sbjct: 244 VKPRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 293
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 197/289 (68%), Positives = 235/289 (81%), Gaps = 5/289 (1%)
Query: 1 MKGGKSNKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDE 60
MKG K W + K+ L +F++ FFFLAGL GS L SQDV R R++ESV +E
Sbjct: 1 MKG--KGKGVW--RPKLGLLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQPRLLESV-EE 55
Query: 61 YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
Y MPHG+ G+ SV IPFQVLSW PRALYFP FAT EQC+SII MAK +LRPSTLALR+
Sbjct: 56 YSPMPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQ 115
Query: 121 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
GET ++T+G RTSSG FISA+ED++G LD +E KIAK TM+PR +GEAFNILRY+IGQ+Y
Sbjct: 116 GETDESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRY 175
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
NSHYDAF+P EYGPQ SQRVASFL+YL+D+EEGGETMFPFE+ +N YDY+KCIGLKV
Sbjct: 176 NSHYDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKV 235
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
KP++GDGLLFYS+ PNGTID TS+HGSCPV+ GEKWVATKWIRD++Q D
Sbjct: 236 KPQRGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 284
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 186/272 (68%), Positives = 225/272 (82%), Gaps = 1/272 (0%)
Query: 19 LPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDE-YKWMPHGQAGDDSVTNI 77
LP V L C FF+AG S+L+SQDV +P R++E +E ++ MP G G+ + +I
Sbjct: 15 LPAVILVCSVFFVAGFYASTLISQDVPVIKPRLRMLEVTDEEKHQAMPRGVTGESYIESI 74
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
PFQVLSW PRA+YFP+FATPEQCK+II MAKL L+PS LALRKGET ++T+G RTSSG F
Sbjct: 75 PFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTRTSSGTF 134
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
+SA+ED +GTLD IE KIA+ TM+PR +GEAFNILRY+IGQKY+SHYD+F+P EYGPQ S
Sbjct: 135 LSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQMS 194
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVASFL+YL+D+E+GGETMFPFENG+ YDY+KC GLKVKPRQGDG+LFYSLLPNG
Sbjct: 195 QRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCAGLKVKPRQGDGILFYSLLPNG 254
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
TID TS+HGSCPV++GEKWVATKWIRDQ Q D
Sbjct: 255 TIDQTSLHGSCPVIEGEKWVATKWIRDQVQMD 286
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 194/289 (67%), Positives = 233/289 (80%), Gaps = 7/289 (2%)
Query: 1 MKGGKSNKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDE 60
MKG K WS K+ELP +FL+ FFLAGL GS L SQDV R R++ESV++E
Sbjct: 1 MKG----KGGWS--PKLELPLLFLSWSLFFLAGLFGSMLFSQDVNGVRSPPRLLESVEEE 54
Query: 61 YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
Y MPHG+ G+ SV IPFQVLSW PRA YFP+FAT EQC+SII MAK L PSTL LRK
Sbjct: 55 YSSMPHGETGESSVDLIPFQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRK 114
Query: 121 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
GET ++T+GIRTSSG FISA+ED++G LD IE KIAK TM+PR +GE FNILRY+IGQ+Y
Sbjct: 115 GETEESTKGIRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRY 174
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
NSHYDA P EYG Q SQR+ASFL+YL+D+EEGGETMFPFE+ +N + +++ +KCIGLKV
Sbjct: 175 NSHYDAISPAEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNIN-TFNSRKCIGLKV 233
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
KPR+GDGLLFYS+ PNGTID TS+HGSCPV++GEKWVATKWIRD++Q D
Sbjct: 234 KPRRGDGLLFYSVFPNGTIDWTSMHGSCPVIEGEKWVATKWIRDEQQED 282
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 183/277 (66%), Positives = 225/277 (81%), Gaps = 3/277 (1%)
Query: 14 KSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVK---DEYKWMPHGQAG 70
+ K+ L V + C FL G GS+LLSQ+V +P R++E V+ ++ MPHG G
Sbjct: 10 RKKLGLATVIVFCSLCFLVGFYGSTLLSQNVPRVKPRLRMLEMVENGEEDTGSMPHGVTG 69
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 130
++SV +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+PS LALRKGET +NT+G
Sbjct: 70 EESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGT 129
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTSSG FISA+ED +G LD +E KIA+ TM+PR +GE+FNILRY++GQKY+SHYD F+P
Sbjct: 130 RTSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPT 189
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY++CIGLKVKPR+GDGLLF
Sbjct: 190 EYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLF 249
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
YS+ PNGTID TS+HGSCPV KGEKWVATKWIRDQ+Q
Sbjct: 250 YSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQ 286
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 193/295 (65%), Positives = 232/295 (78%), Gaps = 14/295 (4%)
Query: 5 KSNKANWSLK-SKIELPFVFLACLFFFLAGLLGSSLLSQDVTAA---------RP-SARV 53
K+ + NWS + +KI P VFL C+FFFLAG GS+L Q + RP + R+
Sbjct: 4 KTVRGNWSWRMNKIAFPSVFLLCIFFFLAGFFGSTLFFQYHSQEDEHGLRQIQRPRTTRL 63
Query: 54 VESVKD---EYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLN 110
+E++ + E +P G+ GD+ +T IPFQVLSW P ALYFPNFAT EQC+SII AK
Sbjct: 64 LENLTEKETESHLLPAGETGDNFITTIPFQVLSWNPHALYFPNFATAEQCESIIETAKEG 123
Query: 111 LRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFN 170
L+PSTL LR GET ++T GIRTSSGVFISA ED++G LD+IEEKIA+ T +PR +GEAFN
Sbjct: 124 LKPSTLVLRVGETDESTTGIRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFN 183
Query: 171 ILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY 230
+LRYK+GQKY+SHYDA P YGPQKSQR+ASFL+YL+D+ EGGETMFPFENG+N DGSY
Sbjct: 184 VLRYKVGQKYSSHYDALHPDIYGPQKSQRMASFLLYLSDVPEGGETMFPFENGLNMDGSY 243
Query: 231 DYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
Y+KCIGLKVKPR+GDGLLFYSL PNGTIDP S+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 244 YYEKCIGLKVKPRKGDGLLFYSLFPNGTIDPMSLHGSCPVIKGEKWVATKWIRDQ 298
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 182/277 (65%), Positives = 225/277 (81%), Gaps = 3/277 (1%)
Query: 14 KSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVK---DEYKWMPHGQAG 70
+ K+ L V + C FL G GS+LLSQ+V +P R++E V+ +E MPHG G
Sbjct: 10 RKKLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLEMVENGEEEAGSMPHGVTG 69
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 130
++S+ +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+PS LALRKGET +NT+G
Sbjct: 70 EESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGT 129
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTSSG FISA+E+ +G LD +E KIA+ TM+PR +GE+FNILRY++GQKY+SHYD F+P
Sbjct: 130 RTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPT 189
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY++CIGLKVKPR+GDGLLF
Sbjct: 190 EYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLF 249
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
YS+ PNGTID TS+HGSCPV KGEKWVATKWIRDQ+Q
Sbjct: 250 YSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQ 286
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 182/277 (65%), Positives = 225/277 (81%), Gaps = 3/277 (1%)
Query: 14 KSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARV---VESVKDEYKWMPHGQAG 70
+ K+ L V + C FL G GS+LLSQ+V +P R+ VE+ ++E MPHG G
Sbjct: 10 RKKLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLDMVENGEEEASSMPHGVTG 69
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 130
++S+ +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+PS LALRKGET +NT+G
Sbjct: 70 EESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGT 129
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTSSG FISA+E+ +G LD +E KIA+ TM+PR +GE+FNILRY++GQKY+SHYD F+P
Sbjct: 130 RTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPT 189
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY++CIGLKVKPR+GDGLLF
Sbjct: 190 EYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLF 249
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
YS+ PNGTID TS+HGSCPV KGEKWVATKWIRDQ+Q
Sbjct: 250 YSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQ 286
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/289 (67%), Positives = 232/289 (80%), Gaps = 13/289 (4%)
Query: 1 MKGGKSNKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDE 60
MKG K W + K+ L +F++ FFFLAGL GS L SQ R++ESV +E
Sbjct: 1 MKG--KGKGVW--RPKLGLLLLFISWSFFFLAGLFGSMLFSQ--------PRLLESV-EE 47
Query: 61 YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
Y MPHG+ G+ SV IPFQVLSW PRALYFP FAT EQC+SII MAK +LRPSTLALR+
Sbjct: 48 YSPMPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQ 107
Query: 121 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
GET ++T+G RTSSG FISA+ED++G LD +E KIAK TM+PR +GEAFNILRY+IGQ+Y
Sbjct: 108 GETDESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRY 167
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
NSHYDAF+P EYGPQ SQRVASFL+YL+D+EEGGETMFPFE+ +N YDY+KCIGLKV
Sbjct: 168 NSHYDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKV 227
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
KP++GDGLLFYS+ PNGTID TS+HGSCPV+ GEKWVATKWIRD++Q D
Sbjct: 228 KPQRGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 276
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/292 (64%), Positives = 228/292 (78%), Gaps = 11/292 (3%)
Query: 5 KSNKANWSLK---SKIELPFVFLACLFFFLAGLLGSSLLSQDVTAAR-----PSARVVES 56
K+ K NW +K++LP VFL C+FFFLAG SSL S P AR++E
Sbjct: 4 KTVKGNWRWNWRMNKLDLPSVFLICIFFFLAGFSASSLFSPSQEHEYDLRLIPRARLLEK 63
Query: 57 V---KDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRP 113
K E+ + G +GDD +T IPFQVLSW PRALYFPNF + EQC++II MA+ L+P
Sbjct: 64 STHEKTEHHLLKAGDSGDDYITLIPFQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKP 123
Query: 114 STLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 173
STL LRKGET ++T+GIRTS GVF+SA+EDE+G LD IEEKIAK T +PR +GEAFNILR
Sbjct: 124 STLVLRKGETEESTKGIRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILR 183
Query: 174 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 233
Y++GQKY+ HYDAFD E+GP +SQR ASFL+YLTD+ EGGET+FP+ENG N DGSYD++
Sbjct: 184 YEVGQKYSPHYDAFDEAEFGPLQSQRAASFLLYLTDVPEGGETLFPYENGFNRDGSYDFE 243
Query: 234 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
CIGL+V+PR+GDGLLFYSLLPNGTID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 244 DCIGLRVRPRKGDGLLFYSLLPNGTIDQTSVHGSCPVIKGEKWVATKWIRDQ 295
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 389 bits (999), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/280 (65%), Positives = 219/280 (78%), Gaps = 8/280 (2%)
Query: 14 KSKIELPFVFLACLFFFLAGLLGS--------SLLSQDVTAARPSARVVESVKDEYKWMP 65
K K+ LP + C FFLAG S SL + DV RP R++E + ++ MP
Sbjct: 9 KRKLGLPAAIILCSLFFLAGFYSSIFFPHVSASLSACDVPVTRPRLRMLEVAEGDHDTMP 68
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
HG G+ SV +IPFQVLSW PRALYFP FATPEQC+SII M + L+PSTLALRKGET +
Sbjct: 69 HGVTGEASVESIPFQVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTLALRKGETAE 128
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+T+ RTSSG F+S +EDE+GTLD IE+KIAK TM+P+ +GEAFNILRY+IGQKY+SHYD
Sbjct: 129 STKDTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSHYD 188
Query: 186 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 245
AF+P EYG Q SQR ASFL+YL+++EEGGETMFPFENG +DY++C+GLKVKPRQG
Sbjct: 189 AFNPDEYGQQSSQRTASFLLYLSNVEEGGETMFPFENGSAVIPGFDYKQCVGLKVKPRQG 248
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
DGLLFYSL PNGTIDPTS+HGSCPV+KG KWVATKWIRDQ
Sbjct: 249 DGLLFYSLFPNGTIDPTSLHGSCPVIKGVKWVATKWIRDQ 288
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 183/279 (65%), Positives = 224/279 (80%), Gaps = 9/279 (3%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAA----RP--SARVVESVKDEYKWMPH 66
+++++ LP V L+C FFLAG LGS L +QD RP RV+E+V E M +
Sbjct: 70 MRTRLRLPVVLLSCSLFFLAGFLGSLLFTQDPQGEEDPERPLRRERVMEAVWPE---MAY 126
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G++GD + + IP+Q+LSW PRALYFP FAT EQC++++ AK LRPSTLALRKGE+ +
Sbjct: 127 GESGDPAPSLIPYQILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRKGESEET 186
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T+GIRTSSG F+SA ED +G L IE KIAK TM+PR +GE FN+LRY+IGQKY SHYDA
Sbjct: 187 TKGIRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDA 246
Query: 187 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
FDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N + YDY++CIGLKVKPR+GD
Sbjct: 247 FDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGD 306
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
GLLFYSL+ NGTIDPTS+HGSCPVV+GEKWVATKWIRD+
Sbjct: 307 GLLFYSLMVNGTIDPTSLHGSCPVVRGEKWVATKWIRDK 345
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 385 bits (990), Expect = e-105, Method: Compositional matrix adjust.
Identities = 179/255 (70%), Positives = 219/255 (85%), Gaps = 2/255 (0%)
Query: 37 SSLLS-QDVTAARPSARVVESV-KDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNF 94
S+L S QDV + RP R +ESV ++++ +P G+ G+ S+ +IPFQVLSW PRALYFPNF
Sbjct: 32 SALFSHQDVPSVRPRPRFLESVYQEDFDPLPIGETGEHSLISIPFQVLSWFPRALYFPNF 91
Query: 95 ATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEK 154
A+ EQC+SII MAK N+ PS+LALR GET + T+GIRTSSG FISA+ED++G LDLIEEK
Sbjct: 92 ASIEQCQSIIKMAKANMEPSSLALRTGETEETTKGIRTSSGTFISASEDKTGILDLIEEK 151
Query: 155 IAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGG 214
IAK TM+P+ +GEAFN+LRY+IGQ+Y SHYDAFDP +YGPQKSQR ASFL+YL+D+EEGG
Sbjct: 152 IAKATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQRAASFLLYLSDVEEGG 211
Query: 215 ETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGE 274
ET+FP+ENG N D SYD+ KCIGLKVKPR+GDGLLFYSL PNGTID TS+HGSCPV++GE
Sbjct: 212 ETVFPYENGQNMDASYDFSKCIGLKVKPRRGDGLLFYSLFPNGTIDLTSLHGSCPVIRGE 271
Query: 275 KWVATKWIRDQEQYD 289
KWVATKWIR+Q+Q D
Sbjct: 272 KWVATKWIRNQDQDD 286
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/297 (63%), Positives = 223/297 (75%), Gaps = 15/297 (5%)
Query: 5 KSNKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQD------------VTAARPSAR 52
K+ K N + +K++LP VFL C+FFFLAG SSL S
Sbjct: 4 KTVKGNRRM-NKLDLPSVFLICIFFFLAGFSASSLFSHSQKHEYDLRLRPRPRPRARLLE 62
Query: 53 VVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLR 112
K EY + G +GDD VT IPFQVLSW PRALYFPNFA+ EQC+SII MA+ L+
Sbjct: 63 KSTQEKTEYHLLKAGDSGDDYVTLIPFQVLSWYPRALYFPNFASAEQCESIIEMARGGLK 122
Query: 113 PSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNIL 172
STLALRKGET ++T+GIRTSSGVF+SA+EDE+G LD IEEKIAK T +PR +GEAFNIL
Sbjct: 123 SSTLALRKGETEESTKGIRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNIL 182
Query: 173 RYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDY 232
RY++GQKYNSHYDAFD EYGP +SQRVASFL+YLTD+ EGGETMFP+ENG N DG +
Sbjct: 183 RYEVGQKYNSHYDAFDEAEYGPLQSQRVASFLLYLTDVPEGGETMFPYENGFNRDG--NV 240
Query: 233 QKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+ CIGL+V+PR+GD LLFYSLLPNGTID TS HGSCPV+KGEKWVATKWIR+Q Q D
Sbjct: 241 EDCIGLRVRPRKGDALLFYSLLPNGTIDQTSAHGSCPVIKGEKWVATKWIRNQVQDD 297
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 178/278 (64%), Positives = 222/278 (79%), Gaps = 1/278 (0%)
Query: 12 SLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVV-ESVKDEYKWMPHGQAG 70
S K K+ +P +F+ C FF G S LL QD+ P +R++ ESVK EY+ + HG++G
Sbjct: 8 SSKLKLGVPTLFILCALFFFVGFFVSPLLFQDLDDVGPRSRILQESVKKEYEPLEHGESG 67
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 130
+ V +IP Q+LSW PRA++FPNF + E C+ II MAK L PS LALRKGET ++T+
Sbjct: 68 EPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETAESTKDT 127
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTSSG FISA+ED+SG LDL+E KIAKVTM+PR +GE FNIL+Y++GQKY+SHYDAF+P
Sbjct: 128 RTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSHYDAFNPD 187
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
EYG +SQR+ASFL+YL+++E GGETMFP+E G+N D YDYQKCIGLKVKPRQGDGLLF
Sbjct: 188 EYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDRGYDYQKCIGLKVKPRQGDGLLF 247
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
YSLLPNG ID TS+HGSCPV+KGEKWVATKWI D+EQ+
Sbjct: 248 YSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 180/275 (65%), Positives = 221/275 (80%), Gaps = 6/275 (2%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAAR---PSARVVESVKDEYKWMPHGQA 69
L++++ LP V L+C FFLAG GS L +QD A P R++E+ E MP+G++
Sbjct: 20 LRTRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLEAAWPE---MPYGES 76
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G+ + + IP+Q+LSW PRALYFP FAT EQC++I+ AK L+PSTLALRKGET ++T+G
Sbjct: 77 GEAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG 136
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
IRTSSG F+SA ED + TL IE+KIA+ TMLPR +GE FN+LRY IGQ+Y SHYDAFDP
Sbjct: 137 IRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDP 196
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+YGPQK+QRVASFL+YLTD+EEGGETMFP+EN N D YDY+KCIGLKVKPR+GDGLL
Sbjct: 197 AQYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLL 256
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
FYSL+ NGTID TS+HGSCPV+KGEKWVATKWIRD
Sbjct: 257 FYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRD 291
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 179/275 (65%), Positives = 222/275 (80%), Gaps = 6/275 (2%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAAR---PSARVVESVKDEYKWMPHGQA 69
+++++ LP V L+C FFLAG GS L +QD A P R++E+ E MP+G++
Sbjct: 20 MRTRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLETAWPE---MPYGES 76
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G+ + + IP+Q+LSW PRALYFP FAT EQC++I+ AK L+PSTLALRKGET ++T+G
Sbjct: 77 GEAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG 136
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
IRTSSG F+SA ED + TL IE+KIA+ TM+PR +GE FN+LRY IGQ+Y SHYDAFDP
Sbjct: 137 IRTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDP 196
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+YGPQKSQRVASFL+YLT++EEGGETMFP+ENG N D YDY+KCIGLKVKPR+GDGLL
Sbjct: 197 VQYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLL 256
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
FYSL+ NGTID TS+HGSCPV+KGEKWVATKWIRD
Sbjct: 257 FYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRD 291
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 179/275 (65%), Positives = 221/275 (80%), Gaps = 6/275 (2%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAAR---PSARVVESVKDEYKWMPHGQA 69
L++++ LP V L+C FFLAG GS L +QD A P R++++ E MP+G++
Sbjct: 20 LRTRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLDAAWPE---MPYGES 76
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G+ + + IP+Q+LSW PRALYFP FAT EQC++I+ AK L+PSTLALRKGET ++T+G
Sbjct: 77 GEAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG 136
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
IRTSSG F+SA ED + TL IE+KIA+ TMLPR +GE FN+LRY IGQ+Y SHYDAFDP
Sbjct: 137 IRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDP 196
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+YGPQK+QRVASFL+YLTD+EEGGETMFP+EN N D YDY+KCIGLKVKPR+GDGLL
Sbjct: 197 AQYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLL 256
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
FYSL+ NGTID TS+HGSCPV+KGEKWVATKWIRD
Sbjct: 257 FYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRD 291
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/281 (62%), Positives = 223/281 (79%), Gaps = 9/281 (3%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQD------VTAARPSARVVESVKDEYKWMPH 66
+++++ LP V L+C FFLAG GS L +QD + R++E+ + M +
Sbjct: 31 MRTRLRLPVVLLSCSLFFLAGFFGSILFTQDPQGEEELDTPMRRERLMEAA---WPGMAY 87
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G++G+ + IP+Q+LSW PRALYFP FAT +QC++I+ AK L PSTLALRKGET ++
Sbjct: 88 GESGEPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEES 147
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T+GIRTSSG F+S+ ED +GTL +E+KIAK TM+PR +GE FNILRY+IGQ+Y SHYDA
Sbjct: 148 TKGIRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDA 207
Query: 187 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
FDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N D YDY+KCIGLKVKPR+GD
Sbjct: 208 FDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGD 267
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
GLLFYSL+ NGTIDPTS+HGSCPV+KGEKWVATKWIRD+ +
Sbjct: 268 GLLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 308
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/281 (62%), Positives = 223/281 (79%), Gaps = 9/281 (3%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQD------VTAARPSARVVESVKDEYKWMPH 66
+++++ LP V L+C FFLAG GS L +QD + R++E+ + M +
Sbjct: 1 MRTRLRLPVVLLSCSLFFLAGFFGSILFTQDPQGEEELDTPMRRERLMEAA---WPGMAY 57
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G++G+ + IP+Q+LSW PRALYFP FAT +QC++I+ AK L PSTLALRKGET ++
Sbjct: 58 GESGEPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEES 117
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T+GIRTSSG F+S+ ED +GTL +E+KIAK TM+PR +GE FNILRY+IGQ+Y SHYDA
Sbjct: 118 TKGIRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDA 177
Query: 187 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
FDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N D YDY+KCIGLKVKPR+GD
Sbjct: 178 FDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGD 237
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
GLLFYSL+ NGTIDPTS+HGSCPV+KGEKWVATKWIRD+ +
Sbjct: 238 GLLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 278
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/279 (64%), Positives = 223/279 (79%), Gaps = 9/279 (3%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAA----RPSAR--VVESVKDEYKWMPH 66
+++++ LP V L+C FFLAG GS L +QD RP +R ++E+V E M +
Sbjct: 18 MRTRLRLPVVLLSCSLFFLAGFFGSLLFTQDPQGEEELERPLSRERLMEAVWPE---MAY 74
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G +GD + + IP+Q+LSW PRALYFP FAT EQC++++ AK LRPSTLALRKGET +
Sbjct: 75 GDSGDPAPSLIPYQILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEET 134
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T+GIRTSSG F+SA ED + TL +E+KIAK TM+PR +GE FN+LRY+IGQKY SHYDA
Sbjct: 135 TKGIRTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDA 194
Query: 187 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
FDP +YGPQKSQRVASFL+YLTD+EEGGETMFP+ENG N D YDY++CIGLKVKPR+GD
Sbjct: 195 FDPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGD 254
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
GLLFYSL+ NGTID TS+HGSCPV+KGEKWVATKWIR++
Sbjct: 255 GLLFYSLMVNGTIDLTSLHGSCPVIKGEKWVATKWIRNK 293
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 191/293 (65%), Positives = 235/293 (80%), Gaps = 9/293 (3%)
Query: 4 GKSNKANWSLKSK-IELPFVFLACLFFFLAGLLGSSLL---SQDVTAARPSARVVE-SVK 58
K+ KANWSL++K LP VFL C+FFFLAG G S +D R R++E SVK
Sbjct: 3 AKTVKANWSLRTKKFSLPSVFLLCIFFFLAGFFGFSFFHHSQEDEYGVR--VRLLEKSVK 60
Query: 59 DE--YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL 116
DE ++ + G++GD+ +T+IPFQVLSW PRALYFPNFA+ EQC II MAK L PS L
Sbjct: 61 DETEHRLLHAGKSGDNFITSIPFQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRL 120
Query: 117 ALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 176
LR+GET + T+GIRTSSG+FISA+ED++G L++I+EKIA+ +P+ +G A+NILRYK+
Sbjct: 121 MLREGETEEGTKGIRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKV 180
Query: 177 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI 236
GQKYNSHYDAF+P EYGPQ+SQRVASFL+YLTD+ EGGETMFPFENG N D SY+++ CI
Sbjct: 181 GQKYNSHYDAFNPAEYGPQESQRVASFLLYLTDVPEGGETMFPFENGSNMDSSYNFEDCI 240
Query: 237 GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
GLK+KP +GDGLLFYSL PNGTIDPTS+HGSCPV+KGEKWVATKWIR+Q YD
Sbjct: 241 GLKIKPLKGDGLLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWIREQLHYD 293
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 180/295 (61%), Positives = 225/295 (76%), Gaps = 12/295 (4%)
Query: 4 GKSNKANWSL--KSKIELPFVFLACLFFFLAGLLGSSLL-----SQDVTAARPSARVVES 56
GK+ K NW K + P + L+C FFF+AGL S+LL S D R AR ++S
Sbjct: 3 GKACKENWRFYKKPNLGFPALILSCSFFFIAGLFASNLLLSQGTSSDERWLRARARQLQS 62
Query: 57 VKDE----YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLR 112
V++E Y +P G++GDD +T IPFQVLSW PRALY+P F T EQC+ IINMAK +L+
Sbjct: 63 VEEEIISKYDLLPSGESGDDFITLIPFQVLSWRPRALYYPGFITAEQCQHIINMAKPSLQ 122
Query: 113 PSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNIL 172
PSTLALRKGET + T+GIRTSSG+F+ ++ED++G L +IEEKIA+ TM+P +GEAFN+L
Sbjct: 123 PSTLALRKGETAETTKGIRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFNVL 182
Query: 173 RYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDY 232
RY+IGQKY++HYDAF+P EYGPQ SQRVA+FL+YL++ EEGGET FP EN N +G YD
Sbjct: 183 RYEIGQKYDAHYDAFNPAEYGPQTSQRVATFLLYLSNFEEGGETTFPIENDENFEG-YDA 241
Query: 233 QKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
QKC GL+VKP QGD +LFYS+ PN TIDP S+H SC V+KGEKWVATKWIRDQ Q
Sbjct: 242 QKCNGLRVKPHQGDAILFYSIFPNNTIDPASLHASCHVIKGEKWVATKWIRDQVQ 296
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 167/276 (60%), Positives = 211/276 (76%), Gaps = 5/276 (1%)
Query: 13 LKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDD 72
L+ K+ P V L C FFFL G GS+ LS++ V++ V +E HG+ GD
Sbjct: 14 LRYKLGFPLVLLCCTFFFLVGFYGSNSLSKEEKHV-----VIDPVTNEKLVFEHGRTGDS 68
Query: 73 SVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRT 132
SVT+IPFQVLSW PRAL +PNFA+ EQC++II +A+ L PS LALRKGE+ T+ IRT
Sbjct: 69 SVTDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKEIRT 128
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 192
SSG F+ A+ED++ +L +EEK+A+ TM+PR NGEAFN+LRY GQKY+ HYD FDP EY
Sbjct: 129 SSGTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEY 188
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
GPQ SQR+ASFL+YL+D+EEGGETMFPFEN N + Y+Y+ CIGLKVKPRQGD LLFYS
Sbjct: 189 GPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCIGLKVKPRQGDALLFYS 248
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
+ PNGT D T++HGSCPV+KGEKWVATKWIR+ +++
Sbjct: 249 MHPNGTFDKTALHGSCPVIKGEKWVATKWIRNTDKF 284
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 359 bits (922), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 168/259 (64%), Positives = 209/259 (80%), Gaps = 2/259 (0%)
Query: 32 AGLLGSSLLSQDVTAARPSARVV-ESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALY 90
G S LL QD+ P +R++ ESVK Y+ + HG++G+ + +IPFQ+LSW PRA+Y
Sbjct: 27 VGFFVSPLLFQDLGDVGPRSRILQESVKKAYEPLEHGESGEPFLNSIPFQILSWRPRAVY 86
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 150
FPNF + E C+ II MAK L PS LALRKGET ++T+ RTSSG FISA+ED+SG LD
Sbjct: 87 FPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETAESTKDTRTSSGTFISASEDKSGILDF 146
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 210
+E KIAKVTM+PR +GE FNIL+Y++ QKY+SHYDAF+P EYG +SQR+ASFL+YL+++
Sbjct: 147 VERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHYDAFNPDEYGTVESQRIASFLLYLSNV 206
Query: 211 EEGGETMFPFENGMNAD-GSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 269
E GGETMFP+E G+N D G YDY+KCIGLKVKPRQGDGLLFYSLLPNG ID TS+HGSCP
Sbjct: 207 EAGGETMFPYEGGLNIDKGYYDYKKCIGLKVKPRQGDGLLFYSLLPNGKIDKTSLHGSCP 266
Query: 270 VVKGEKWVATKWIRDQEQY 288
V+KGEKWVATKWI D+EQ+
Sbjct: 267 VIKGEKWVATKWIDDREQH 285
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 161/280 (57%), Positives = 208/280 (74%), Gaps = 8/280 (2%)
Query: 16 KIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDEYKWM-------PHGQ 68
++ P V + C+ FL G Q + + S S ++ + HG
Sbjct: 13 QVSAPLVLVLCVLAFLVGYFIPEF-QQVILVTKHSIPSFSSFANQRHELLEDVTVAEHGV 71
Query: 69 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 128
GDD ++ IPFQVLSW PRAL FP FA+P QC++II++AK L PS+LALRKGET TQ
Sbjct: 72 TGDDQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQ 131
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 188
+RTS G F+S+ +D++GTL +EEK+AK TM+P+ +GEAFN+LRY+IGQKYNSHYD F+
Sbjct: 132 DVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFN 191
Query: 189 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 248
P EYGPQKSQR+ASFL+YL+D+EEGGETMFPFEN + + +YDY++CIGLKVKP+QGD L
Sbjct: 192 PAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQGDAL 251
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
LFYS+ PNGT D T++HGSCPV+KGEKWVATKWIRD+E +
Sbjct: 252 LFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKEDW 291
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 343 bits (879), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 151/223 (67%), Positives = 189/223 (84%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
HG G+D + IPFQVLSW PRAL FP FA+P QC++II++AK L PS+LALRKGET
Sbjct: 8 HGVTGEDQLAFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETAT 67
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
TQ +RTS G F+S+ +D++GTL +EEK+AK TM+P+ +GEAFN+LRY+IGQKYNSHYD
Sbjct: 68 ETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYD 127
Query: 186 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 245
F+P EYGPQKSQR+ASFL+YL+D+EEGGETMFPFEN + + +YDY++CIGLKVKP+QG
Sbjct: 128 VFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQG 187
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
D LLFYS+ PNGT D T++HGSCPV+KGEKWVATKWIRD+E +
Sbjct: 188 DALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKEDW 230
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 161/299 (53%), Positives = 212/299 (70%), Gaps = 11/299 (3%)
Query: 1 MKGGKSNKANWS-LKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKD 59
MK K+ W+ K+ LP V L C LAG S+ Q + +S +
Sbjct: 1 MKSSKNKGRKWNPFTYKLSLPQVLLWCTICLLAGYAASNFFPQKIEEEAIYQPYRKSAQQ 60
Query: 60 EYKW----------MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKL 109
E ++ + HG GD+ + +IPFQVLSW PRAL +P FA+ EQC++I+ +A+
Sbjct: 61 EGEFPFGEFSEKVVLDHGSTGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLART 120
Query: 110 NLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAF 169
L PS LALRKGE+ D+T+ IRTSSG F+ A ED + +L+ +EEK+AK TM+PR NGEAF
Sbjct: 121 RLAPSALALRKGESEDSTKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAF 180
Query: 170 NILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGS 229
N+L+Y +GQKY+ HYD FDP EYGPQ SQR+ASFL+YL+D+EEGGETMFPFEN N +
Sbjct: 181 NVLKYNVGQKYDCHYDVFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNIG 240
Query: 230 YDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
+DY+KCIG+KVKPRQGD LLFYS+ PNGT D +++HGSCPV+KGEKWVATKWIR+ +++
Sbjct: 241 FDYKKCIGMKVKPRQGDALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKWIRNTDKF 299
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 162/273 (59%), Positives = 200/273 (73%), Gaps = 11/273 (4%)
Query: 14 KSKIELPFVFLACLFFF-LAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDD 72
+ K+ P+VFLAC FF + G +LLSQ ++ + S R SV +E + + HG
Sbjct: 7 EKKLGFPYVFLACCFFLAIFGFCFFNLLSQIISFSEISTR--RSVNEETESLDHG----- 59
Query: 73 SVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRT 132
SV+NIPF LSW PR Y PNFAT +QC+++I+MAK L+PS LALRKGET + TQ +RT
Sbjct: 60 SVSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRT 119
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 192
EDESG L IEEKIA T +P E+FNILRY++GQKY+SHYDAF P EY
Sbjct: 120 R---LKKTDEDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEY 176
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
GPQ SQRV +F+++L+ +EEGGETMFPFENG N +G YDY+ CIGL+VKPRQGD + FY+
Sbjct: 177 GPQISQRVVTFILFLSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYN 236
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
LLPN TID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 237 LLPNRTIDQTSLHGSCPVIKGEKWVATKWIRDQ 269
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 329 bits (844), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 161/278 (57%), Positives = 201/278 (72%), Gaps = 11/278 (3%)
Query: 10 NWSLKSKIELPFVFLACLFFF-LAGLLGSSLLSQDVTAAR-PSARVVESVKDEYKWMPHG 67
++ + K+ P+VF+AC FF + G +L SQ ++ + P+ R SV DE + HG
Sbjct: 3 SYGKEKKLVFPYVFIACCFFLAIFGFCFFNLFSQGISFSEIPTTR--RSVNDETDSLDHG 60
Query: 68 QAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
SV+NIPF LSW PR Y PNFAT +QC+++I+MAK L+PSTLALRKGET + T
Sbjct: 61 S----SVSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETT 116
Query: 128 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 187
Q R+ + EDESG L IEEKIA T P+ E+FNILRY++GQKY+SHYDAF
Sbjct: 117 QNYRS---LHQHTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAF 173
Query: 188 DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDG 247
EYGP SQRV +FL++L+ +EEGGETMFPFENG N +G YDY+KC+GLKVKPRQGD
Sbjct: 174 HSAEYGPLISQRVVTFLLFLSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDA 233
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+ FY+L PNGTID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 234 IFFYNLFPNGTIDQTSLHGSCPVIKGEKWVATKWIRDQ 271
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 143/208 (68%), Positives = 178/208 (85%)
Query: 53 VVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLR 112
+VE+ ++E MPHG G++S+ +IPFQVLSW PRA+YFPNFAT EQC++II AK+NL+
Sbjct: 1 MVENGEEEASSMPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLK 60
Query: 113 PSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNIL 172
PS LALRKGET +NT+G RTSSG FISA+E+ +G LD +E KIA+ TM+PR +GE+FNIL
Sbjct: 61 PSALALRKGETAENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNIL 120
Query: 173 RYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDY 232
RY++GQKY+SHYD F+P EYGPQ SQR+ASFL+YL+D+EEGGETMFPFENG N YDY
Sbjct: 121 RYELGQKYDSHYDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDY 180
Query: 233 QKCIGLKVKPRQGDGLLFYSLLPNGTID 260
++CIGLKVKPR+GDGLLFYS+ PNGTID
Sbjct: 181 KQCIGLKVKPRKGDGLLFYSVFPNGTID 208
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 280 bits (715), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 130/184 (70%), Positives = 153/184 (83%), Gaps = 9/184 (4%)
Query: 106 MAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRIN 165
MAK L+PSTLALRKGET ++T FI +ED++GTLD IE KIAK TM+P+ +
Sbjct: 1 MAKSKLKPSTLALRKGETTEST---------FIGGSEDKTGTLDFIERKIAKATMIPQSH 51
Query: 166 GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN 225
GEAFNILRY+IGQKY+SHYDAF+P EYGPQ SQRVASFL+YL+ +EEGGETMFPFENG
Sbjct: 52 GEAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFENGSA 111
Query: 226 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
++Y++C+GLKVKPRQGDGLLFYSL PNGTID TS+HGSCPV+KGEKWVATKWIRDQ
Sbjct: 112 VSSGFEYKQCVGLKVKPRQGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 171
Query: 286 EQYD 289
+ D
Sbjct: 172 MELD 175
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 129/193 (66%), Positives = 148/193 (76%), Gaps = 33/193 (17%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGE---------------------- 167
IR SGVFISA+ED++GTLDLIE+KIA+V M+PR +GE
Sbjct: 90 IRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVM 149
Query: 168 -----------AFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGET 216
AFNILRY+IGQ+YNSHYDAFDP EYGPQKS R+A+FLVYL+D+EEGGET
Sbjct: 150 KRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGET 209
Query: 217 MFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKW 276
MFPFENG+N D YD+Q+CIGLKVKP QGDGLLFYS+ PNGTIDPTS+HGSCPV+KGEKW
Sbjct: 210 MFPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKW 269
Query: 277 VATKWIRDQEQYD 289
VATKWIRDQEQ D
Sbjct: 270 VATKWIRDQEQDD 282
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 259 bits (663), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 121/223 (54%), Positives = 163/223 (73%), Gaps = 3/223 (1%)
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G GD+ +T + FQ+LS PR++ + NFA+ C +I+ A+ L S LAL++GET++
Sbjct: 116 GALGDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALKRGETLET 175
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T+ IRTSSG F+++ ++SG L +EEK+A+ T +P +GEA+NILRY+IGQKY+SHYD
Sbjct: 176 TKNIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDM 235
Query: 187 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC-IGLKVKPR 243
FDP +YGPQ+SQRVASFL+YLT +EGGET+FP E NG+ DY C GLKVKPR
Sbjct: 236 FDPSQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSCEAGLKVKPR 295
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+GD LLF+S+ PN T D +S+HG CPV+ G K+VATKWI D
Sbjct: 296 KGDALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIHDNR 338
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 115/223 (51%), Positives = 153/223 (68%), Gaps = 5/223 (2%)
Query: 68 QAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
++G+D + P Q++SW PR + +P F PE+CK + +AK L PS LALR E T
Sbjct: 98 ESGNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQET 157
Query: 128 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 187
+ +RTS G F+S +D +G + +EEK A+VT LP +GE FN+LRY+ GQ Y+SHYD F
Sbjct: 158 ENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIF 217
Query: 188 DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD----GSYDYQKC-IGLKVKP 242
+P+ YGPQ SQR+A+ L YLTD+EEGGET+FP E D ++Y+ C G K KP
Sbjct: 218 EPESYGPQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKP 277
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
R GD L+FYS+ PNGT D ++HG CPV+ GEKWVATKWIRD+
Sbjct: 278 RMGDALMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIRDK 320
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 112/226 (49%), Positives = 154/226 (68%), Gaps = 5/226 (2%)
Query: 64 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 123
+P ++G D+V IPFQ+LSW PR + FP F + + +I +A + PS LA R GET
Sbjct: 2 LPAAESGSDNVYVIPFQILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYRPGET 61
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
VD +Q RTS+G F++AA D G L +E++IA T+LP NGEAFN+L Y+ Q Y+SH
Sbjct: 62 VDPSQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSH 121
Query: 184 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLK 239
YD FDP+E+GPQ SQR+A+ L+YL+++ EGGET+F E G++ + D++ C K
Sbjct: 122 YDTFDPKEFGPQPSQRIATVLLYLSEVLEGGETVFKRE-GVDGENRVIGDWRNCDDGSFK 180
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
PR GD +LF+ PNG IDP ++HG CPV +GEKWVATKWIR +
Sbjct: 181 YMPRMGDAVLFWGTKPNGDIDPHALHGGCPVKRGEKWVATKWIRSR 226
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 109/225 (48%), Positives = 148/225 (65%), Gaps = 3/225 (1%)
Query: 64 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 123
+P +G D IPFQ+LSW PR + FP F + + I+ +A + PS LA R GE
Sbjct: 2 LPAAASGSDVTYIIPFQILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQ 61
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
V+++Q RTS+G F+S+ D G L +E++IA T+LP NGEAFN+L Y+ Q Y+SH
Sbjct: 62 VESSQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSH 121
Query: 184 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY-DYQKCI--GLKV 240
D+FDP+++GPQ SQR+A+ L+YL+++ EGGET+F E AD D++ C K
Sbjct: 122 MDSFDPKDFGPQPSQRIATVLLYLSEVLEGGETVFKKEGVDGADRPIQDWRNCDDGSFKY 181
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
PR GD +LF+ PNG IDP S+HG CPV KGEKWVATKWIR +
Sbjct: 182 APRMGDAVLFWGTRPNGEIDPHSLHGGCPVKKGEKWVATKWIRSR 226
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 219 bits (559), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 110/225 (48%), Positives = 145/225 (64%), Gaps = 15/225 (6%)
Query: 64 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 123
+ HG++G T P Q+LS PRA P F + +QC +I MA+ L PS LA + G+T
Sbjct: 116 LEHGESGHSFYTVQPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDT 175
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
+NT+ ED G L IE+K+A VTM+P +GE FN+LRY+ Q Y+SH
Sbjct: 176 AENTRD------------EDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSH 223
Query: 184 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC-IGLKV 240
YD+F +EYGPQ SQR+A+ L+YL D+EEGGET+F E G+ DY+ C G+KV
Sbjct: 224 YDSFSEEEYGPQFSQRIATVLLYLADVEEGGETVFLLEGKGGLARLERIDYKACDTGIKV 283
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
KPRQGD LLF+S+ NGT+D S+HG CPVV G KW TKWIR++
Sbjct: 284 KPRQGDALLFFSVSVNGTLDKHSLHGGCPVVAGTKWAMTKWIRNR 328
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 97/120 (80%), Positives = 111/120 (92%), Gaps = 1/120 (0%)
Query: 168 AFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD 227
A+N+LRY++GQKYNSHYDAF P EYGPQKSQR+ASFL+YL+D+EEGGETMFP+EN N D
Sbjct: 1 AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYEND-NID 59
Query: 228 GSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+YDY +CIGLKVKPRQGDGLLFYSL NGTIDPTSIHGSCPV+KGEKWVATKWIR++EQ
Sbjct: 60 SNYDYVQCIGLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWIRNEEQ 119
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 142/231 (61%), Gaps = 7/231 (3%)
Query: 64 MPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 123
M G +G+ IPFQ+LS PR FPNF + + II +A + PS LA R GE
Sbjct: 200 MTPGDSGEAFYRTIPFQILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQ 259
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
V+ Q +RTS G F+ D S L +E KIA VT +PR NGE +N+L YK Q Y+SH
Sbjct: 260 VEAEQQVRTSKGTFLGG--DSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSH 317
Query: 184 YDAFDPQEYGPQKSQRVASFLVYLTDLE-EGGETMFPFENGMNADGSY-DYQKCI---GL 238
D+FDP+EYG Q SQR+A+ +V L+D GGET+F E N D ++ C GL
Sbjct: 318 MDSFDPKEYGQQYSQRIATVIVVLSDEGLVGGETVFKREGKANIDKPITNWTDCDADGGL 377
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+ KPR GD +LF+S P+G +D ++HGSCPVV G KWVA KWIR++ Y+
Sbjct: 378 RYKPRAGDAVLFWSAFPDGRLDQHALHGSCPVVTGNKWVAVKWIRNKGSYN 428
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
Q++SW PRA + NF T ++C+ +IN AK PS ++K VDN G +RTS
Sbjct: 68 QIVSWEPRAFLYHNFLTKKECEHLINTAK----PS---MQKSSVVDNETGKSKDSSVRTS 120
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F+ DE + IE++IA T +P NGE+FN+LRY++GQKY+ H D F
Sbjct: 121 SGTFLDRGGDE--IVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNT 178
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP G + + + C GL +KP+ GD LL
Sbjct: 179 VNGGQRIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSIKPKMGDALL 238
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+GT+DP+S+HG+CPV+KG+KW TKW+R E
Sbjct: 239 FWSMKPDGTLDPSSLHGACPVIKGDKWSCTKWMRINE 275
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 137/217 (63%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD+ G +RTS
Sbjct: 96 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 148
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F
Sbjct: 149 SGTFLRRGQDK--VIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 206
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP N + + +C G+ VKP+ GD LL
Sbjct: 207 KNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 266
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+GT+DPTS+HG CPV+KG+KW +TKWIR E
Sbjct: 267 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 303
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 140/221 (63%), Gaps = 9/221 (4%)
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQG 129
DDS +++SW PRA + NF T E+CK +I +AK ++ ST+ K G++ D+
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 127
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
+RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQKY HYD F
Sbjct: 128 VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 185
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQG 245
+ QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VKP+ G
Sbjct: 186 EYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMG 245
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
D LLF+S+ P+ T+DP+S+HG C V+KG KW +TKW+R E
Sbjct: 246 DALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHE 286
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 139/222 (62%), Gaps = 9/222 (4%)
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQ 128
GDD +V+SW PRA + NF T E+C+ +IN+AK N++ ST+ G + D+
Sbjct: 67 GDDGKGERWAEVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSETGRSKDSR- 125
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 188
+RTSSG F+S D+ + IE++IA + +P +GE +L Y++GQKY H+D F+
Sbjct: 126 -VRTSSGTFLSRGRDKK--IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFN 182
Query: 189 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQ 244
+ QRVA+ L+YL+D+EEGGET+FP G + + + +C GL VKP
Sbjct: 183 DEFNTKNGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNM 242
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
GD LLF+S+ P+ T+DP+S+HG CPV+ G KW ATKW+R E
Sbjct: 243 GDALLFWSMKPDATLDPSSLHGGCPVINGNKWSATKWMRVNE 284
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 137/217 (63%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD+ G +RTS
Sbjct: 90 EVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 142
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F
Sbjct: 143 SGTFLRRGQDK--IVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 200
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP N + + +C G+ VKP+ GD LL
Sbjct: 201 KNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 260
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+GT+DPTS+HG CPV+KG+KW +TKWIR E
Sbjct: 261 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 297
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 143/227 (62%), Gaps = 12/227 (5%)
Query: 68 QAGDDSVTNIPFQ---VLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGET 123
Q+ + V N P Q +LSW PRA + NF + E+C+ +I +AK + + S + + G++
Sbjct: 65 QSAESLVENPPEQWTEILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKS 124
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
++ +RTSSG+F+ +D+ + IE++IA T +P NGE IL Y++GQKY H
Sbjct: 125 TESR--VRTSSGMFLKRGKDK--IVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPH 180
Query: 184 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLK 239
YD F + QR+A+ L+YL+D+EEGGET+FP N + + D +C GL
Sbjct: 181 YDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLS 240
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
VKP+ GD LLF+S+ P+ T+DP+S+HG CPV+KG KW +TKW+ +E
Sbjct: 241 VKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHLRE 287
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 143/222 (64%), Gaps = 14/222 (6%)
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G+ GD V + +SW PRA + NF + E+C +I++AK ++ ST+ ET N
Sbjct: 32 GKRGDQWV-----EFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVV--DNETGKN 84
Query: 127 TQ-GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+ +RTSSG+F++ +D+ + IE++IA T +P +GE IL Y++GQKY++HYD
Sbjct: 85 VEDSVRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 142
Query: 186 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVK 241
FD + + QR+A+ L+YL+D+EEGGET+FP G + + + KC GL VK
Sbjct: 143 FFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVK 202
Query: 242 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+ GD LLF+S+ P+ T+DPTS+HG+CPV++G KW TKWI
Sbjct: 203 PKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIH 244
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 128/208 (61%), Gaps = 9/208 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
+SW PRA + F T E+C +I++AK L+ S +A + T +RTSSG+FI A
Sbjct: 36 ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGN-SKTSEVRTSSGMFIPKA 94
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+D + IEEKIA T LP+ NGE +LRY+ GQKY HYD F + + R+A
Sbjct: 95 KDP--IVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDYFVDKVNIARGGHRLA 152
Query: 202 SFLVYLTDLEEGGETMFP------FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
+ L+YLT++E+GGET+FP M AD S G+ VKPR+GD LLFYSL P
Sbjct: 153 TVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGDALLFYSLHP 212
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
N T DP S+HG CPV++GEKW ATKWI
Sbjct: 213 NATPDPLSLHGGCPVIQGEKWSATKWIH 240
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+ +SW PRA F NF + E+C +I +A+ N++ S + VDN G +RTS
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAV-------VDNQTGKSKDSRVRTS 99
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F+ +DE + IEE+IAK T +P+ +GE +L Y++GQKY++H+D F +
Sbjct: 100 SGTFLRRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNT 157
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
QRVA+ L+YL+D+EEGGET+FP N + + +C G+ VKPR+GD LL
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALL 217
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ +DP S+HG CPV+KG KW ATKW+ +E
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 147/223 (65%), Gaps = 16/223 (7%)
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVD 125
G+ GD V + +SW PRA + NF + E+C +I++AK ++ ST+ K GE+VD
Sbjct: 67 GKRGDQWV-----EFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVD 121
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+ +RTSSG+F++ +D+ + IE++IA T +P +GE IL Y++GQKY++HYD
Sbjct: 122 SR--VRTSSGMFLNRGQDK--IIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 186 AFDPQEYGPQKS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 240
F EY +K QR+A+ L+YL+D+EEGGET+FP G + + + +C GL V
Sbjct: 178 YF-VDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSV 236
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
KP+ GD LLF+S+ P+ T+DPTS+HG+CPV++G KW TKW+
Sbjct: 237 KPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 6/204 (2%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA + F + +C ++ MAK L+ S +A + G++V IRTSSG+F+S
Sbjct: 45 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSV--LSNIRTSSGMFLSK 102
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE ++ IEE+IA T LP+ NGEA +LRY+ G+KY HYD F + R+
Sbjct: 103 GQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRI 160
Query: 201 ASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
A+ L+YL+D+ +GGET+FP E+ D S+ G+ VKPR+GD LLFYSL P+ T
Sbjct: 161 ATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDATP 220
Query: 260 DPTSIHGSCPVVKGEKWVATKWIR 283
D +S+HG CPV++GEKW ATKWI
Sbjct: 221 DESSLHGGCPVIEGEKWSATKWIH 244
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 136/222 (61%), Gaps = 29/222 (13%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
Q++SW PRA + NF T E+C+ +IN+AK PS + K E +D G IRTS
Sbjct: 67 QIISWEPRAFLYHNFLTKEECEHLINIAK----PS---MHKSEVIDEKTGKSLNSSIRTS 119
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F+ DE + IE++IA T +P +GE+FN+L Y++GQKY HYD F
Sbjct: 120 SGTFLDREGDE--IVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYDYFLDTFST 177
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY-------DYQKCI--GLKVKPRQ 244
QR+A+ L+YL+D+EEGGET+FP NA G++ + C GL +KP+
Sbjct: 178 RHAGQRIATMLMYLSDVEEGGETVFP-----NAKGNFSSVPWWNELSDCGKGGLSIKPKM 232
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
G+ +LF+S+ P+ T+DP+S+HG+CPV+KG+KW KW+ E
Sbjct: 233 GNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADE 274
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+ +SW PRA F NF + E+C +I +A+ N++ S + VDN G +RTS
Sbjct: 47 ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAV-------VDNQTGKSKDSRVRTS 99
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F+ +DE + IEE+IAK T +P+ +GE +L Y++GQKY++H+D F +
Sbjct: 100 SGTFLRRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNT 157
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC--IGLKVKPRQGDGLL 249
QRVA+ L+YL+D+EEGGET+FP N + + +C G+ VKPR+GD LL
Sbjct: 158 KNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALL 217
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ +DP S+HG CPV+KG KW ATKW+ +E
Sbjct: 218 FWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 126/206 (61%), Gaps = 9/206 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ--GIRTSSGVFIS 139
+SW PRA +P+F + ++ ++++A+ L+ S +A ET +Q +RTSSG FIS
Sbjct: 54 ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVA---DETSGKSQLSEVRTSSGTFIS 110
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
+D + IE+KIA T LP+ NGE +LRYK G+KY HYD F R
Sbjct: 111 KGKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHR 168
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNG 257
VA+ L+YLTD+ EGGET+FP G +C G+ VKPR+GD LLF++L P+
Sbjct: 169 VATVLLYLTDVAEGGETVFPLAKGRKGSHHKGLSECAQKGIAVKPRKGDALLFFNLRPDA 228
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
DPTS+HG C V+KGEKW ATKWIR
Sbjct: 229 ATDPTSLHGGCEVIKGEKWSATKWIR 254
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 180 bits (456), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 94/205 (45%), Positives = 130/205 (63%), Gaps = 7/205 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA + F + +C ++ MAK L+ S +A + G++V IRTSSG+F+S
Sbjct: 31 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSV--LSNIRTSSGMFLSK 88
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE ++ IEE+IA T LP+ NGEA +LRY+ G+KY HYD F + R+
Sbjct: 89 GQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRI 146
Query: 201 ASFLVYLTDLEEGGETMFPF--ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
A+ L+YL+D +GGET+FP E+ D S+ G+ VKPR+GD LLFYSL P+ T
Sbjct: 147 ATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDALLFYSLHPDAT 206
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIR 283
D +S+HG CPV++GEKW ATKWI
Sbjct: 207 PDESSLHGGCPVIEGEKWSATKWIH 231
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 96/223 (43%), Positives = 144/223 (64%), Gaps = 16/223 (7%)
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G+ GD V + +SW PRA + NF + E+C +I++AK ++ ST+ ET N
Sbjct: 67 GKRGDQWV-----EFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVV--DNETGKN 119
Query: 127 TQ-GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+ +RTSSG+F++ +D+ + IE++IA T +P +GE IL Y++GQKY++HYD
Sbjct: 120 VEDSVRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177
Query: 186 AFDPQEYGPQKS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 240
F EY +K QR+A+ L+YL+D+EEGGET+FP G + + + KC GL V
Sbjct: 178 YF-VDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSV 236
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
KP+ GD LLF+S+ P+ T+DPTS+HG+CPV++G KW TKW+
Sbjct: 237 KPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD + G +RTS
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDASTGGSKDSRVRTS 163
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F +
Sbjct: 164 SGMFLGRGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNT 221
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 222 KNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALL 281
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+G++D TS+HG CPV+KG KW +TKW+R E
Sbjct: 282 FWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 318
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ +D+ + IE++IA T +P GE +L Y++GQKY H+D F
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+E+GGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 267
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+G++DPTS+HG CPV+KG KW +TKW+R E
Sbjct: 268 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 154 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 206
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ +D+ + IE++IA T +P GE +L Y++GQKY H+D F
Sbjct: 207 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 264
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+E+GGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 265 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 324
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+G++DPTS+HG CPV+KG KW +TKW+R E
Sbjct: 325 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 361
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 139/212 (65%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PRA + NF T E+C+ +I++AK +++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSR--VRTSSGTFL 134
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ T+ IE++++ + +P +GE +L Y++GQKY H+D F + Q
Sbjct: 135 PRGRDK--TVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLL 254
R+A+ L+YL+D+EEGGET+FP G + + + C GL VKP++GD LLF+S+
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ ++DP+S+HG CPV+KG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWVRVEE 284
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD + G +RTS
Sbjct: 12 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDASTGGSKDSRVRTS 64
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F +
Sbjct: 65 SGMFLGRGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNT 122
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 123 KNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALL 182
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+G++D TS+HG CPV+KG KW +TKW+R E
Sbjct: 183 FWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 219
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 135/212 (63%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PRA + NF T E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSR--VRTSSGTFL 136
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ D+ + IE+KIA T +P +GE +L Y++GQKY HYD F + Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 199 RVASFLVYLTDLEEGGETMFPFENG--MNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 254
R+A+ L+YLTD+EEGGET+FP G N + C GL +KP++GD LLF+S+
Sbjct: 195 RIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMK 254
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ T+D +S+HG CPV+KG KW +TKWIR E
Sbjct: 255 PDATLDASSLHGGCPVIKGNKWSSTKWIRVNE 286
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 129/207 (62%), Gaps = 10/207 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
+SW PRA + NF T E+C I +AK L S +A + G++V++ +RTSSG+F
Sbjct: 65 ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESE--VRTSSGMFFRK 122
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
A+D+ + +E +IA T LP NGE+ IL Y+ GQKY H+D F + RV
Sbjct: 123 AQDQ--VVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRV 180
Query: 201 ASFLVYLTDLEEGGETMFPFENG--MNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPN 256
A+ L+YL+D+E+GGET+FP A G D+ C G VKPR+GD LLF+SL P+
Sbjct: 181 ATVLMYLSDVEKGGETVFPNSEAKKTQAKGD-DWSDCAKKGYAVKPRKGDALLFFSLHPD 239
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
T DP S+HGSCPV++GEKW ATKWI
Sbjct: 240 ATTDPLSLHGSCPVIEGEKWSATKWIH 266
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/226 (40%), Positives = 140/226 (61%), Gaps = 9/226 (3%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETV 124
H DD +V+SW PRA + NF T E+C+ +I++AK N+ S++ G++
Sbjct: 66 HTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSK 125
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
D+ +RTSSG F++ D+ + IE++IA + +P +GE +L Y++GQKY HY
Sbjct: 126 DSR--VRTSSGTFLARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHY 181
Query: 185 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 240
D F QR+A+ L+YLTD+EEGGET+FP G + + + +C GL +
Sbjct: 182 DYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSI 241
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
KP++GD LLF+S+ P+ T+DP+S+HG CPV+KG KW +TKW+R E
Sbjct: 242 KPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVSE 287
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 133/216 (61%), Gaps = 28/216 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + F + +C +IN+AK ++ S +A DN G +RTSSG
Sbjct: 38 LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVA-------DNDSGKSLMSQVRTSSG 90
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F++ EDE + IE+++A T LP N E+ +LRY+IGQKY++H+D F +
Sbjct: 91 AFLAKHEDE--IVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKH 148
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDG 247
QR A+ L+YLTD+++GGET+FP NA+GS+ + +C GL VKP++GD
Sbjct: 149 GGQRFATVLMYLTDVKKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDA 203
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LLF+ L N T D +S+HGSCPV++GEKW ATKWI
Sbjct: 204 LLFFGLHLNATTDTSSLHGSCPVIEGEKWSATKWIH 239
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 139/212 (65%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PRA + NF T E+C+ +I++AK +++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSR--VRTSSGTFL 134
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ T+ IE++++ + +P +GE +L Y++GQKY H+D F + Q
Sbjct: 135 PRGRDK--TVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLL 254
R+A+ L+YL+D+EEGGET+FP G + + + C GL VKP++GD LLF+S+
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ ++DP+S+HG CPV+KG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWMRVEE 284
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 92/207 (44%), Positives = 131/207 (63%), Gaps = 9/207 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLAL--RKGETVDNTQGIRTSSGVFIS 139
LSW PRA + F + E+C +IN+AK L S + GE++D+ + RTSSGVF++
Sbjct: 35 LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEE--RTSSGVFLT 92
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
+D+ + +E K+A T LP NGEA IL Y+ GQKY+ H+D + +E R
Sbjct: 93 KRQDD--IVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHR 150
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPN 256
+A+ L+YL+++ +GGET+FP G D + +C G VKPR+GD LLF++L PN
Sbjct: 151 IATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECAKQGYAVKPRKGDALLFFNLHPN 210
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
T DPTS+HGSCPV++GEKW AT+WI
Sbjct: 211 ATTDPTSLHGSCPVIEGEKWSATRWIH 237
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 122/184 (66%), Gaps = 7/184 (3%)
Query: 111 LRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFN 170
+ PS LA R GE + Q +RTS G F+ D S L +E+KIA VT+LPR NGE +N
Sbjct: 1 MYPSGLAYRPGEKAEAEQQVRTSKGTFLGG--DSSPALRWLEDKIAAVTLLPRTNGEFWN 58
Query: 171 ILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLE-EGGETMFPFENGMNADGS 229
+L YK Q Y+SH D+FDP+EYGPQ SQR+A+ +V L+D GGET+F E + +
Sbjct: 59 VLNYKHSQHYDSHMDSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKP 118
Query: 230 Y-DYQKCI---GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
++ C GLK KPR GD +LF+S P+G +DP ++HGSCPVV G KWVA KW+R++
Sbjct: 119 ISNWTDCDADGGLKYKPRAGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLRNK 178
Query: 286 EQYD 289
+YD
Sbjct: 179 GEYD 182
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 177 bits (449), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGASKDSRVRTS 149
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ +D+ + IE++IA T +P +GE +L Y++GQKY H+D F
Sbjct: 150 SGMFLRRGQDK--IIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+E+GGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 267
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+G++D TS+HG CPV+KG KW +TKW+R E
Sbjct: 268 FWSMKPDGSMDSTSLHGGCPVIKGNKWSSTKWMRVHE 304
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 129/211 (61%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + F + +C ++ +AK L+ S +A DN G +RTSSG
Sbjct: 34 LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVA-------DNDSGKSVMSQVRTSSG 86
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F++ EDE + IE+++A T LP N E+ +L Y++GQKY++H+D F +
Sbjct: 87 TFLNKHEDE--IISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKL 144
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
RVA+ L+YLTD+++GGET+FP G + D ++ GL VKPR+GD LLF+S
Sbjct: 145 GGHRVATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFS 204
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L N T DP+S+HGSCPV++GEKW ATKWI
Sbjct: 205 LHINATTDPSSLHGSCPVIEGEKWSATKWIH 235
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 133/213 (62%), Gaps = 11/213 (5%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VLSW PRA + NF + E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSR--VRTSSGTFL 134
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ + IE++IA T +P +GE +L Y+ GQKY HYD F + Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-----GLKVKPRQGDGLLFYSL 253
R+A+ L+YL+D+EEGGET+FP N MN Y + GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSM 251
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ T+DPTS+HG CPV++G KW +TKWI E
Sbjct: 252 RPDATLDPTSLHGGCPVIRGNKWSSTKWIHVGE 284
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 176 bits (447), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 127/207 (61%), Gaps = 11/207 (5%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
+SW PRA F +C+ +I+ AK ++ ST+ G+++D+T +RTS+G F
Sbjct: 86 VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDST--VRTSTGTFFGR 143
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 198
EDE + IE +I+ +T LP +NGE IL Y+ GQKY +H+D F D P+ Q
Sbjct: 144 EEDE--VIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGGQ 201
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPN 256
R+A+ L+YLT EEGGET+FP N + +C G VK R+GD LLFYSLLPN
Sbjct: 202 RIATVLMYLTTAEEGGETVFPM--AANKVTGPQWSECARGGAAVKSRRGDALLFYSLLPN 259
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
G DPTS+HGSCP KGEKW ATKWI
Sbjct: 260 GETDPTSLHGSCPTTKGEKWSATKWIH 286
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 133/213 (62%), Gaps = 11/213 (5%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VLSW PRA + NF + E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSR--VRTSSGTFL 134
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ + IE++IA T +P +GE IL Y+ GQKY HYD F + Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-----GLKVKPRQGDGLLFYSL 253
R+A+ L+YL+D+EEGGET+FP N MN Y + GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSM 251
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ T+DPTS+HG CPV++G KW +TKW+ E
Sbjct: 252 RPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGE 284
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 130/208 (62%), Gaps = 13/208 (6%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 140
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 198
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 199 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 255
RV + L+YLT +EEGGET+ P E + DG + +C GL VKP +GD L+FYSL P
Sbjct: 141 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALMFYSLKP 197
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 198 DGSNDPASLHGSCPTLKGDKWSATKWIH 225
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 130/211 (61%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + F + ++C ++N+AK + S +A DN G +RTSSG
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVA-------DNDSGKSIMSQVRTSSG 92
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+S ED+ + IE+++A T LP N E+ IL Y++GQKY++H+D F + +
Sbjct: 93 TFLSKHEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKR 150
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMN---ADGSYDYQKCIGLKVKPRQGDGLLFYS 252
RVA+ L+YLTD+++GGET+FP G + D ++ GL VKP++GD LLF+S
Sbjct: 151 GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFS 210
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L N T DP S+HGSCPV++GEKW ATKWI
Sbjct: 211 LHVNATTDPASLHGSCPVIEGEKWSATKWIH 241
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 130/211 (61%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + F + ++C ++N+AK + S +A DN G +RTSSG
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVA-------DNDSGKSIMSQVRTSSG 92
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+S ED+ + IE+++A T LP N E+ IL Y++GQKY++H+D F + +
Sbjct: 93 TFLSKHEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKR 150
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMN---ADGSYDYQKCIGLKVKPRQGDGLLFYS 252
RVA+ L+YLTD+++GGET+FP G + D ++ GL VKP++GD LLF+S
Sbjct: 151 GGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFS 210
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L N T DP S+HGSCPV++GEKW ATKWI
Sbjct: 211 LHVNATTDPASLHGSCPVIEGEKWSATKWIH 241
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 130/208 (62%), Gaps = 13/208 (6%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 140
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 17 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 74
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 198
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 75 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132
Query: 199 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 255
RV + L+YLT +EEGGET+ P E + DG + +C GL VKP +GD L+FYSL P
Sbjct: 133 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALMFYSLKP 189
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 190 DGSNDPASLHGSCPTLKGDKWSATKWIH 217
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 129/206 (62%), Gaps = 9/206 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 140
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 45 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 102
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 198
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 103 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160
Query: 199 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RV + L+YLT +EEGGET+ P E + DG + K GL VKP +GD L+FYSL P+G
Sbjct: 161 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAK-RGLAVKPIKGDALMFYSLKPDG 219
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 220 SNDPASLHGSCPTLKGDKWSATKWIH 245
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 130/208 (62%), Gaps = 13/208 (6%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 140
LSW PRA NF + E+C I+ A+ + + S + G++VD+ IRTS+G + +
Sbjct: 16 LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 73
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 198
ED + IE+++A+VTM+P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 74 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131
Query: 199 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 255
RV + L+YLT +EEGGET+ P E + DG + +C GL VKP +GD L+FYSL P
Sbjct: 132 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALMFYSLKP 188
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 189 DGSNDPASLHGSCPTLKGDKWSATKWIH 216
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 133/213 (62%), Gaps = 11/213 (5%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VLSW PRA + NF + E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 77 EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSR--VRTSSGTFL 134
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ + IE++IA T +P +GE +L Y+ GQKY HYD F + Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-----GLKVKPRQGDGLLFYSL 253
R+A+ L+YL+D+EEGGET+FP N MN Y + GL VKPR GD LLF+S+
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAAN-MNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSM 251
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ T+DPTS+HG CPV++G KW +TKW+ E
Sbjct: 252 RPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGE 284
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 134/214 (62%), Gaps = 19/214 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 138
Q++SW PRA + NF T E+C+ +IN+AK ++ S + + G VD+ + RTSSG F+
Sbjct: 116 QIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRE--RTSSGAFL 173
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D + IE +IA T +P +GE FN+L Y++GQKY HYD F Q
Sbjct: 174 KRGSDR--IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQ 231
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY-------DYQKC--IGLKVKPRQGDGLL 249
R+A+ L+YL+D+EEGGET+FP NA G++ + C GL +KP+ G+ +L
Sbjct: 232 RIATMLMYLSDVEEGGETVFP-----NAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAIL 286
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F+S+ P+ T+DP+S+HG+CPV+KG+KW+ KW+
Sbjct: 287 FWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMH 320
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 138/226 (61%), Gaps = 9/226 (3%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETV 124
H +A +D + +V+SW PRA + NF T E+C+ +IN+A +++ ST+A + G++V
Sbjct: 61 HIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSV 120
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
+R S+G F+ +DE + IE++IA VT +P NGE ++ Y++GQ Y+ HY
Sbjct: 121 --VHDVRKSTGAFLDRGQDE--IVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHY 176
Query: 185 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 240
D F QR+A+ L+YL+++EEGGETMFP + + + C +GL +
Sbjct: 177 DYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCGKMGLSI 236
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
KP+ GD LLF+S+ PN T+D ++H +CPV+KG KW TKW+ E
Sbjct: 237 KPKMGDALLFWSMKPNATLDALTLHSACPVIKGNKWSCTKWMHPTE 282
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/214 (43%), Positives = 132/214 (61%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C +I++AK L+ S++A DN G +RTSSG
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVA-------DNLSGKSKVSEVRTSSG 96
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FI A+D + IE+KIA T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 97 AFIHKAKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIAR 154
Query: 196 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+D+E+GGET+FP + ++ + D C G+ VKPR+GD LL
Sbjct: 155 GGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALL 214
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F+SL PN D +S+HG CPV++GEKW ATKWIR
Sbjct: 215 FFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIR 248
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 96/276 (34%), Positives = 162/276 (58%), Gaps = 18/276 (6%)
Query: 21 FVFLACLFFFLAGLLGSSLLS---QDVTAARPS--ARVVESVKDEYKWMPHGQAGDDSVT 75
F L F + L+G +LS + +++RP +V+++++ + D
Sbjct: 21 FTVLILGLFVILILVGLGILSLPNTNKSSSRPMDLTTIVQTIEERESYGDEEDGNGDRW- 79
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSS 134
+V+SW PRA + NF T E+C+ +I++AK ++ S + ++ G+++D+ +RTSS
Sbjct: 80 ---LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSR--VRTSS 134
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G F+ DE ++ IE +I+ T +P NGE +L Y++GQKY H+D F +
Sbjct: 135 GTFLKRGHDE--IVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVR 192
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLF 250
+ QR+A+ L+YL+D++EGGET+FP G +D + + +C GL V P++ D LLF
Sbjct: 193 KGGQRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSVLPKKRDALLF 252
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+S+ P+ ++DP+S+HG CPV+KG KW +TKW E
Sbjct: 253 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/210 (41%), Positives = 134/210 (63%), Gaps = 11/210 (5%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 138
++LSW PRA + NF + E+C+ +IN+AK ++ ST+ K G + D+ +RTSSG+F+
Sbjct: 78 EILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSR--VRTSSGMFL 135
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D + IE++IA + +P +GE +L Y++GQKY +H+D F + Q
Sbjct: 136 RRGRDR--VIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQ 193
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGS---YDYQKCI--GLKVKPRQGDGLLFYSL 253
R A+ L+YL+D+EEGGET+FP N MN + +C GL +KP+ G+ LLF+S
Sbjct: 194 RTATLLMYLSDVEEGGETVFPAAN-MNISAVPWWNELSECAKQGLSLKPKMGNALLFWST 252
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+ T+DP+S+HGSCPV++G KW ATKW+
Sbjct: 253 RPDATLDPSSLHGSCPVIRGNKWSATKWMH 282
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 131/206 (63%), Gaps = 9/206 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 140
LSW PRA F + E+C+ II AK + + S + G++VD+ IRTS+G +++
Sbjct: 57 LSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSE--IRTSTGAWLAK 114
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 198
EDE + IE+++A+VTM+P N E +L Y GQKY HYD F DP P+ Q
Sbjct: 115 GEDE--IISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172
Query: 199 RVASFLVYLTDLEEGGETMFPF-ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RV + L+YLT +EEGGET+ P + ++ +G + K GL VKP +GD L+FYSL P+G
Sbjct: 173 RVVTVLMYLTTVEEGGETVLPHADQKVSGEGWSECAK-RGLAVKPVKGDALMFYSLKPDG 231
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 232 SNDPASLHGSCPTLKGDKWSATKWIH 257
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 137/212 (64%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+++SW PRA + NF + E+C+ +I++AK ++ ST+ G + D+ +RTSSG+F+
Sbjct: 78 EIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSR--VRTSSGMFL 135
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ + IE++IA T +P +GE +L Y++GQKY++HYD F + Q
Sbjct: 136 RRGRDK--IIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLL 254
R+A+ L+YL+D+EEGGET+FP + + + +C GL VKP+ GD LLF+S+
Sbjct: 194 RIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMR 253
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ T+DP+S+HG CPV+KG KW +TKW+ +E
Sbjct: 254 PDATLDPSSLHGGCPVIKGNKWSSTKWMHVEE 285
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 138/217 (63%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PRA + NF T E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSR--VRTSSGTFL 136
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ D+ + IE+KI+ T +P +GE +L Y++GQKY HYD F Q
Sbjct: 137 ARGRDK--IVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 194
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ-------KC--IGLKVKPRQGDGLL 249
R+A+ L+YLTD+EEGGET+FP A G++ + +C GL +KP++GD LL
Sbjct: 195 RIATVLMYLTDVEEGGETVFP-----AAKGNFSFVPWWNELFECGKKGLSIKPKRGDALL 249
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ ++DP+S+HG CPV+KG KW +TKW+R E
Sbjct: 250 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWMRVSE 286
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 130/214 (60%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 42 VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA-------DNESGKSKLSEVRTSSG 94
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+FI+ A+D + IE+KIA T LPR NGE +LRY+ GQKY+ HYD F + +
Sbjct: 95 MFITKAKDP--IVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFSDKVNIAR 152
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGM---NADGSY-DYQKCI--GLKVKPRQGDGLL 249
RVA+ L+YLTD+E+GGET+FP + A S+ D +C G+ VKPR+GD LL
Sbjct: 153 GGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGDALL 212
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F+SL P D +SIH CPV++GEKW ATKWI
Sbjct: 213 FFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWIH 246
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + ++C+ +I +AK +++ ST+ VD++ G +RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F++ +D+ + IE++++ T LP +GE IL Y++GQKY HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP G + + + C GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALL 248
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ ++DP+S+HG CPV+KG KW +TKWIR E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 173 bits (439), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + E+C +I +AK ++ ST+ VD+T G +RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ T+DP S+HG CPV+KG KW +TKW+ +E
Sbjct: 271 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVRE 307
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 128/206 (62%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA + F + E+C +IN+AK L S +A + GE++++ + RTSSG+FI
Sbjct: 21 LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQE--RTSSGMFIFK 78
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
EDE ++ IE +IA T LP NGE ILRY+ GQKY +H D F + + R
Sbjct: 79 TEDE--IVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRA 136
Query: 201 ASFLVYLTDLEEGGETMFPF---ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+D+++GGET+FP E D S+ G VKP +GD LLF+SL P+
Sbjct: 137 ATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCAKKGYAVKPNKGDALLFFSLHPDA 196
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T DP S+H SCPV++GEKW ATKWI
Sbjct: 197 TPDPGSLHASCPVIEGEKWSATKWIH 222
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 136/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + ++C+ +I +AK +++ ST+ VD++ G +RTS
Sbjct: 78 EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F++ +D+ + IE++++ T LP +GE IL Y++GQKY HYD F
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP G + + + C GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALL 248
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ ++DP+S+HG CPV+KG KW +TKWIR E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PRA + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D+ + +E K+A T +P NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 135/211 (63%), Gaps = 7/211 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+V+SW PRA+ + NF + E+C+ +IN+AK ++ ST+ K +++ +RTSSG F+
Sbjct: 81 EVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
DE +++IE++I+ T +P NGE +L Y++GQKY HYD F + QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLP 255
+A+ L+YL+D+++GGET+FP G + + + KC GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+ ++DP+S+HG CPVVKG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 88/210 (41%), Positives = 126/210 (60%), Gaps = 17/210 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PR + +F + ++ +I++A+ L+ S +A DN G +RTSSG
Sbjct: 54 ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLSDVRTSSG 106
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+ +D ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 107 TFLRKGQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIR 164
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSL 253
R A+ L+YLTD+ EGGET+FP ++ + +C G+ VKPR+GD LLF++L
Sbjct: 165 GGHRYATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQKGIAVKPRKGDALLFFNL 224
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+GT DP S+HG C V++GEKW ATKWIR
Sbjct: 225 KPDGTTDPVSLHGGCAVIRGEKWSATKWIR 254
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 135/218 (61%), Gaps = 21/218 (9%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + E+C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 248
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRMATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSECAKRGLSVKPKMGDAL 266
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
LF+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 267 LFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 304
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 94/264 (35%), Positives = 155/264 (58%), Gaps = 19/264 (7%)
Query: 28 FFFLAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPR 87
F L +S + D+T + + ES DE +D + +V+SW PR
Sbjct: 39 IFSLPSTNKTSSMPMDLTTIVQTIQERESFGDE----------EDGNGDRWLEVISWEPR 88
Query: 88 ALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDESG 146
A + NF T E+C+ +I++AK ++ S + ++ G+++D+ +RTSSG F++ DE
Sbjct: 89 AFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSR--VRTSSGTFLNRGHDE-- 144
Query: 147 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVY 206
++ IE +I+ T +P NGE +L Y++GQ+Y H+D F + + QR+A+ L+Y
Sbjct: 145 IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMY 204
Query: 207 LTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLPNGTIDPT 262
L+D++EGGET+FP G +D + + +C GL V P++ D LLF+S+ P+ ++DP+
Sbjct: 205 LSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPS 264
Query: 263 SIHGSCPVVKGEKWVATKWIRDQE 286
S+HG CPV+KG KW +TKW E
Sbjct: 265 SLHGGCPVIKGNKWSSTKWFHVHE 288
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 93/228 (40%), Positives = 135/228 (59%), Gaps = 19/228 (8%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
+ A +V+ + +SW PRA + F + E+C +I++AK L+ S +A D
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVA-------D 76
Query: 126 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 179
N G +RTSSG+FI +D + IE+KIA T LP+ NGE +LRY+ GQK
Sbjct: 77 NVSGKSRLSEVRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQK 134
Query: 180 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD--GSYDYQKCI- 236
Y++HYD F + + R+A+ L+YL+D+ +GGET+FP ++ + D +C
Sbjct: 135 YDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECAR 194
Query: 237 -GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKPR+GD LLF+SL P DP S+HG CPV++GEKW ATKWI
Sbjct: 195 KGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIH 242
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D+ ++ +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 135/211 (63%), Gaps = 7/211 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+V+SW PRA+ + NF T E+C+ +I++AK ++ ST+ K +++ +RTSSG F+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
DE +++IE++I+ T +P NGE +L Y++GQKY HYD F + QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLP 255
+A+ L+YL+D+++GGET+FP G + + + KC GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+ ++DP+S+HG CPVVKG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 135/218 (61%), Gaps = 21/218 (9%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + ++C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 248
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSDCAKRGLSVKPKMGDAL 266
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
LF+S+ P+ T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 267 LFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 125/206 (60%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 75 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 132
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D+ + +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 133 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 190
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 191 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 250
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 251 TTDSNSLHGSCPVVEGEKWSATRWIH 276
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/223 (39%), Positives = 135/223 (60%), Gaps = 19/223 (8%)
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG- 129
DD +++SW PR + NF E+C+ +IN+AK +++ ST+ VD+T G
Sbjct: 62 DDDDNKRWVEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTV-------VDDTTGK 114
Query: 130 -----IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
RTSSG FI D+ L IE++IA T +P +GE NIL Y++GQKY+ H
Sbjct: 115 SVNSSARTSSGTFIDRGYDK--ILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHT 172
Query: 185 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 240
D F+ + +R+A+ L+YL+D+EEGGET+FP G + + + C GL +
Sbjct: 173 DYFEDEVNTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGLSI 232
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
KP+ G+ +LF+ + P+ T+DP S+HG+CPV+KG+KW TKW+R
Sbjct: 233 KPKMGNAILFWGMKPDATVDPLSVHGACPVIKGDKWSCTKWMR 275
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/215 (42%), Positives = 131/215 (60%), Gaps = 22/215 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C +I++AK L+ S++A DN G +RTSSG
Sbjct: 44 ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVA-------DNLSGKSKVSEVRTSSG 96
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FI A+D + IE+KIA T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 97 AFIHKAKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIAR 154
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMN-----ADGSYDYQKCI--GLKVKPRQGDGL 248
R+A+ L+YL+D+E+GGET+F + ++ + D C G+ VKPR+GD L
Sbjct: 155 GGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDAL 214
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LF+SL PN D +S+HG CPV++GEKW ATKWIR
Sbjct: 215 LFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIR 249
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 125/206 (60%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D+ + +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L PN
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPVV+GEKW AT+WI
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 136/218 (62%), Gaps = 16/218 (7%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVD---------NTQG 129
+VLSW PRA+ + NF E+C+ +IN+AK ++ ST+ G++ D N
Sbjct: 82 EVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSR 141
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
+RTSSG+F++ +D+ T+ IE++IA T +P +GE +L Y++GQKY H+D F
Sbjct: 142 VRTSSGMFLNRGQDK--TIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLD 199
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC--IGLKVKPRQG 245
+ QR+A+ L+YL+D+E+GGET+FP N + + +C G+ V+PR G
Sbjct: 200 EFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMG 259
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
D LLF+S+ P+ +DP+S+H CPV++G+KW ATKWI
Sbjct: 260 DALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWIH 297
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 133/217 (61%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + E+C+ +I +AK + ST+ VD+T G +RTS
Sbjct: 108 EVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV-------VDSTTGKSKDSRVRTS 160
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + IE +IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 161 SGMFLRRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 218
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 219 KNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALL 278
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 279 FWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGE 315
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+V+SW PRA+ + NF T E+C+ +I++AK ++ ST+ K +++ +RTSSG F+
Sbjct: 81 EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
DE +++IE++I+ T +P NGE +L Y++GQKY HYD F + QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLP 255
+A+ L+YL+D+++GGET+FP G + + + KC GL V P+ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMRP 257
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+ ++DP+S+HG CPVVKG KW +TKW E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/216 (43%), Positives = 134/216 (62%), Gaps = 15/216 (6%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P + + P+A F NF T E+C+ +I +AK L PST+ G+ GIRTS+G+F
Sbjct: 228 PLVLSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGKK-STKSGIRTSAGMF 286
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK 196
++ + ++ T+ ++EE++A LP NGE ILRY+ GQKY+ HYD F D P +
Sbjct: 287 LT--KGQTPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINPSPNR 344
Query: 197 S-QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK------CI--GLKVKPRQGDG 247
QR+A+ L+YL D EEGGET+FP N +G +D +K C GL VK ++GD
Sbjct: 345 GGQRMATMLIYLKDTEEGGETIFP--NAKKPEGFHDGEKDGAFSDCAKRGLPVKSKRGDA 402
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+LF+SL + +D S+HG+CPV++GEKW A KWIR
Sbjct: 403 VLFWSLTSDYKLDEGSLHGACPVLRGEKWTAVKWIR 438
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/218 (41%), Positives = 134/218 (61%), Gaps = 21/218 (9%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + ++C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 248
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSDCAKRGLSVKPKMGDAL 266
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
LF+S+ P T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 267 LFWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 132/208 (63%), Gaps = 9/208 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 138
+++SW PRA + NF + E+C+ +I +AK ++ ST+ K G + D+ +RTSSG+F+
Sbjct: 79 EIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSR--VRTSSGMFL 136
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ + IE++IA + +P +GE +L Y++GQKY +HYD F + Q
Sbjct: 137 RRGRDK--IIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQ 194
Query: 199 RVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLL 254
R A+ L+YL+D+EEGGET+FP N N + +C GL VKP+ G+ LLF+S
Sbjct: 195 RTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTR 254
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWI 282
P+ T+DP S+HGSCPV++G KW ATKW+
Sbjct: 255 PDATLDPASLHGSCPVIRGNKWSATKWM 282
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 135/217 (62%), Gaps = 19/217 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + E+C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + +IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP N + + + +C GL VKP+ GD LL
Sbjct: 209 KNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALL 268
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 269 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 305
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/206 (44%), Positives = 130/206 (63%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA + F + E+C +I++A+ L S +A + G+++++ +RTSSG+FI+
Sbjct: 52 LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESE--VRTSSGMFIAK 109
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
A+DE + IE +IA T LP NGE+ IL Y+ GQKY H+D F + RV
Sbjct: 110 AQDE--IVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRV 167
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G + D + C G VKP +GD LLF+SL P+
Sbjct: 168 ATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDA 227
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV++GEKW ATKWI
Sbjct: 228 TTDSDSLHGSCPVIEGEKWSATKWIH 253
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 138/230 (60%), Gaps = 19/230 (8%)
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G D+ +V+SW PRA + NF T +C+ +IN+AK ++ ST+ VD+
Sbjct: 68 GSGDDEGKAEQWAEVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTV-------VDS 120
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
+ G +RTSSG F+ D+ + IE++IA + +P +GE IL Y++GQ+Y
Sbjct: 121 STGKSKDSKVRTSSGTFLPRGRDK--IVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRY 178
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI-- 236
H+D F + QR+A+ L+YL+D+EEGGET+FP G + + + +C
Sbjct: 179 EPHFDYFMDEYNTKNGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKG 238
Query: 237 GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
GL VKP+ GD LLF+S+ P+G+ DP+S+HG CPV++G KW +TKW+R E
Sbjct: 239 GLSVKPKMGDALLFWSMNPDGSPDPSSLHGGCPVIRGNKWSSTKWMRVNE 288
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/213 (41%), Positives = 131/213 (61%), Gaps = 9/213 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPST-LALRKGETVDNTQGIRTSSGVFI 138
+++SW PR + NF T E+C+ +IN+AK N+R ST + G ++++ +RTSSG F+
Sbjct: 86 EIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESR--VRTSSGTFL 143
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ D+ + IE +IA T +P NGE +L Y++G+KY H+D F
Sbjct: 144 ARGRDK--IVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGGD 201
Query: 199 RVASFLVYLTDLEEGGETMFPFENG--MNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 254
R+A+ L+YL+D+EEGGET+FP G + G + C GL +KP+ + LLF+S+
Sbjct: 202 RIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCGKKGLSIKPKMRNALLFWSIK 261
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
P+ T DP S+HGSCPV+KG KW +TKWIR E
Sbjct: 262 PDATYDPLSLHGSCPVIKGNKWSSTKWIRIGEH 294
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 133/225 (59%), Gaps = 19/225 (8%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 80
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FI +D + +E+KI+ T+LP+ NGE +LRY+ GQKY
Sbjct: 81 LSGESKLSEVRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKY 138
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GL 238
+ HYD F + + RVA+ L+YLTD+ +GGET+FP +++ D +C G+
Sbjct: 139 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGI 198
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 199 AVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 243
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 95/232 (40%), Positives = 134/232 (57%), Gaps = 25/232 (10%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
+ A +V+ + +SW PRA + F + E+C +I++AK L+ S +A D
Sbjct: 24 YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVA-------D 76
Query: 126 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 179
N G +RTSSG+FI +D + IE+KIA T LP+ NGE +LRY+ GQK
Sbjct: 77 NVSGKSRLSEVRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQK 134
Query: 180 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG--------MNADGSYD 231
Y++HYD F + + R+A+ L+YL+D+ +GGET+FP N D S
Sbjct: 135 YDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSEC 194
Query: 232 YQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+K G+ VKPR+GD LLF+SL P DP S+HG CPV++GEKW ATKWI
Sbjct: 195 ARK--GIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIH 244
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 125/211 (59%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + F T E+C +I+MAK L S +A DN G +RTSSG
Sbjct: 58 LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVA-------DNESGKSIPSEVRTSSG 110
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+F+ A+D+ + IE +IA T LP NGEA IL Y+ GQKY H+D F +
Sbjct: 111 MFLQKAQDD--VVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQL 168
Query: 196 KSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYS 252
R+A+ L+YL+++EEGGET+FP E + + C G VKP++GD LLF+S
Sbjct: 169 GGHRIATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGGYSVKPKKGDALLFFS 228
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L P+ + D S+HGSCPV++GEKW ATKWI
Sbjct: 229 LHPDASTDSLSLHGSCPVIEGEKWSATKWIH 259
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 135/218 (61%), Gaps = 21/218 (9%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + ++C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 97 EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ ++ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 150 SGMFLQRGRNK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGL 248
QR+A+ L+YL+D+EEGGET+FP N +NA Y + C GL VKP+ GD L
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPWYNELSDCAKRGLSVKPKMGDAL 266
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
LF+S+ P+ T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 267 LFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 127/208 (61%), Gaps = 13/208 (6%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLN-LRPSTLALRKGETVDNTQGIRTSSGVFISA 140
LSW PRA NF + E+C I+ A+ ++ S + G++VD+ IRTS+G + +
Sbjct: 25 LSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKS-Q 198
ED + IE+++A+VT +P N E +L Y GQKY HYD F DP GP+ Q
Sbjct: 83 GEDS--VISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 199 RVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 255
RV + L YLT +EEGGET+ P E + DG + +C GL VKP +GD L FYSL P
Sbjct: 141 RVVTXLXYLTTVEEGGETVLPNAEQKVTGDG---WSECAKRGLAVKPIKGDALXFYSLKP 197
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G+ DP S+HGSCP +KG+KW ATKWI
Sbjct: 198 DGSNDPASLHGSCPTLKGDKWSATKWIH 225
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 133/212 (62%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 138
++LSW PRA + NF + E+C+ +IN+AK L S++ K G++ ++ +RTSSG+F+
Sbjct: 70 EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESR--VRTSSGMFL 127
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+D+ + IE +IA T +P NGE +L Y +G+KY HYD F + Q
Sbjct: 128 KRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 185
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLL 254
RVA+ L+YL+D+EEGGET+FP + + D +C GL +KP+ GD LLF+S+
Sbjct: 186 RVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMR 245
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ T+D +S+HG CPV+ G KW +TKW+ +E
Sbjct: 246 PDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 277
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 132/208 (63%), Gaps = 9/208 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PRA + NF + E+C+ +I +AK + ST+ G++ D+ +RTSSG+F+
Sbjct: 105 EVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSR--VRTSSGMFL 162
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D+ + IE +IA T +P +GE +L Y++GQKY H+D F + Q
Sbjct: 163 QRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQ 220
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLL 254
R+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LLF+S+
Sbjct: 221 RMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMK 280
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWI 282
P+ T+DP S+HG CPV+KG KW +TKW+
Sbjct: 281 PDATLDPLSLHGGCPVIKGNKWSSTKWL 308
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 129/206 (62%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA + F + E+C ++N+A+ L S +A + G+++++ +RTSSG+FI
Sbjct: 49 LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESE--VRTSSGMFIGK 106
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
++DE +D IE +IA T LP+ NGE+ IL Y+ GQKY H+D F + RV
Sbjct: 107 SQDE--IVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRV 164
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 257
+ L+YL+++ +GGET+FP G D + C G VKP++GD LLF+SL P+
Sbjct: 165 VTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAKNGYAVKPQKGDALLFFSLHPDA 224
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV++GEKW ATKWI
Sbjct: 225 TTDTNSLHGSCPVIEGEKWSATKWIH 250
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 90/207 (43%), Positives = 129/207 (62%), Gaps = 9/207 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSSGVFIS 139
LSW PRA + F + E+C +I +AK L S + + GE+ D+ +RTSSG+F++
Sbjct: 35 LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLT 92
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
+D+ + +E K+A T LP NGEA IL Y+ GQKY+ H+D F ++ R
Sbjct: 93 KRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHR 150
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPN 256
+A+ L+YL+++ +GGET+FP G D + KC G VKPR+GD LLF++L N
Sbjct: 151 IATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLN 210
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
GT DP S+HGSCPV++GEKW AT+WI
Sbjct: 211 GTTDPNSLHGSCPVIEGEKWSATRWIH 237
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 127/211 (60%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 38 LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 95
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE + IEE+IA T LP NGE+ IL YK G+KY HYD F + R+
Sbjct: 96 RQDE--VVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK------CI--GLKVKPRQGDGLLFYS 252
A+ L+YL+++E+GGET+FP NA+G K C G VKP +GD LLF+S
Sbjct: 154 ATVLMYLSNVEKGGETIFP-----NAEGKLTQHKDETASECAKNGYAVKPMKGDALLFFS 208
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L P+ T DP S+HGSCPV++G+KW ATKWI
Sbjct: 209 LHPDATTDPDSLHGSCPVIEGQKWSATKWIH 239
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 129/220 (58%), Gaps = 32/220 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T E+C +I++AK L+ S +A DN G +RTSSG
Sbjct: 40 ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVA-------DNESGKSQVSEVRTSSG 92
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS A+D + IEEK+A T LP NGE +LRY+ GQKY +H+D F + +
Sbjct: 93 AFISKAKD--AIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIAR 150
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY----------DYQKCI--GLKVKPR 243
R A+ L+YL+++E+GG+T+FP NA+ S D +C G+ VKPR
Sbjct: 151 GGHRYATVLMYLSNVEKGGDTVFP-----NAELSERQKAAIAANDDLSECAKRGISVKPR 205
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+GD LLF+SL P T D S+HG CPV++GEKW ATKWI
Sbjct: 206 KGDALLFFSLTPTATPDQLSLHGGCPVIEGEKWSATKWIH 245
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+++SW PRA NF T ++C +I +A ++ ST+ + G + D+ +RTSSG+F+
Sbjct: 3 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR--VRTSSGMFL 60
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ +D + IE+KIAK+T +P+ +GE +L Y+ GQKY++H+D F Q
Sbjct: 61 NRGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ 118
Query: 199 RVASFLVYLTDLEEGGETMFP--FENGMNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 254
R+A+ L+YLTD+EEGGET+FP +N + +C G+ V+P++GD LLF+S+
Sbjct: 119 RIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMS 178
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ +D +S+HG CPV+KG+KW ATKW+R E
Sbjct: 179 PDAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 210
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+++SW PRA NF T ++C +I +A ++ ST+ + G + D+ +RTSSG+F+
Sbjct: 15 EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR--VRTSSGMFL 72
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ +D + IE+KIAK+T +P+ +GE +L Y+ GQKY++H+D F Q
Sbjct: 73 NRGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ 130
Query: 199 RVASFLVYLTDLEEGGETMFP--FENGMNADGSYDYQKC--IGLKVKPRQGDGLLFYSLL 254
R+A+ L+YLTD+EEGGET+FP +N + +C G+ V+P++GD LLF+S+
Sbjct: 131 RIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMS 190
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ +D +S+HG CPV+KG+KW ATKW+R E
Sbjct: 191 PDAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 222
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/238 (39%), Positives = 135/238 (56%), Gaps = 21/238 (8%)
Query: 58 KDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 117
K ++ W + + + + +SW PRA + F T +C +I++AK L+ S +A
Sbjct: 15 KCDHVWSSYAGSASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA 74
Query: 118 LRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNI 171
DN G +RTSSG+FIS +D + IE+KI+ T LP+ NGE +
Sbjct: 75 -------DNLSGESQLSDVRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQV 125
Query: 172 LRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNAD 227
RY+ GQKY+ HYD F + + R+A+ L+YLTD+ +GGET+FP A+
Sbjct: 126 SRYEHGQKYDPHYDYFTDKVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAE 185
Query: 228 GSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
S D +C G+ VKPR+GD LLF+SL N T D +S+H CPV++GEKW ATKWI
Sbjct: 186 TSSDLSECAKKGIAVKPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 243
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/213 (39%), Positives = 125/213 (58%), Gaps = 17/213 (7%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRT 132
+VLSW PRA + NF T + ++ K ++ K E VDN G +RT
Sbjct: 1 MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHME-------KSEVVDNETGKSAPSKVRT 53
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 192
SSG+F++ ED+ ++ IE +IAK T +P+ NGE IL Y+ ++Y H+D F
Sbjct: 54 SSGMFLNRGEDD--VIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFN 111
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC--IGLKVKPRQGDGLLF 250
QR+A+ L+YL+D+E+GGET+FP + G+ + +C G KP++GD L F
Sbjct: 112 TQNGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFF 171
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
YSL P+G +D S+H CPV+KG+KW ATKW+R
Sbjct: 172 YSLTPDGRMDEKSLHAGCPVMKGDKWSATKWLR 204
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 128/206 (62%), Gaps = 9/206 (4%)
Query: 83 SWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSSGVFISA 140
SW PRA + F + E+C +I +AK L S + + GE+ D+ +RTSSG+F++
Sbjct: 1 SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLTK 58
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D+ + +E K+A T LP NGEA IL Y+ GQKY+ H+D F ++ R+
Sbjct: 59 RQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 116
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++ +GGET+FP G D + KC G VKPR+GD LLF++L NG
Sbjct: 117 ATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNG 176
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T DP S+HGSCPV++GEKW AT+WI
Sbjct: 177 TTDPNSLHGSCPVIEGEKWSATRWIH 202
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 131/225 (58%), Gaps = 7/225 (3%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
H D+ + +SW PRA + NF T +C +IN+AK +++ S + +
Sbjct: 42 HESGDDEGKAEQWVEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKS-MVVDSSSGKS 100
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+RTSSG F+ D+ + IE++IA + +P +GE IL Y++GQKY H+D
Sbjct: 101 KDSRVRTSSGTFLPRGRDK--IIRDIEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFD 158
Query: 186 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVK 241
F QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VK
Sbjct: 159 YFMDDYNTENGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVK 218
Query: 242 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ GD LLF+S+ P+ ++DP+S+HG CPV++G KW +TKW+R E
Sbjct: 219 PKMGDALLFWSMKPDASLDPSSLHGGCPVIRGNKWSSTKWMRVNE 263
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/205 (42%), Positives = 129/205 (62%), Gaps = 7/205 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA + NF + +C +I++AK L S +A + G++V + IRTSSG+F+
Sbjct: 9 LSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSE--IRTSSGMFLMK 66
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D+ + IE++IA T LP+ NGEA +LRY+ G+KY H+D F + R+
Sbjct: 67 GQDD--IISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRI 124
Query: 201 ASFLVYLTDLEEGGETMFPF--ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
A+ L+YL+D+ +GGET+FP + G D S+ G+ VKPR+GD LLF+SL P+
Sbjct: 125 ATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFSLHPSAV 184
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIR 283
D +S+H CPV++GEKW ATKWI
Sbjct: 185 PDESSLHTGCPVIEGEKWSATKWIH 209
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 131/208 (62%), Gaps = 15/208 (7%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
+SW PRA + NF + E+C+ + +A+ L ST+ K G+++D+T +RTSSG F++
Sbjct: 41 VSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDST--VRTSSGTFLAR 98
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS--- 197
EDE + IE++I+ VTM+P NGEA IL+Y GQKY H D F +Y +
Sbjct: 99 GEDE--VVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH-DKYNSRTENGG 155
Query: 198 QRVASFLVYLTDLEEGGETMFPF-ENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLL 254
QRVA+ L+YL+ EEGGET+FP+ E + +G + +C GL VK +G LLFYSL
Sbjct: 156 QRVATILMYLSTPEEGGETVFPYAEKKVEGEG---WSECARKGLAVKAVKGSALLFYSLK 212
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWI 282
PNG D S HGSCP + GEKW AT+WI
Sbjct: 213 PNGEEDQASTHGSCPTLAGEKWSATRWI 240
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 131/216 (60%), Gaps = 28/216 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LS PRA + F + +C I+++AK ++ S +A DN G RTSSG
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVA-------DNDSGKSVASQARTSSG 90
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F++ EDE + IE+++A T LP N E+ +LRY+ GQKY++H+D F +
Sbjct: 91 TFLAKREDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKL 148
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDG 247
QRVA+ L+YLTD+++GGET+FP NA+GS+ + +C GL VKP++GD
Sbjct: 149 GGQRVATVLMYLTDVKKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDA 203
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LLF++L N T D S+HGSCPV++GEKW ATKWI
Sbjct: 204 LLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIH 239
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 132/231 (57%), Gaps = 25/231 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 29 AGSASAIIDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVA-------DN 81
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 82 LSGDSKLSDVRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKY 139
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP------FENGMNADGSYDYQK 234
+ HYD F + + RVA+ L+YLT++ GGET+FP F ++ D +
Sbjct: 140 DPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSE 199
Query: 235 CI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
C G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 200 CAKKGIAVKPRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWIH 250
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/268 (35%), Positives = 160/268 (59%), Gaps = 14/268 (5%)
Query: 21 FVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQ 80
F L+ F+ + S+ + T + S ++ ES+ +Y G+ D+ + +
Sbjct: 38 FTLLSVFVLFVLAIWVFSVPVKR-TPYQISRQLSESIAADYAKRSDGK--DEPKERV--E 92
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFIS 139
VLSW PRA + NF ++C+ +IN+AK ++ S + K G ++D+ +RTSSG F++
Sbjct: 93 VLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSN--VRTSSGWFLN 150
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
+D+ + IE++IA + +P +GE ++L Y++ QKY++HYD F QR
Sbjct: 151 RGQDK--IIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQR 208
Query: 200 VASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKC--IGLKVKPRQGDGLLFYSLLP 255
A+ L+YL+D+E+GGET+FP N + + +C GL V+P+ GD LLF+S+ P
Sbjct: 209 GATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVKP 268
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+ ++DP+S+HGSCPV++G KW ATKW+R
Sbjct: 269 DASLDPSSLHGSCPVIQGNKWSATKWMR 296
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 92/211 (43%), Positives = 123/211 (58%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + F + E+C +I +AK L S +A DN G +RTSSG
Sbjct: 57 LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVA-------DNESGKSIMSEVRTSSG 109
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+F+ A+DE + IE +IA T LP NGE+ IL Y+ G+KY H+D F +
Sbjct: 110 MFLLKAQDE--IVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLL 167
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYS 252
R+A+ L+YL +EEGGET+FP G + D + C G V P++GD LLF+S
Sbjct: 168 GGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDALLFFS 227
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L P+ T DP+S+HGSCPV+ GEKW ATKWI
Sbjct: 228 LHPDATTDPSSLHGSCPVIAGEKWSATKWIH 258
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 127/215 (59%), Gaps = 23/215 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PR + +F + ++ ++++A+ L+ S +A DN G RTSSG
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVA-------DNLSGKSELSDARTSSG 113
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FI ++D + IEEKIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 114 TFIRKSQDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLR 171
Query: 196 KSQRVASFLVYLTDLEEGGETMFPF-----ENGMNADGSYDYQKCI--GLKVKPRQGDGL 248
R+A+ L+YLTD+ EGGET+FP E+G N + S +C G+ VKPR+GD L
Sbjct: 172 GGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDS-TLSECAKKGVAVKPRKGDAL 230
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LF++L P+ + D S+H CPV+KGEKW ATKWIR
Sbjct: 231 LFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 127/215 (59%), Gaps = 23/215 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PR + +F + ++ ++++A+ L+ S +A DN G RTSSG
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVA-------DNLSGKSELSDARTSSG 113
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FI ++D + IEEKIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 114 TFIRKSQDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLR 171
Query: 196 KSQRVASFLVYLTDLEEGGETMFPF-----ENGMNADGSYDYQKCI--GLKVKPRQGDGL 248
R+A+ L+YLTD+ EGGET+FP E+G N + S +C G+ VKPR+GD L
Sbjct: 172 GGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDS-TLSECAKKGVAVKPRKGDAL 230
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LF++L P+ + D S+H CPV+KGEKW ATKWIR
Sbjct: 231 LFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIR 265
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 96/225 (42%), Positives = 129/225 (57%), Gaps = 20/225 (8%)
Query: 70 GDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
G SV P +V LSW PRA + F + E+C +I +AK L S +A DN
Sbjct: 45 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVA-------DNE 97
Query: 128 QG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 181
G +RTSSG+F++ A+DE + IE +IA T LP NGE+ IL Y+ GQKY
Sbjct: 98 SGKSIMSEVRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYE 155
Query: 182 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGL 238
H+D F + R+A+ L+YL+D+E+GGET+FP D S+ G
Sbjct: 156 PHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGY 215
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKPR+GD LLF+SL + + D S+HGSCPV++GEKW ATKWI
Sbjct: 216 AVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIH 260
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 130/216 (60%), Gaps = 28/216 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LS PRA + F + +C ++++AK ++ S +A DN G RTSSG
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVA-------DNDSGKSVASQARTSSG 90
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F++ EDE + IE+++A T LP N E+ +LRY+ GQKY++H+D F +
Sbjct: 91 TFLAKREDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKL 148
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDG 247
QRVA+ L+YLTD+ +GGET+FP NA+GS+ + +C GL VKP++GD
Sbjct: 149 GGQRVATVLMYLTDVNKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDA 203
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LLF++L N T D S+HGSCPV++GEKW ATKWI
Sbjct: 204 LLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIH 239
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 98/229 (42%), Positives = 133/229 (58%), Gaps = 23/229 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 80
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FI +D + +E+KI+ T+LP+ NGE +LRY+ GQKY
Sbjct: 81 LSGESKLSEVRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKY 138
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGS---YDYQKCI 236
+ HYD F + + RVA+ L+YLTD+ +GGET+FP E GS D +C
Sbjct: 139 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECA 198
Query: 237 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 199 QKGIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 247
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 68 QAGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
+ G ++T P +V LS PRA + F + E+C+ +IN AK L S +A G++V
Sbjct: 47 KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVT 106
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+ + RTS+G+F+ A+DE + IE +IA T LP NGE ILRY+ GQKY H+D
Sbjct: 107 SKE--RTSTGMFLHKAQDE--IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFD 162
Query: 186 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFEN-GMNADGSYDYQKC--IGLKVKP 242
F R+A+ L+YL+++E+GGET+FP ++ + D +C +G V+P
Sbjct: 163 FFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRP 222
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+ GD LLF+S+ PN T D TS HGSCPV++GEKW ATKWI
Sbjct: 223 KLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIH 263
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/211 (42%), Positives = 125/211 (59%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + NF T E+C +I ++K L S +A DN G +RTSSG
Sbjct: 54 LSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVA-------DNESGKSIQSEVRTSSG 106
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+F++ +DE + IE +IA T LP NGE+ +L Y G+KY H+D F +
Sbjct: 107 MFLNKQQDE--IVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKANQRL 164
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENG-MNADGSYDYQKCI--GLKVKPRQGDGLLFYS 252
RVA+ L+YL+++E+GGET+FP G ++ + +C G VKPR+GD LLF+S
Sbjct: 165 GGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECAHKGYAVKPRKGDALLFFS 224
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L + T D S+HGSCPV++GEKW ATKWI
Sbjct: 225 LHLDATTDSKSLHGSCPVIEGEKWSATKWIH 255
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 128/210 (60%), Gaps = 11/210 (5%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VLSW PRA + +F T E+C +I +A+ +L ST+ G++ D+ +RTSSG F+
Sbjct: 4 EVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSR--LRTSSGTFL 61
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+D + IE++IA T +P GE +L+YK +KY HYD F Q
Sbjct: 62 MRGQDP--VIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQ 119
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGLLFYSL 253
R+A+ L+YL+++EEGGET+FP +N D+ K C GL V+PR GD LLF+S+
Sbjct: 120 RIATVLMYLSNVEEGGETVFPAAQ-VNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSM 178
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+ T+D TS+HG CPV+KG KW ATKW+
Sbjct: 179 KPDATLDSTSLHGGCPVIKGTKWSATKWLH 208
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/229 (41%), Positives = 133/229 (58%), Gaps = 23/229 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S T P +V +SW PRA + F T +C ++++A+ L+ + E DN
Sbjct: 27 AGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELK-------RSEVADN 79
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 80 DSGKSKLSTVRTSSGMFISKNKDP--IVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKY 137
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY----DYQKCI 236
SHYD F + R+A+ L+YL+++ +GGET+FP + +Y D +C
Sbjct: 138 ESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECA 197
Query: 237 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKP++GD LLF+SL PN D S+HG CPV++GEKW ATKWI
Sbjct: 198 KKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/216 (39%), Positives = 132/216 (61%), Gaps = 16/216 (7%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PR + NF + ++C +IN+A+ L ST+ VD T G +RTS
Sbjct: 79 EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTV-------VDATTGKGIESKVRTS 131
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
+G+F++ + T+ IE +IA +M+P NGE +LRY+ Q Y +H+D F +
Sbjct: 132 TGMFLNGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNL 191
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
+ QRVA+ L+YLT+ EGGET+FP + + S + IG+ VKP++GD +LF+S+
Sbjct: 192 KRGGQRVATMLMYLTEGVEGGETIFP--QAGDKECSCGGEMKIGVCVKPKRGDAVLFWSI 249
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+G +DPTS+HG C V+ GEKW +TKW+R Q +D
Sbjct: 250 KLDGQVDPTSLHGGCKVLSGEKWSSTKWMR-QRAFD 284
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/233 (39%), Positives = 131/233 (56%), Gaps = 21/233 (9%)
Query: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122
W + + V + +SW PRA + F T +C +I++AK L+ S +A
Sbjct: 21 WGSYAGSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA----- 75
Query: 123 TVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 176
DN G +RTSSG+FIS +D + IE+KI+ T LP+ NGE +LRY+
Sbjct: 76 --DNLSGESQLSDVRTSSGMFISKNKDP--IISGIEDKISSWTFLPKENGEDIQVLRYEH 131
Query: 177 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDY 232
GQKY+ HYD F + + R+A+ L+YLT++ +GGET+FP + S D
Sbjct: 132 GQKYDPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDL 191
Query: 233 QKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+C G+ VKP +GD LLF+SL N T D +S+H CPV++GEKW ATKWI
Sbjct: 192 SECAKKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 244
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 130/210 (61%), Gaps = 11/210 (5%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPST-LALRKGETVDNTQGIRTSSGVFI 138
+VLSW PRA + +F T +C +I +AK +L ST + G++ D+ +RTSSG F+
Sbjct: 3 EVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSR--VRTSSGTFL 60
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+D + IE++IA T +P GE +L+Y+ +KY HYD F Q
Sbjct: 61 VRGQDH--IIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQ 118
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGLLFYSL 253
R+A+ L+YL+D+E+GGET+FP + +NA D+ + C GL V+PR GD LLF+S+
Sbjct: 119 RIATVLMYLSDVEKGGETVFP-ASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSM 177
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+ +DPTS+HG+CPV++G KW ATKW+
Sbjct: 178 KPDAKLDPTSLHGACPVIQGTKWSATKWLH 207
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 96/225 (42%), Positives = 131/225 (58%), Gaps = 20/225 (8%)
Query: 70 GDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
G SV P +V LSW PRA + F + E+C +I +AK L S +A DN
Sbjct: 46 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVA-------DND 98
Query: 128 QG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 181
G IRTSSG+F++ A+DE + IE +IA T LP NGE+ IL Y+ GQKY
Sbjct: 99 SGKSIMSDIRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPVENGESMQILHYENGQKYE 156
Query: 182 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GL 238
H+D F + R+A+ L+YL+D+E+GGET+FP E + + +C G
Sbjct: 157 PHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGY 216
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKP++GD LLF+SL + + D S+HGSCPV++GEKW ATKWI
Sbjct: 217 AVKPQKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIH 261
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 90/212 (42%), Positives = 127/212 (59%), Gaps = 21/212 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PR + +F + ++ +I++A+ L+ S +A DN G +RTSSG
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLSEVRTSSG 106
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+ +D ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 107 TFLRKGQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVR 164
Query: 196 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
R A+ L+YLTD+ EGGET+FP ++ +A S QK G+ V+PR+GD LLF+
Sbjct: 165 GGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK--GIAVRPRKGDALLFF 222
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+L P+GT D S+HG CPV+KGEKW ATKWIR
Sbjct: 223 NLNPDGTTDSVSLHGGCPVIKGEKWSATKWIR 254
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 98/229 (42%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 566 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 618
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FI +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 619 LSGESKLSEVRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKY 676
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGS---YDYQKCI 236
+ HYD F + + RVA+ L+YLTD+ +GGET+FP E GS + +C
Sbjct: 677 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECA 736
Query: 237 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 737 QKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATKWIH 785
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 122/211 (57%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA F T +C+ +I++AK L S +A DN G +RTSSG
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVA-------DNESGKSVMSEVRTSSG 100
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F +
Sbjct: 101 MFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQAL 158
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYS 252
R+A+ L+YL+D+ +GGET+FP G D + C G VKP +GD LLF+S
Sbjct: 159 GGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFS 218
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 LHPDATTDSDSLHGSCPVIEGQKWSATKWIH 249
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 122/211 (57%), Gaps = 18/211 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA F T +C+ +I++AK L S +A DN G +RTSSG
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVA-------DNESGKSVMSEVRTSSG 100
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F +
Sbjct: 101 MFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQAL 158
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYS 252
R+A+ L+YL+D+ +GGET+FP G D + C G VKP +GD LLF+S
Sbjct: 159 GGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFS 218
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
L P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 LHPDATTDSDSLHGSCPVIEGQKWSATKWIH 249
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 128/219 (58%), Gaps = 31/219 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 43 VSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVA-------DNLSGESKLSEVRTSSG 95
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 96 MFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIAR 153
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGS---------YDYQKC--IGLKVKPRQ 244
RVA+ L+YLT++ +GGET+FP NA+ S D +C G+ VKPR+
Sbjct: 154 GGHRVATVLMYLTNVTKGGETVFP-----NAEESPRHKLSETDEDLSECGKKGVAVKPRR 208
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
GD LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 209 GDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 247
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 125/214 (58%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 39 VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVA-------DNESGKSKLSEVRTSSG 91
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+FI+ +D + IEEKI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 92 MFIAKGKDP--IIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIAR 149
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+D+ +GGET+FP + D +C G+ VKPR+GD LL
Sbjct: 150 GGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDALL 209
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F+SL P DP S+H CPV++GEKW ATKWI
Sbjct: 210 FFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIH 243
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 93/222 (41%), Positives = 126/222 (56%), Gaps = 27/222 (12%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-NTQGIRTSSGVF 137
+ +SW PRA + NF TPE+C ++N+AK + L++ D T G SG F
Sbjct: 2 IEQISWEPRAFVYHNFLTPEECAHLVNLAK----ATDGGLKRATVADARTGGTFPGSGAF 57
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE-----Y 192
+ D + IEE+I+ M+P +GE ILRY G+KY+ H+D FD + Y
Sbjct: 58 LLRNHDP--IVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFY 115
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENG------MNADG---SYDYQKCI--GLKVK 241
G QRVA+ L+YL+D+E GGET+FP M+ G S D KC L VK
Sbjct: 116 G----QRVATVLMYLSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVK 171
Query: 242 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
PR+GD LLF++ NG DPTS+H CPV++GEKW ATKW+R
Sbjct: 172 PRRGDALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWMR 213
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 163 bits (413), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 128/225 (56%), Gaps = 20/225 (8%)
Query: 70 GDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
G SV P +V LSW PRA + F + E+C +I +AK L S +A DN
Sbjct: 45 GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVA-------DNE 97
Query: 128 QG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 181
G +RTSSG+F++ A+DE + IE +IA T LP NGE+ IL Y+ GQKY
Sbjct: 98 SGKSIMSEVRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYE 155
Query: 182 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGL 238
H+D F + R+A+ L+YL+D+E+GGET+F D S+ G
Sbjct: 156 PHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGY 215
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKPR+GD LLF+SL + + D S+HGSCPV++GEKW ATKWI
Sbjct: 216 AVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIH 260
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 135/218 (61%), Gaps = 17/218 (7%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P + + P+A F NF TP +C+ ++ +AK L PST+ KG + IRTS+G+F
Sbjct: 169 PLVLSNHEPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKG-SGSMVSKIRTSAGMF 227
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-EYGPQK 196
+ +D T+ IEE+IA + LP NGE ILRY+ GQKY+ H+D F Q P++
Sbjct: 228 LGRGQDP--TVRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRR 285
Query: 197 S-QRVASFLVYLTDLEEGGETMFPFENGM-----NAD--GSYD-YQKCI--GLKVKPRQG 245
QR+A+ L+YL D EGGET+FP NG+ +AD G+++ + C G+ VK +G
Sbjct: 286 GGQRMATMLIYLEDTTEGGETIFP--NGVRPEDWDADEPGNHNSWSDCAKKGIPVKSHRG 343
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
D +LF+SL + T+D S+HG+CPV+ GEKW A KWIR
Sbjct: 344 DAVLFWSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIR 381
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 136/220 (61%), Gaps = 19/220 (8%)
Query: 77 IPFQVLSWM-PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
+P VLS + P+A F NF + E+C ++ +AK L PST+ G +V +T IRTS+G
Sbjct: 276 MPPLVLSAVKPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAGGTSVPST--IRTSAG 333
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGP 194
+F+ A D+ TL+ IE +IA + P NGE ILRY +GQKY+ H+D F D P
Sbjct: 334 MFLRKAADK--TLENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSP 391
Query: 195 QKS-QRVASFLVYLTDLEEGGETMFPFENGMNAD--------GSYDYQKCI--GLKVKPR 243
++ QR+A+ L+YL + +EGGET+FP G A+ +++ +C GL VK
Sbjct: 392 KRGGQRMATMLIYLENTKEGGETIFP--RGTRAETFDLTEEGNPHEWSECTKHGLPVKSV 449
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+GD LLF+SL + +D S+HG+CPVVKG+KW A KWIR
Sbjct: 450 KGDALLFWSLTDDYKLDMGSLHGACPVVKGQKWTAVKWIR 489
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 90/206 (43%), Positives = 123/206 (59%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA F +C +I +AK L S +A K G++V + +RTSSG+F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE + IEE+I+ T LP NGEA IL Y+ G+KY HYD F + R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV++G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPVIEGQKWSATKWIH 239
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 126/216 (58%), Gaps = 23/216 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 43 VSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVA-------DNLSGESKLSEVRTSSG 95
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 96 MFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIAR 153
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMN------ADGSYDYQKC--IGLKVKPRQGDG 247
RVA+ L+YLT++ +GGET+FP ++ D +C G+ VKPR+GD
Sbjct: 154 GGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRRGDA 213
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LLF+SL PN D S+H CPV++GEKW ATKWI
Sbjct: 214 LLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 249
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 67 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 117
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 118 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 176
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 96 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 177 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 235
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 236 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 122/214 (57%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PR + +F + ++ ++++A+ L+ S +A DNT G +RTS G
Sbjct: 55 ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVA-------DNTSGKSTLSEVRTSYG 107
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+KIA T LP+ NGE +LRYK G+K +D F +
Sbjct: 108 TFISKGKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVR 165
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI------GLKVKPRQGDGLL 249
RVA+ L+YLTD+ EGGET+FP G +D + G+ VKPR+GD LL
Sbjct: 166 GGHRVATVLLYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSECAQKGIAVKPRKGDALL 225
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L P+ DP S+HG C V+KGEKW ATKWIR
Sbjct: 226 FFNLRPDAATDPLSLHGGCTVIKGEKWTATKWIR 259
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 97/229 (42%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW PRA + F T +C +I++AK L+ S +A DN
Sbjct: 28 AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVA-------DN 80
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FI +D + IE+KI+ T LP+ NGE +LRY+ GQKY
Sbjct: 81 LSGESKLSEVRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKY 138
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGS---YDYQKCI 236
+ HYD F + + RVA+ L+YLTD+ +GGET+FP E GS + +C
Sbjct: 139 DPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECA 198
Query: 237 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKPR+GD LLF+SL PN D S+H CPV++GEKW AT+WI
Sbjct: 199 QKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIH 247
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 125/212 (58%), Gaps = 18/212 (8%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSS 134
++SW PR ++ F + ++C ++ + K L+ S +A DN G +RTSS
Sbjct: 50 IISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVA-------DNESGKSVMSEVRTSS 102
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G+F+ +D + IEE+IA T+LP+ N E ILRY+ GQKY+ H+D F +
Sbjct: 103 GMFLDKQQDP--VVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQL 160
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFY 251
Q R A+ L YL+ +E+GGET+FP G + D + C GL VK +GD +LF+
Sbjct: 161 QGGHRYATVLTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCAKKGLAVKAVKGDSVLFF 220
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+L P+GT DP S+HGSCPV++GEKW A KWI
Sbjct: 221 NLQPDGTPDPLSLHGSCPVIEGEKWSAPKWIH 252
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 125/206 (60%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE + IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 201 ASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D ++ G VKP +GD LLF+SL P+
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 214
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV++G+KW ATKWI
Sbjct: 215 TTDSDSLHGSCPVIEGQKWSATKWIH 240
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 67 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 117
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 118 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 176
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 96 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 177 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 235
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 236 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 125/206 (60%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PRA + F + +C +I++AK L S +A G++V + +RTSSG+F+
Sbjct: 56 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 113
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
A+DE + +E +IA T+LP NGE+ IL Y+ GQKY H+D F + R+
Sbjct: 114 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 171
Query: 201 ASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP F+ D S+ G VK ++GD LLF+SL +
Sbjct: 172 ATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDA 231
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV+ GEKW ATKWI
Sbjct: 232 TTDERSLHGSCPVIAGEKWSATKWIH 257
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 122/206 (59%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA F +C +I +AK L S +A K G++V + +RTSSG+F+
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE + IEE+I+ T LP NGEA IL Y+ G+KY HYD F + R+
Sbjct: 96 KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCP ++G+KW ATKWI
Sbjct: 214 TTDSDSLHGSCPAIEGQKWSATKWIH 239
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 67 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 117
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 30 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 89
Query: 118 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 176
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 90 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 145
Query: 177 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 235
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 146 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 205
Query: 236 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 206 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 255
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 124/206 (60%), Gaps = 8/206 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 43 LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 100
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F + R+
Sbjct: 101 KQDE--VVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 158
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNG 257
A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL P+
Sbjct: 159 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDA 218
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 TTDSESLHGSCPVIEGQKWSATKWIH 244
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW+PRA + F T +C +I++AK L+ S +A DN
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVA-------DN 77
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FIS +D + IE++I+ T LP+ NGE +LRY+ GQKY
Sbjct: 78 LSGDSQLSDVRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKY 135
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 236
+ HYD F + Q R+A+ L+YLT++ +GGET+FP + S D +C
Sbjct: 136 DPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECA 195
Query: 237 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKPR+GD LLF+SL N D S+H CPV++GEKW ATKWI
Sbjct: 196 KKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIH 244
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 132/229 (57%), Gaps = 23/229 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW+PRA + F T +C +I++AK L+ S +A DN
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVA-------DN 77
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+FIS +D + IE++I+ T LP+ NGE +LRY+ GQKY
Sbjct: 78 LSGDSQLSDVRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKY 135
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 236
+ HYD F + Q R+A+ L+YLT++ +GGET+FP + S D +C
Sbjct: 136 DPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECA 195
Query: 237 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKPR+GD LLF+SL N D S+H CPV++GEKW ATKWI
Sbjct: 196 KKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIH 244
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 67 GQAGDDSVTNI----PF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA 117
G+AGDD V + PF + +SW PR + F + ++C ++ + K ++ S +A
Sbjct: 36 GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95
Query: 118 LRK-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKI 176
K G++V +RTSSG+F+ +D + IE++IA T LP N E ILRY+
Sbjct: 96 DNKSGKSV--MSEVRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151
Query: 177 GQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKC 235
GQKY H+D F + R A+ L+YL+ +E+GGET+FP G D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211
Query: 236 I--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 123/212 (58%), Gaps = 19/212 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA F T +C+ +I++AK L S +A DN G +RTSSG
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVA-------DNESGKSVMSEVRTSSG 100
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F +
Sbjct: 101 MFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQAL 158
Query: 196 KSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFY 251
R+A+ L+YL+D+ +GGET+FP E G D + C G VKP +GD LLF+
Sbjct: 159 GGHRIATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFF 218
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
SL P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 219 SLHPDATTDSDSLHGSCPVIEGQKWSATKWIH 250
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 134/224 (59%), Gaps = 9/224 (4%)
Query: 68 QAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETV 124
+AG + + +V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG
Sbjct: 46 EAGLLRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKS 105
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
D +RTSSG+F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+
Sbjct: 106 D----VRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHH 161
Query: 185 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 244
D F + QRVA+ L+YLTD EGGET F + G + + S GL VKP +
Sbjct: 162 DYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHF-LQAG-DGECSCGGNVVKGLCVKPNK 219
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
GD +LF+S+ +G DP SIH CPV+KGEKW ATKW+R + +
Sbjct: 220 GDAVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWMRQKMTF 263
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 127/212 (59%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 136
+V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG D +RTSSG+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD----VRTSSGM 113
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+D F +
Sbjct: 114 FVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRG 173
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
QRVA+ L+YLTD GGET FP + + S GL VKP +GD +LF+S+ +
Sbjct: 174 GQRVATMLMYLTDGVVGGETHFP--QAGDGECSCGGNVVKGLCVKPNKGDAVLFWSMGLD 231
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
G DP SIH CPV+KGEKW ATKW+R + +
Sbjct: 232 GNTDPNSIHSGCPVLKGEKWSATKWMRQKMTF 263
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 123/206 (59%), Gaps = 9/206 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL--RPSTLALRKGETVDNTQGIRTSSGVFIS 139
+SW PR + F + +C+ +I +AK R + + + GE+V RTSSG+F+
Sbjct: 40 VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESV--MSKTRTSSGMFLI 97
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
+DE + IEE+IA TM P NGE+ +LRY G+KY H+D ++ + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 200 VASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPN 256
+A+ L+YL++++ GGET+FP E ++ + C G VKP +G +LF+SL PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWI 282
T DP S+HGSCPV++GEKW ATKWI
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWI 241
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 125/207 (60%), Gaps = 8/207 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 138
+ +SW PR + F + +C ++ +AK ++ S +A + G++V + +RTSSG+F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+D + IEE+IA T LP+ N E +LRY+ GQKY H+D F + +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R A+ L+YL+ + EGGET+FP G + D ++ GL VKP +GD +LF+SL
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLFFSLHA 221
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWI 282
+GT DP S+HGSCPV++GEKW A KWI
Sbjct: 222 DGTPDPLSLHGSCPVIRGEKWSAPKWI 248
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 129/224 (57%), Gaps = 35/224 (15%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + NF + E+C I+ +A+ +R ST+ +D+ G IRTS
Sbjct: 1 VSWYPRAFHLHNFMSHEECDRILEIARPRVRRSTV-------IDSVTGQSKVDPIRTSEQ 53
Query: 136 VFISAAEDESGTLDLI---EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-----AF 187
F++ GT D++ EE++A VT LP +GE IL+Y +GQKY++H+D +
Sbjct: 54 TFLN-----RGTWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSA 108
Query: 188 DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN------ADGSYDYQKCI--GLK 239
++ + RVA+ L+YL+D+EEGGET FP M A+G + C +
Sbjct: 109 SGKQLAAEGGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQ-KWSDCAEGNVA 167
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKPR+GDGLLF+S+ IDP S+H CPV++GEKW ATKWI
Sbjct: 168 VKPRKGDGLLFWSVNNENAIDPHSMHAGCPVIRGEKWTATKWIH 211
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 124/216 (57%), Gaps = 22/216 (10%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRT 132
+ +SW P A + F T E+C + +A +L ST+ VD + G IRT
Sbjct: 56 IERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTV-------VDASNGGSVPSDIRT 108
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 192
SSG+F+ ED+ + IE +IA T +P +GE F +LRY+ GQ+Y H+D F E+
Sbjct: 109 SSGMFLLRGEDD--VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQ-DEF 165
Query: 193 GPQK---SQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIG--LKVKPRQGD 246
++ QRVA+ L+YLTD+EEGGET+FP E G N G D C L VKPR+GD
Sbjct: 166 NQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRKGD 225
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
L F SL NGT D S H CPVVKG K+ ATKW+
Sbjct: 226 ALFFRSLHHNGTSDAMSSHAGCPVVKGVKFSATKWM 261
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PR + F NF + E+C + ++A+ L+ ST+ + G+ V + +RTSSG+F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
S+ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RVA+ L+YLTD EGGET FP + + S + GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFP--QAGDGECSCGGKMVKGLCVKPNKGDAVLFWSMGLDGE 237
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
D SIHG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/229 (41%), Positives = 131/229 (57%), Gaps = 23/229 (10%)
Query: 69 AGDDSVTNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
AG S P +V +SW+PRA + F T +C +I++AK L+ S +A DN
Sbjct: 25 AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVA-------DN 77
Query: 127 TQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
G +RTSSG+ IS +D + IE++I+ T LP+ NGE +LRY+ GQKY
Sbjct: 78 LSGDSQLSDVRTSSGMLISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKY 135
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE----NGMNADGSYDYQKCI 236
+ HYD F + Q R+A+ L+YLT++ +GGET+FP + S D +C
Sbjct: 136 DPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECA 195
Query: 237 --GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G+ VKPR+GD LLF+SL N D S+H CPV++GEKW ATKWI
Sbjct: 196 KKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIH 244
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 128/208 (61%), Gaps = 5/208 (2%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PR + F NF + E+C + ++A+ L+ ST+ + G+ V + +RTSSG+F+
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
S+ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RVA+ L+YLTD EGGET FP + + S + GL VKP +GD +LF+S+ +G
Sbjct: 180 RVATMLMYLTDGVEGGETHFP--QAGDGECSCGGKMVKGLCVKPNKGDAVLFWSMGLDGE 237
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
D SIHG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265
>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 162
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 76/137 (55%), Positives = 100/137 (72%), Gaps = 10/137 (7%)
Query: 40 LSQDVTAARPSARVVESVKDE----------YKWMPHGQAGDDSVTNIPFQVLSWMPRAL 89
+SQD AR AR++ES E + +P G GDDS+++IPFQVLSW PR L
Sbjct: 26 ISQDYANARLRARILESSNGEEDITGGSSFGFDLLPSGDTGDDSISSIPFQVLSWRPRVL 85
Query: 90 YFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLD 149
YFP FAT + C++II++A+ L+PS LALRKGET+D+T+ IRTSSG+FISA ED++G LD
Sbjct: 86 YFPKFATADHCETIISIARSQLKPSRLALRKGETLDSTREIRTSSGMFISADEDKTGILD 145
Query: 150 LIEEKIAKVTMLPRING 166
I+EKIA+ TM+PR NG
Sbjct: 146 FIDEKIARATMIPRANG 162
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 123/206 (59%), Gaps = 9/206 (4%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL--RPSTLALRKGETVDNTQGIRTSSGVFIS 139
+SW PR + F + +C+ +I +AK R + + + GE+V RTSSG+F+
Sbjct: 40 VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESV--MSKTRTSSGMFLI 97
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
+DE + IEE+IA TM P NGE+ +LRY G+KY H+D ++ + R
Sbjct: 98 RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155
Query: 200 VASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPN 256
+A+ L+YL++++ GGET+FP E ++ + C G VKP +G +LF+SL PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWI 282
T DP S+HGSCPV++GEKW ATKWI
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWI 241
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 123/202 (60%), Gaps = 11/202 (5%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK--GETVDNTQGIRTSSGVFISAAED 143
PRA + F T E+C I+ ++K +L S + K G T T IRTS+G FIS A D
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGST---TSDIRTSTGTFISRAHD 57
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASF 203
T+ IEE+I + +P +GEA +LRY+ GQ+Y +H+D F + G +++ R+A+
Sbjct: 58 P--TITAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFFHK--GGKRNNRIATV 113
Query: 204 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDP 261
L+YL+D+EEGGET+FP + Y +C G VK R+GD LLF+S+ P G +DP
Sbjct: 114 LLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDP 173
Query: 262 TSIHGSCPVVKGEKWVATKWIR 283
S H CPV+KG KW ATKW+
Sbjct: 174 GSSHAGCPVIKGVKWTATKWMH 195
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 125/208 (60%), Gaps = 18/208 (8%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+P + NF T +C +I +A+ L+ S +A DN G IRTSSG+F+
Sbjct: 27 IPGLFLYKNFLTDAECDHLIFLARDKLQKSMVA-------DNESGKSVMSEIRTSSGMFL 79
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ A+DE + +E++IA T LP NGEA +L Y++GQKY H+D F +
Sbjct: 80 NKAQDE--IVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGH 137
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLP 255
R+A+ L+YL+D+ +GGET+FP ++ D + +C G VKP +GD LLF+SL P
Sbjct: 138 RIATVLMYLSDVVKGGETVFPNAETKDSQPKDDSWSECAKGGYSVKPNKGDALLFFSLRP 197
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+ T D +S+HGSCPV++GEKW ATKWI
Sbjct: 198 DATTDQSSLHGSCPVIEGEKWSATKWIH 225
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 18/199 (9%)
Query: 94 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGT 147
F + E+C +I + K L S +A DN G IRTSSG+F+ +DE T
Sbjct: 3 FLSHEECDHLIALGKDKLEKSMVA-------DNESGKSVMSEIRTSSGMFLERRQDE--T 53
Query: 148 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYL 207
+ IE++IA T LP NGE IL Y+ GQKY++HYD F + R+A+ L+YL
Sbjct: 54 ITRIEKRIAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYL 113
Query: 208 TDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+D+++GGET+FP G D ++ G VKPR+GD LLF+S PN T DP S+
Sbjct: 114 SDVKKGGETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSL 173
Query: 265 HGSCPVVKGEKWVATKWIR 283
H SCPV++GEKW AT+WI
Sbjct: 174 HASCPVIEGEKWSATRWIH 192
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 128/214 (59%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C ++++AK +L+ S +A DN G +RTSSG
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVA-------DNDSGESKFSEVRTSSG 92
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 93 TFISKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVR 150
Query: 196 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKPR+GD LL
Sbjct: 151 GGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALL 210
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L P+ DP S+HG CPV++GEKW ATKWI
Sbjct: 211 FFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 128/214 (59%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C ++++AK +L+ S +A DN G +RTSSG
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVA-------DNDSGESKFSEVRTSSG 92
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 93 TFISKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVR 150
Query: 196 KSQRVASFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKPR+GD LL
Sbjct: 151 GGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALL 210
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L P+ DP S+HG CPV++GEKW ATKWI
Sbjct: 211 FFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 135/230 (58%), Gaps = 5/230 (2%)
Query: 57 VKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL 116
++D +W A + N+ +V+SW PR + +F +PE+C+ + +A+ L+ ST+
Sbjct: 55 LRDVSRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTV 114
Query: 117 A-LRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYK 175
++ G+ V + +RTSSG+F++ E + + IE++IA + +P NGE +LRY+
Sbjct: 115 VDVKTGKGVKSD--VRTSSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYE 172
Query: 176 IGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC 235
Q Y H+D F + QRVA+ L+YLTD EGGET FP + D + +
Sbjct: 173 PKQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAG--DGDCTCGGKIM 230
Query: 236 IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G+ VKP +GD +LF+S+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 231 KGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 93/270 (34%), Positives = 143/270 (52%), Gaps = 23/270 (8%)
Query: 23 FLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVL 82
F + L L +G + SQ + R ++ + +W +A + + +VL
Sbjct: 9 FFSVLLGILFVAMGLAAFSQLLFLGR-----IDDPGSDSRWKDDKEARLLRIGLVKPEVL 63
Query: 83 SWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG---------IRTS 133
+W PR + F + E+C +I +A L ST+ VD + G +RTS
Sbjct: 64 NWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTV-------VDTSTGKARHGIESKVRTS 116
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
+G+F+S + + IE +IA +M+P NGE +LRY+ Q Y H+D F Q
Sbjct: 117 TGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 176
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
+ QRVA+ L+YL+D+EEGGET+FP + + +K GL VKPR+GD +LF+S
Sbjct: 177 KRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILFWSA 234
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G +D S+HG C V++GEKW ATKW+R
Sbjct: 235 ALDGNVDSNSLHGGCSVLRGEKWSATKWLR 264
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 88/220 (40%), Positives = 131/220 (59%), Gaps = 18/220 (8%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P + S P+A NF +P++C ++ +AK L PST+ G +V + IRTS+G+F
Sbjct: 82 PLVLSSKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGDGGSSVASE--IRTSAGMF 139
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK 196
+ ++D+ T+ IEE+IA+++ +P NGE ILRY GQKY+ H+D F D P++
Sbjct: 140 LRKSQDD--TVREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHDKVNPAPKR 197
Query: 197 S-QRVASFLVYLTDLEEGGETMFP-------FENGMNAD---GSYDYQKCI--GLKVKPR 243
QRVA+ L+YL D EEGGET FP FE + + C G+ VK
Sbjct: 198 GGQRVATVLIYLVDTEEGGETTFPNGRLPENFEEDEPDNPFAAHIKHTDCAKNGIPVKSV 257
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+GD +LF+S+ +G +D S+HG+CPV+ G+KW A KW+R
Sbjct: 258 RGDAILFFSMTKDGELDHGSLHGACPVIAGQKWTAVKWLR 297
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 88/213 (41%), Positives = 123/213 (57%), Gaps = 15/213 (7%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
+SW PRA + + E+C+ I+ +AK ++ ST+ GE T IRTS F+
Sbjct: 83 ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEI--KTDPIRTSKQTFL-- 138
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQ 195
A + + +EE++++ TMLP NGE IL Y +G+KY++H+D + Q+
Sbjct: 139 ARGKYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSAD 198
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---CI--GLKVKPRQGDGLLF 250
QRVA+ L+YL D EEGGET FP + + Y QK C G+ KP++GDGLLF
Sbjct: 199 GGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGLLF 258
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+S+ P G ID S+H CPVVKG KW ATKWI
Sbjct: 259 FSITPEGDIDQKSMHAGCPVVKGTKWTATKWIH 291
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 130/232 (56%), Gaps = 24/232 (10%)
Query: 67 GQAGDDSVTNIP------FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
G+ G D V P + +SW PR + F + +C ++ +AK ++ S +A
Sbjct: 26 GEDGGDVVAPAPPFNSSRVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVA--- 82
Query: 121 GETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRY 174
DN G +RTSSG+F++ +D + IEE+IA T LP+ N E ILRY
Sbjct: 83 ----DNQSGKSVMSEVRTSSGMFLNKRQDP--VVSRIEERIAAWTFLPQENAENMQILRY 136
Query: 175 KIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQ 233
+ GQKY H+D F + + R A+ L+YL+ +++GGET+FP G + D +
Sbjct: 137 EHGQKYEPHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFS 196
Query: 234 KCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+C GL VKP +GD +LF+SL +G DP S+HGSCPV++GEKW A KWI
Sbjct: 197 ECAHQGLAVKPVKGDAVLFFSLHVDGVPDPLSLHGSCPVIQGEKWSAPKWIH 248
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 134/230 (58%), Gaps = 5/230 (2%)
Query: 57 VKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL 116
++D +W A + N+ +V+SW PR + +F +PE+C+ + +A+ L+ ST+
Sbjct: 55 LRDVSRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTV 114
Query: 117 A-LRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYK 175
++ G+ V + +RTSSG+F++ E + IE++IA + +P NGE +LRY+
Sbjct: 115 VDVKTGKGVKSD--VRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYE 172
Query: 176 IGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC 235
Q Y H+D F + QRVA+ L+YLTD EGGET FP + D + +
Sbjct: 173 PQQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAG--DGDCTCGGKIM 230
Query: 236 IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G+ VKP +GD +LF+S+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 231 KGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 130/220 (59%), Gaps = 18/220 (8%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P + + P+A NF + E+C ++ +AK L PST+ G++V + IRTS+G+F
Sbjct: 41 PLVLSATQPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSD--IRTSAGMF 98
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK 196
+ +D+ + IEE+IA+++ P NGE ILRY +GQKY+ H+D F D P++
Sbjct: 99 LRKGQDK--IVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPAPKR 156
Query: 197 S-QRVASFLVYLTDLEEGGETMFP---FENGMNAD-------GSYDYQKCI--GLKVKPR 243
QR+A+ L+YL D ++GGET FP AD ++ C G+ VK
Sbjct: 157 GGQRLATMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGIPVKSV 216
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+GD +LF+S+ +G +D S+HG+CPV++G+KW A KWIR
Sbjct: 217 RGDAILFFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIR 256
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 121/212 (57%), Gaps = 19/212 (8%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C ++ +A+ L+K VD G +RTSSG
Sbjct: 41 VSWRPRAFLYKGFLTEAECDHLVALAEEG------GLQKSMVVDRQTGKSVMSEVRTSSG 94
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F++ +D+ + IE +IA T+LP+ NGE+ +LRY+ GQKY H D G
Sbjct: 95 TFLAKKQDQ--VVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHH 152
Query: 196 K--SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ-KCI--GLKVKPRQGDGLLF 250
RVA+ L+YL+D++ GGET+FP + D Q +C G VKP +GD +LF
Sbjct: 153 SRGGHRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLF 212
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
+SL PNGT D S+HG CPV++GEKW ATKWI
Sbjct: 213 FSLHPNGTTDRDSLHGGCPVIEGEKWSATKWI 244
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 123/209 (58%), Gaps = 11/209 (5%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PRA + F + +C +I++AK L S +A G++V + +RTSSG+F+
Sbjct: 35 LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 92
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
A+DE + +E +IA T+LP NGE+ IL Y+ GQKY H+D F + R+
Sbjct: 93 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 150
Query: 201 ASFLVYLTDLEEGGETMFPF------ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
A+ L+YL+++E+GGET+FP D S+ G VK ++GD LLF+SL
Sbjct: 151 ATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKGDALLFFSLN 210
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+ T D S+HGSCPV+ GEKW ATKWI
Sbjct: 211 LDATTDERSLHGSCPVIAGEKWSATKWIH 239
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 117/210 (55%), Gaps = 8/210 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PR + F + E+C I +AK L S +A GE+V++ +
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFI 118
Query: 141 AEDESGTLDLI----EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
A +S +D I E K+A T LP NGE+ IL Y+ GQKY H+D F Q
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 178
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNA---DGSYDYQKCIGLKVKPRQGDGLLFYSL 253
R+A+ L+YL+++E+GGET+FP G D S+ G VKPR+GD LLF++L
Sbjct: 179 GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNL 238
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
PN T D S+HGSCPVV+GEKW AT+WI
Sbjct: 239 HPNATTDSNSLHGSCPVVEGEKWSATRWIH 268
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C ++++AK +L+ S +A DN G +RTSSG
Sbjct: 40 VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVA-------DNDSGESKFSEVRTSSG 92
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FI +D + IE+KI+ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 93 TFIPKGKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVR 150
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFEN----GMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKPR+GD LL
Sbjct: 151 GGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALL 210
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L P+ DP S+HG CPV++GEKW ATKWI
Sbjct: 211 FFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIH 244
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 84/213 (39%), Positives = 123/213 (57%), Gaps = 18/213 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG---------I 130
+VL+W PR F + E+C +I +A L ST+ VD + G +
Sbjct: 60 EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTV-------VDTSTGKARHGIESKV 112
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTS+G+F+S + ++ IE +IA +M+P NGE +LRY+ Q Y H+D F Q
Sbjct: 113 RTSTGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQ 172
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ L+YL+D+EEGGET+FP + + +K GL VKPR+GD +LF
Sbjct: 173 FNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILF 230
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+S +G +D S+HG C V++GEKW ATKW+R
Sbjct: 231 WSAALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 151
Query: 196 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+++ +GGET+FP F ++ D C G+ VKP++G+ LL
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 211
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 212 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 39 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 91
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 92 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 149
Query: 196 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+++ +GGET+FP F ++ D C G+ VKP++G+ LL
Sbjct: 150 GGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 209
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 210 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 243
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/271 (36%), Positives = 152/271 (56%), Gaps = 11/271 (4%)
Query: 1 MKGGKSNKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARPSARVVESVKDE 60
M +S++ + S+ L F L F + LL +LS A S++ +
Sbjct: 1 MARPRSHRPSARKSSRSTLVFAVLIMSTFVILILLAFGILSVPSNNA-GSSKANDLTSIV 59
Query: 61 YKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK 120
K + G DDS +++SW PRA + NF T E+CK +I +AK ++ ST+ K
Sbjct: 60 RKTLQRG-VEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEK 118
Query: 121 -GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 179
G++ D+ +RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQK
Sbjct: 119 TGKSTDSR--VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQK 174
Query: 180 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC-- 235
Y HYD F + QR+A+ L+YL+D+EEGGET+FP G + + + +C
Sbjct: 175 YEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGK 234
Query: 236 IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
GL VKP+ GD LLF+S+ P+ T+DP+S+HG
Sbjct: 235 GGLSVKPKMGDALLFWSMTPDATLDPSSLHG 265
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 124/216 (57%), Gaps = 25/216 (11%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + F T +C +I++AK L+ S +A DN G +RTSSG
Sbjct: 39 VSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVA-------DNLPGDSKLSEVRTSSG 91
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+FIS +D + IE+KI+ T LP+ NGE +LRY+ GQKY+ HYD F + +
Sbjct: 92 MFISKKKDP--IVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVR 149
Query: 196 KSQRVASFLVYLTDLEEGGETMFPF------ENGMNADGSYDYQKCI--GLKVKPRQGDG 247
R+A+ L+YLT++ GGET+FP G+ + D +C G+ VKPR+GD
Sbjct: 150 GGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNS--DLSECAKKGIAVKPRRGDA 207
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LLF+SL D S+H CPV++GEKW ATKWI
Sbjct: 208 LLFFSLHTTAIPDTDSLHAGCPVIEGEKWSATKWIH 243
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 151
Query: 196 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+++ +GGET+FP F ++ D C G+ VKP++G+ LL
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 211
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 212 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/247 (38%), Positives = 135/247 (54%), Gaps = 23/247 (9%)
Query: 49 PSARVVESV-KDEYKWMPHGQAGDDSVTNIP---FQVLSWMPRALYFPNFATPEQCKSII 104
P + VE V +D W+ G+ T++P VLSW PR + T E+C +I
Sbjct: 18 PLSMAVEDVGRDLIGWL--GETYRSGRTDVPDSRMVVLSWQPRVFLYKGILTQEECDYLI 75
Query: 105 NMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKV 158
+A+ L S ++ D T G IRTSSG+F + E++ + IE ++A
Sbjct: 76 KIAQGRLERSGVS-------DATTGEGGVSDIRTSSGMFYTRGEND--VVKRIETRLAMW 126
Query: 159 TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMF 218
TMLP NGE +LRY+ QKY+ H+D F + R+A+ L+YL EEGGET+F
Sbjct: 127 TMLPVENGEGIQVLRYEKTQKYDPHHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVF 186
Query: 219 PFENGMNADGSYDYQKC--IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKW 276
P ++ +C GL VKP +GD +LF+S+ P+G +P S+HGSCPV++G KW
Sbjct: 187 PKIPVPAGQTRANFSECGMKGLAVKPVKGDAVLFWSIRPDGRFEPGSLHGSCPVIRGVKW 246
Query: 277 VATKWIR 283
ATKWI
Sbjct: 247 SATKWIH 253
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 126/201 (62%), Gaps = 9/201 (4%)
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQG 129
DDS +++SW PRA + NF T E+CK +I +AK ++ ST+ K G++ D+
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 127
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
+RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQKY HYD F
Sbjct: 128 VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 185
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKVKPRQG 245
+ QR+A+ L+YL+D+EEGGET+FP G + + + +C GL VKP+ G
Sbjct: 186 EYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMG 245
Query: 246 DGLLFYSLLPNGTIDPTSIHG 266
D LLF+S+ P+ T+DP+S+HG
Sbjct: 246 DALLFWSMTPDATLDPSSLHG 266
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 120/206 (58%), Gaps = 22/206 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
+S PR + +F + ++ +I++A+ L+ S +A DN G T S
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLS------- 99
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
E ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F + R A
Sbjct: 100 --EDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYA 157
Query: 202 SFLVYLTDLEEGGETMFPF----ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
+ L+YLTD+ EGGET+FP ++ +A S QK G+ V+PR+GD LLF++L P+G
Sbjct: 158 TVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK--GIAVRPRKGDALLFFNLNPDG 215
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HG CPV+KGEKW ATKWIR
Sbjct: 216 TTDSVSLHGGCPVIKGEKWSATKWIR 241
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/214 (39%), Positives = 127/214 (59%), Gaps = 21/214 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIAR 151
Query: 196 KSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLL 249
R+A+ L+YL+++ +GGET+FP + ++ D C G+ VKP++G+ LL
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALL 211
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F++L + DP S+HG CPV++GEKW ATKWI
Sbjct: 212 FFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIH 245
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 129/211 (61%), Gaps = 13/211 (6%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
++LS +PRA + NF + E+C+ +IN+AK R + GE +++ RTSSG+F+
Sbjct: 113 EILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSS--RTSSGMFL 170
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+D+ + IE +IA +T +P NGE +++ Y +GQK HYD
Sbjct: 171 DRGKDK--IVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGP 228
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY-DYQKCIG--LKVKPRQGDGLLFYSLLP 255
RVA+ L+YL+D+EEGGET+FP +A ++ KC G L VKP+ GD LLF+S+ P
Sbjct: 229 RVATVLMYLSDVEEGGETVFP-----DAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKP 283
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+GT+D +S+HG PV++G KW +TKW+ +E
Sbjct: 284 DGTLDTSSLHGGSPVIRGNKWASTKWLHLRE 314
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 73/200 (36%), Positives = 112/200 (56%), Gaps = 34/200 (17%)
Query: 94 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGT 147
F + E+C+ +IN+AK + S + VD G RTSSG F+ +D+
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLV-------VDGLTGKGRESSARTSSGRFLERGKDK--I 422
Query: 148 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYL 207
+ IE++IA +T +PR+ A + + + G + GP RVA+ L+YL
Sbjct: 423 VQNIEQRIADITSIPRM---ARDFMLFTAG--------GVVTKNGGP----RVATVLMYL 467
Query: 208 TDLEEGGETMFP-FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
+D+EEGGET+FP + +N+ Y + GL VKP+ GD LLF S+ P+GT+D +S+HG
Sbjct: 468 SDVEEGGETVFPNAKPNINSVSKYPEK---GLSVKPKMGDALLFRSMKPDGTLDTSSLHG 524
Query: 267 SCPVVKGEKWVATKWIRDQE 286
PV++G KW +TKW+ E
Sbjct: 525 GSPVIRGNKWASTKWLHLTE 544
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 80/210 (38%), Positives = 129/210 (61%), Gaps = 23/210 (10%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C+ +I++AK +++ ST+ VD + G +RTS
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDASTGGSKDSRVRTS 163
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ +D+ + IE++I+ T +P NGE +L Y++GQKY H+D F +
Sbjct: 164 SGMFLGRGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNT 221
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP ++ + + +C GL VKP+ GD LL
Sbjct: 222 KNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALL 281
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVAT 279
F+S+ P+G++D TS+HG P++ W+ T
Sbjct: 282 FWSMRPDGSLDATSLHGEIPIL----WLLT 307
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 88/224 (39%), Positives = 128/224 (57%), Gaps = 26/224 (11%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKL-NLRPSTLAL-RKGETVDNTQGIRTSSGVFIS 139
+SW PRA + F + +C +I++AK + ST+ GE+V T +RTSSG+F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESV--TSKVRTSSGMFLD 102
Query: 140 AAEDESGTLDLIEEKIAKVTMLP-----------------RINGEAFNILRYKIGQKYNS 182
+DE + IEE+IA TMLP NGE+ ILRY G+KY
Sbjct: 103 KKQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEP 160
Query: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCI--GLK 239
H+D ++ ++ RVA+ L+YL++++ GGET+FP E ++ + C G
Sbjct: 161 HFDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFA 220
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKP +G +LF+SL PN T+D S+HGSCPV++GEKW ATKWI
Sbjct: 221 VKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWIH 264
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 130/220 (59%), Gaps = 9/220 (4%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLN-LRPSTLALRKGETV 124
H G+++ +V+SW PRA + NF T E+C+ +IN A N L+ + GE +
Sbjct: 70 HVSEGENNRVKRWVEVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGI 129
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
+ + RTS+ + +D+ + IE++IA VT +P +GE +++RY +GQ Y H
Sbjct: 130 ETSY--RTSTEYVVERGKDK--IVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHV 185
Query: 185 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKC--IGLKV 240
D F+ + QR+A+ L+YL+++E GGET+FP N + + + +C GL +
Sbjct: 186 DYFEEEFSLVNGGQRIATMLMYLSNVEGGGETVFPIANANFSSVPWWNELSECGQTGLSI 245
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATK 280
KP+ GD LLF+S+ P+ T+DP ++H +CPV+KG KW TK
Sbjct: 246 KPKMGDALLFWSMKPDATLDPLTLHRACPVIKGNKWSCTK 285
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 122/210 (58%), Gaps = 12/210 (5%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+ LSW P A + F T +C+ I +A L+PST+ + D + IRTSSG+F+
Sbjct: 26 IERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTV-VDASTGGDASSEIRTSSGMFL 84
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEYG 193
AED+ ++ IE +IA T +P +GE F +LRY+ Q+Y +HYD F +E G
Sbjct: 85 GRAEDD--VIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKFNVKREKG 142
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFP-FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
QR+ + L+YL+D+EEGGET+FP FE+G A L V+PR+GD L F S
Sbjct: 143 ---GQRMGTVLMYLSDVEEGGETVFPKFEDGTPAGSEASECARNKLAVRPRKGDALFFRS 199
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
L +G D S H CPV++G K+ ATKW+
Sbjct: 200 LRHDGVPDTFSEHAGCPVIRGVKFSATKWM 229
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 122/225 (54%), Gaps = 21/225 (9%)
Query: 69 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 128
AG+ + VLSW PR + T E+C +++ ++ L S ++ D T
Sbjct: 57 AGEHRAQDSRMVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVS-------DATT 109
Query: 129 G------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 182
G IRTSSG+F E E + IE ++A TMLP NGE +LRY+ QKY+
Sbjct: 110 GAGAVSDIRTSSGMFYERGETE--LVKRIENRLAMWTMLPVENGEGIQVLRYEKTQKYDP 167
Query: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG----MNADGSYDYQKCIGL 238
H+D F R+A+ L+YL EEGGET+FP G + S ++ GL
Sbjct: 168 HHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFPKVVGWVVQLTTTASAPCRQ--GL 225
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKP +GD +LF+S+ P+G DP S+HGSCPV+KG KW ATKWI
Sbjct: 226 AVKPAKGDAVLFWSIRPDGRFDPGSLHGSCPVIKGVKWSATKWIH 270
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 123/206 (59%), Gaps = 11/206 (5%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+ LSW PR + F + ++C ++ +AK + + +A + T RTSSG+F+
Sbjct: 49 KALSWQPRIFVYKGFLSDDECDHLVTLAK---KGTMVAHNRSSYYRQT---RTSSGMFLR 102
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
+D + IEE+IA T+LPR N E I RY+ GQKY+ H+D FD + + + R
Sbjct: 103 KRQDP--VVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHTRGGPR 160
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPN 256
A+ L+YL+ +++GGET+FP G + D + +C GL VKP +GD +LF+SL +
Sbjct: 161 YATVLMYLSTVDKGGETVFPKAKGWESQPKDDTFSECAHKGLAVKPVKGDAVLFFSLHVD 220
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWI 282
G DP ++HGSCPV++GEKW A WI
Sbjct: 221 GGPDPLTLHGSCPVIQGEKWSAPNWI 246
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 123/208 (59%), Gaps = 30/208 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PRA + F T +C +I++AK NL+ S +A DN G +RTSSG
Sbjct: 41 VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVA-------DNDNGESQVSDVRTSSG 93
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
FIS +D + IE+K++ T LP+ NGE +LRY+ GQKY++H+D F + +
Sbjct: 94 TFISKGKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIAR 151
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R+A+ L+YL+++ +GGET+FP D Q C+ KP++G+ LLF++L
Sbjct: 152 GGHRIATVLLYLSNVTKGGETVFP-----------DAQVCL----KPKKGNALLFFNLQQ 196
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+ DP S+HG CPV++GEKW ATKWI
Sbjct: 197 DAIPDPFSLHGGCPVIEGEKWSATKWIH 224
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 77/189 (40%), Positives = 121/189 (64%), Gaps = 9/189 (4%)
Query: 103 IINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 161
+IN+AK ++ + S + + G++V + +RTSSG+F+ +D+ + IE++IA +
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSR--VRTSSGMFLKRGKDK--VIQTIEKRIADFAFI 56
Query: 162 PRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE 221
P NGE +L Y++GQKY HYD F + QR+A+ L+YL+D+EEGGET+FP
Sbjct: 57 PVENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAA 116
Query: 222 NGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 277
+ + D C GL VKP++GD LLF+S+ P+ T+DP+S+HG CPV++G KW
Sbjct: 117 KANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWS 176
Query: 278 ATKWIRDQE 286
+TKW+ +E
Sbjct: 177 STKWMHLEE 185
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 128/230 (55%), Gaps = 23/230 (10%)
Query: 69 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVD 125
A D + + +V+SW PR + F NF + E+C + +A+ L ST+ A KG D
Sbjct: 50 AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD 109
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+RTSSG+F+++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D
Sbjct: 110 ----VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHD 165
Query: 186 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-------GL 238
F + QRVA+ L+YLTD EGGET FP DG +CI GL
Sbjct: 166 YFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP----QAGDG-----ECICGGRLVRGL 216
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
VKP +GD +LF+S+ +G D S+H C VVKGEKW ATKW+R + +
Sbjct: 217 CVKPNKGDAVLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKWMRQKMTF 266
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 123/212 (58%), Gaps = 9/212 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 136
+V+SW PR + F NF + E+C + +A+ L ST+ A KG D +RTSSG+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSD----VRTSSGM 116
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F+++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F +
Sbjct: 117 FVNSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRG 176
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
QRVA+ L+YLTD EGGET FP + + S + GL VKP +GD +LF+S+ +
Sbjct: 177 GQRVATMLMYLTDGVEGGETHFP--QAGDGECSCGGRIVRGLCVKPNKGDAVLFWSMGLD 234
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
G D SIH C V+KGEKW ATKW+R + +
Sbjct: 235 GNTDSNSIHSGCAVLKGEKWSATKWMRQKMTF 266
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 124/208 (59%), Gaps = 11/208 (5%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LS P+A + F +C I AK L ST+ K G++V + IRTS G+F
Sbjct: 8 LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSN--IRTSDGMFFDR 65
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS--- 197
ED+ ++ IE +IA+ T +P NGE +LRY++GQKY H DAF ++ ++S
Sbjct: 66 HEDD--IIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAF-SDKFNTEESKGG 122
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLP 255
QR+A+ L+YL+D+EEGGET+FP G + +C G+ VK R+GD LLF+SL
Sbjct: 123 QRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECAQRGVAVKARKGDALLFWSLDI 182
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+ +D S+HG CPV+KG KW ATKW+
Sbjct: 183 DSNVDELSLHGGCPVIKGTKWSATKWMH 210
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 95/271 (35%), Positives = 149/271 (54%), Gaps = 17/271 (6%)
Query: 28 FFFLAGLLGSSLLSQDVTAARPSARVVESVK-DEYKWMPHGQAGDD-----SVTNIPF-- 79
FF LA +L L PS + V ++ D Y +P G + D +V I +
Sbjct: 24 FFQLAFILK---LEDSYGTKFPSFKRVRKLQSDAYLQLPRGISHWDNDTEAAVLRIGYVK 80
Query: 80 -QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVF 137
+++SW PR + +F + E+C + +AK LR ST+ ++ G+ +++ +RTSSG+F
Sbjct: 81 PEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESK--VRTSSGMF 138
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
+S+ E + IE++I+ + +P NGE +LRY+ Q Y H+D F +
Sbjct: 139 LSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYFSDTFNLKRGG 198
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ L+YL+D EGGET FP + S + GL VKP +G+ +LF+S+ +G
Sbjct: 199 QRVATMLMYLSDNVEGGETYFPMAG--SGKCSCGGKVVDGLSVKPIKGNAVLFWSMGLDG 256
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
DP+SIHG C V+ G KW ATKW+R + +
Sbjct: 257 QSDPSSIHGGCEVLSGVKWSATKWMRQRATF 287
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/217 (39%), Positives = 125/217 (57%), Gaps = 16/217 (7%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+V++W PRAL F + +C II +A +L ST+ +G ++ + IRTSSG+FI
Sbjct: 42 VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLDE--IRTSSGMFI 99
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D + +EE++A +T LP + E +LRY++GQKY++H+D D E Q
Sbjct: 100 LKGHD--AVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRA 157
Query: 199 -------RVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCIGLKVKPRQGDG 247
R A+ L+YL+D+EEGGET FP + G+ A Y G+ VKPR+GD
Sbjct: 158 KGVLGGLRTATLLMYLSDVEEGGETAFPHGRWLDEGVQAAPPYTECASKGVVVKPRKGDA 217
Query: 248 LLFYSLLPNG-TIDPTSIHGSCPVVKGEKWVATKWIR 283
+LF+SL NG D S+H CPVV+G K+ ATKW+
Sbjct: 218 ILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWVH 254
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 127/226 (56%), Gaps = 16/226 (7%)
Query: 69 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 128
A S + + LS P + F + E+C I +A ++ S+++L+ + ++
Sbjct: 251 ASSKSQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSS 310
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 188
RTS F+SA +DE L I+ ++A +T +PR + E +LRY G+KY+SH+D FD
Sbjct: 311 EWRTSQSAFLSARDDE--VLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFD 368
Query: 189 PQEYGPQKS----------QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC-IG 237
P Y KS R A+ YLTD+ +GGET+FP G A S ++ C IG
Sbjct: 369 PSAYRSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGAPAPRS--HKDCSIG 426
Query: 238 LKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGE-KWVATKWI 282
LKVKP++G ++FYSL +G +DP S+HG+CPV + KW A KWI
Sbjct: 427 LKVKPQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/194 (41%), Positives = 117/194 (60%), Gaps = 21/194 (10%)
Query: 94 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 153
T E+C+ +IN+AK ++ ST+ G++VDN+ RTSSG FI+ D+ L IE+
Sbjct: 12 ITTKEECEHLINIAKPSMHKSTVDDETGKSVDNSA--RTSSGTFINRGHDK--ILRNIEQ 67
Query: 154 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEG 213
+IA T +P NGE+ NIL Y++GQKY H D F + + E+G
Sbjct: 68 RIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTDE-------------INTKNGGEQG 114
Query: 214 GETMFPFENGMNADGSY--DYQKC--IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 269
GET+FPF G + + + C GL +KP+ GD LLF+S+ P+GT+DP S+HG+CP
Sbjct: 115 GETVFPFAEGNFSSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACP 174
Query: 270 VVKGEKWVATKWIR 283
V+KG+KW TKW+R
Sbjct: 175 VIKGDKWSCTKWMR 188
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 123/220 (55%), Gaps = 19/220 (8%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKG----ETVDNTQGIRTSS 134
+++SW PRAL F +C +I++A+ L PS + R G ++V QG+ +SS
Sbjct: 15 IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGL-SSS 73
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE--- 191
G F++ +D + +E++I T LP + E +L+Y++GQKY++HYD E
Sbjct: 74 GTFLTKRQDS--VVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQ 131
Query: 192 ----YGPQKSQRVASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCIGLKVKPR 243
G Q R A+ L+YL+D+EEGGET FP + G A Y G+ VKPR
Sbjct: 132 LAIRRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEGAQAQPPYSECGSRGVAVKPR 191
Query: 244 QGDGLLFYSLLPNG-TIDPTSIHGSCPVVKGEKWVATKWI 282
+GD +LFYSL +G + D S+H CPV KG K+ AT WI
Sbjct: 192 KGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWI 231
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/212 (36%), Positives = 124/212 (58%), Gaps = 9/212 (4%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
+ + L +PRA + NF + ++C +IN+AK + S ++ + RTSSG
Sbjct: 64 TLDYTSLHAVPRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSG 123
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F++ +++ + IE++IA+ T +P NGE +IL Y++GQK+ H+D P + +
Sbjct: 124 RFLAKGQNQ--LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFK 181
Query: 196 K-SQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKCIGLKVKPRQGDGL 248
QR A+ ++YL+ ++EGG T+FP + +Y K GL VKP+ GD L
Sbjct: 182 SLGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDAL 241
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATK 280
LF+S+ P+GT+DPTS+H S PVVKG+KWV K
Sbjct: 242 LFWSVKPDGTLDPTSLHASSPVVKGDKWVGVK 273
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 8/66 (12%)
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCI------GLKVKPRQGDGLLFYSLLPNGTID 260
+ ++EEGGET+FP N S + K + GL +KP+ GD L F+S+ P+GT+D
Sbjct: 9 ILNIEEGGETVFPAAN--KCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLD 66
Query: 261 PTSIHG 266
TS+H
Sbjct: 67 YTSLHA 72
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/178 (43%), Positives = 110/178 (61%), Gaps = 14/178 (7%)
Query: 120 KGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 173
K VD+T G +RTSSG+F+ D+ + IE++IA T +P +GE +L
Sbjct: 3 KSTVVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLH 60
Query: 174 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 233
Y++GQKY H+D F + QR+A+ L+YL+D+EEGGET+FP N +NA Y
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDAN-VNASSLPWYN 119
Query: 234 K---CI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+ C GL VKP+ GD LLF+S+ P+ T+DP S+HG CPV+KG KW +TKW+ E
Sbjct: 120 ELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 177
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 77/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PRA + NF + ++C +IN+AK + S ++ + RTSSG F++ +++
Sbjct: 83 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 141
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-SQRVASFL 204
+ IE++IA+ T +P NGE +IL Y++GQK+ H+D P + + QR A+ +
Sbjct: 142 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 200
Query: 205 VYLTDLEEGGETMFPFENGMNADGSY------DYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
+YL+ ++EGG T+FP + +Y K GL VKP+ GD LLF+S+ P+GT
Sbjct: 201 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 260
Query: 259 IDPTSIHGSCPVVKGEKWVATK 280
+DPTS+H S PVVKG+KWV K
Sbjct: 261 LDPTSLHASSPVVKGDKWVGVK 282
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 8/76 (10%)
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCI------GLKVKPRQGDGLLFYSLLPNGTID 260
+ ++EEGGET+FP N S + K + GL +KP+ GD L F+S+ P+GT+D
Sbjct: 9 ILNIEEGGETVFPAAN--QCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLD 66
Query: 261 PTSIHGSCPVVKGEKW 276
TS+HGS PV++G++W
Sbjct: 67 YTSLHGSYPVIRGDEW 82
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 121/210 (57%), Gaps = 16/210 (7%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLS PRA + NF + E+C+ +IN+AK ++ S + VD G +RTS
Sbjct: 86 EVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLV-------VDGVTGQGILNSVRTS 138
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG F+ +D+ + +E +IA +T +P NGE I+ Y++GQK+ HYD
Sbjct: 139 SGTFLERGKDK--IVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRIT 196
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
RVA+ L+YL+D+EEGGET+FP N + Y GL VKP+ GD LLF+S+
Sbjct: 197 NNGGPRVATVLMYLSDVEEGGETVFP-NAKPNFNSVSKYHPGKGLVVKPKMGDALLFWSV 255
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+G++D S+HG PV++G KW + K +
Sbjct: 256 KPDGSLDTASLHGGSPVIRGSKWASNKLLH 285
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 90/215 (41%), Positives = 123/215 (57%), Gaps = 18/215 (8%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE-TVDNTQGIRTSSGVFI 138
Q + PRA YF NF T + ++ +A L+ ST+ GE VDN IRTS G+FI
Sbjct: 2 QQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGVVDN---IRTSYGMFI 58
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGPQKS 197
+D + IE++I+ T LP + E +LRY GQ Y +HYD+ D E GP+
Sbjct: 59 RRLQDP--VVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPK-- 114
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKCI--GLKVKPRQGDGLL 249
R+A+FL+YL+D+EEGGET FP N + AD S + C + KP+ GD +L
Sbjct: 115 WRLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVL 173
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
FYS PN T+DP ++H CPV+KG KW A W+ D
Sbjct: 174 FYSFYPNMTMDPAAMHTGCPVIKGVKWAAPVWMHD 208
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 110/177 (62%), Gaps = 12/177 (6%)
Query: 120 KGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 173
K VD+T G +RTSSG+F+ D+ + +IE++IA T +P +GE +L
Sbjct: 3 KSTVVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLH 60
Query: 174 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--D 231
Y++GQKY H+D F + QR+A+ L+YL+D+EEGGET+FP N + + +
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNE 120
Query: 232 YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+C GL VKP+ GD LLF+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 124/223 (55%), Gaps = 7/223 (3%)
Query: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG 121
W Q + +++ + +SW PRA + + ++C IIN A+ N+ + + L +
Sbjct: 37 WSESYQHKTPAKSSLMVERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTK 96
Query: 122 ETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 181
+ V N +R + +I + D+ +D IE +IA+ T LP +GE F+I++Y GQ Y
Sbjct: 97 KQVPNK--LRNNKEAYIDGSADD--VIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYA 152
Query: 182 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLK 239
H D D + ++R+A+ ++YL+D+ EGGET+FP G Y KC G+
Sbjct: 153 PHTDWLDDWWHPRLGNERIATMIIYLSDVVEGGETVFPNSTMQPHVGDAAYSKCAQQGIA 212
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
VKP +GD LL Y+LL NG D S+H CPV++G KW ATK I
Sbjct: 213 VKPVKGDALLLYNLLENGRNDGESLHQGCPVIRGVKWTATKRI 255
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 117/208 (56%), Gaps = 9/208 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGET-VDNTQGIRTSSGVF 137
QVL R + NF T E+C II +A+ + R + G++ +DN +RTS G F
Sbjct: 64 QVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGVVETDSGKSKIDN---VRTSKGTF 120
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
++ D + IE +IAK T++P NGE +L+Y+ GQ+Y HYD F +
Sbjct: 121 LNRGHDS--VIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANGG 178
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIG--LKVKPRQGDGLLFYSLLP 255
R + L+YL D+EEGGET FP N D ++ +C L KP++G+ +LF+S+ P
Sbjct: 179 NRYLTVLMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKP 238
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G ++ S+H +CPV+KG KW A KW+
Sbjct: 239 TGELERRSLHTACPVIKGVKWSAPKWVH 266
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 72/177 (40%), Positives = 110/177 (62%), Gaps = 12/177 (6%)
Query: 120 KGETVDNTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 173
K VD+T G +RTSSG+F+ D+ + +IE++I T +P +GE +L
Sbjct: 3 KSTVVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRITDYTFIPVDHGEGLQVLH 60
Query: 174 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--D 231
Y++GQKY H+D F + QR+A+ L++L+D+EEGGET+FP N ++ + +
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDANVNDSSLPWYNE 120
Query: 232 YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+C GL VKP+ GD LLF+S+ P+ T+DP S+HG CPV++G KW +TKW+ E
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 122/201 (60%), Gaps = 11/201 (5%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL--ALRKGETVDNTQGIRTSSGVFISAAED 143
PRA + NF + ++C+ ++ ++K L S + A G ++ +RTS+G FIS D
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQTGGSSLSE---VRTSTGTFISRKYD 321
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASF 203
+ + +EE+I + +P+ + EAF ILRY+ GQ+Y +H+D F ++ R+A+
Sbjct: 322 D--IIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYF--FHKSGMRNNRIATV 377
Query: 204 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDP 261
L+YL+D+EEGGET+FP + + Y +C G +K R+GD LLF+S+ P G +D
Sbjct: 378 LLYLSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKPGGELDA 437
Query: 262 TSIHGSCPVVKGEKWVATKWI 282
S H CPV+KGEKW ATKW+
Sbjct: 438 GSSHAGCPVIKGEKWTATKWM 458
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 8/202 (3%)
Query: 89 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 148
L F + ++C +I +A LR S++ K +++ RTS G F+ D +
Sbjct: 1 LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSRN-RTSWGAFLR--RDHDNIV 57
Query: 149 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLT 208
IE++I+ +T +P+ GE+ ++RYK GQK+ H D + E R+ + L+YLT
Sbjct: 58 SGIEDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLT 117
Query: 209 DLEEGGETMFP--FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
++E GGET+FP N +N D S + +C G+ ++PR+GDGLLF+ P+G IDP S
Sbjct: 118 NVENGGETVFPRALANVIN-DYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSF 176
Query: 265 HGSCPVVKGEKWVATKWIRDQE 286
HG CPVVKGEKW+ATK++ + E
Sbjct: 177 HGGCPVVKGEKWLATKFLHEHE 198
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 121/201 (60%), Gaps = 19/201 (9%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + E+C +I +AK ++ ST+ VD+T G +RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 250 FYSLLPNGTIDPTSIHGSCPV 270
F+S+ P+ T+DP S+H + V
Sbjct: 271 FWSMKPDATLDPLSLHDTLRV 291
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 108/166 (65%), Gaps = 7/166 (4%)
Query: 121 GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKY 180
GE+ D+ +RTSSG+F++ +D+ + +E K+A T LP NGEA IL Y+ GQKY
Sbjct: 9 GESEDSE--VRTSSGMFLTKRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKY 64
Query: 181 NSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--G 237
+ H+D F ++ R+A+ L+YL+++ +GGET+FP G D + KC G
Sbjct: 65 DPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQG 124
Query: 238 LKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKPR+GD LLF++L NGT DP S+HGSCPV++GEKW AT+WI
Sbjct: 125 YAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIH 170
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 77/196 (39%), Positives = 119/196 (60%), Gaps = 19/196 (9%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + E+C +I +AK ++ ST+ VD+T G +RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLL 249
QR+A+ L+YL+D+EEGGET+FP N ++ + + +C GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270
Query: 250 FYSLLPNGTIDPTSIH 265
F+S+ P+ T+DP S+H
Sbjct: 271 FWSMKPDATLDPLSLH 286
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 121/205 (59%), Gaps = 5/205 (2%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VLSW PR + NF + E+C + +A L+ ST+ G+ + + +RTSSG+F+
Sbjct: 76 EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSD--VRTSSGMFL 133
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
S E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+A+ L+YL D EGGET FP + + + S + GL VKP +G+ +LF+S+ +G
Sbjct: 194 RIATMLMYLGDNVEGGETHFP--SAGSDECSCGGKLTKGLCVKPVKGNAVLFWSMGLDGQ 251
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIR 283
DP S+HG CPV+ GEKW ATKW+R
Sbjct: 252 SDPDSVHGGCPVLAGEKWSATKWMR 276
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 114/202 (56%), Gaps = 14/202 (6%)
Query: 87 RALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
R + F TPE+C I A+ L R + G +V IRTS G+F ED
Sbjct: 44 RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSV--VSDIRTSDGMFFERGED-- 99
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
L+ +E+++A TM P GEA +LRY+ QKY+SH + F +E R A+ L
Sbjct: 100 AILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLT 159
Query: 206 YLTDLEEGGETMF---PFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTID 260
YLTD EEGGET+F P G+N + +C L VKPR+GD +LF+S+ NG ++
Sbjct: 160 YLTDTEEGGETVFPKIPAPGGVNV----GFSECAKYNLAVKPRKGDAILFHSMKTNGQLE 215
Query: 261 PTSIHGSCPVVKGEKWVATKWI 282
S+HG+CPV+KGEK+ TKWI
Sbjct: 216 ERSLHGACPVIKGEKFSMTKWI 237
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 88/253 (34%), Positives = 136/253 (53%), Gaps = 16/253 (6%)
Query: 47 ARPSARVVESVKDEYKWMPHGQAG------DDSVTNIPF---QVLSWMPRALYFPNFATP 97
A PS R V +++ HG D + I + ++L+W PR + +F +
Sbjct: 40 AFPSFRKVHKRQNDGSLQLHGGISQWVNDKDADILRIGYVKPEILNWSPRIILLHSFLSS 99
Query: 98 EQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIA 156
E+C + MA+ L+ ST+ + G+ + + +RTSSG+F+S + + IE++I+
Sbjct: 100 EECDYLRAMAEPLLQISTVVDAQTGKGIQSD--VRTSSGMFLSPDDSTYPIVRAIEKRIS 157
Query: 157 KVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGET 216
+ +P NGE +LRYK Q Y H+D F + QRVA+ L+YL+D EGGET
Sbjct: 158 VYSQVPVENGELIQVLRYKKSQFYKPHHDYFSDSFNLKRGGQRVATMLIYLSDNVEGGET 217
Query: 217 MFPFE-NGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEK 275
FP +G G + GL V P +G+ +LF+S+ +G DP SIHG C V+ GEK
Sbjct: 218 YFPMAGSGFCRCGGKSVR---GLSVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAGEK 274
Query: 276 WVATKWIRDQEQY 288
W ATKW+R + +
Sbjct: 275 WSATKWMRQRSTH 287
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 119/207 (57%), Gaps = 5/207 (2%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PR + NF + ++C + +A L ST+ + G+ V + RTSSG+F+
Sbjct: 83 EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSD--FRTSSGMFL 140
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
S E + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 141 SHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 200
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+A+ L+YL++ EGGET FP + + S + GL VKP +GD +LF+S+ +G
Sbjct: 201 RIATMLMYLSENIEGGETYFP--KAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQ 258
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQ 285
DP SIHG C V+ GEKW ATKW+R +
Sbjct: 259 SDPKSIHGGCEVLSGEKWSATKWMRQK 285
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/202 (39%), Positives = 115/202 (56%), Gaps = 12/202 (5%)
Query: 87 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESG 146
R + F TPE+C I A+ L S + + G IRTS G+F ED
Sbjct: 44 RIYLWKGFLTPEECDYIRMKAEKRLERSGV-VDTGSGGSVVSDIRTSDGMFFERGED--A 100
Query: 147 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVY 206
++ +E+++A TM P GE+ +LRY+ QKY+SH+D F ++ R A+ L+Y
Sbjct: 101 IIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRWATVLLY 160
Query: 207 LTDLEEGGETMF---PFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDP 261
LT+ EEGGET+F P NG+N + +C L VKP +GD LLF+S+ P G ++
Sbjct: 161 LTETEEGGETVFPKIPAPNGINV----GFSECAKYNLAVKPHKGDALLFHSMKPTGELEE 216
Query: 262 TSIHGSCPVVKGEKWVATKWIR 283
S+HG+CPV++GEK+ TKWI
Sbjct: 217 RSMHGACPVIRGEKFSMTKWIH 238
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/216 (35%), Positives = 114/216 (52%), Gaps = 14/216 (6%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ + LS +P L F + ++C I A+ ++ S + L + RTS
Sbjct: 5 VTLETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQSA 64
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
FI A +D L I+ + A + +PR + E +LRY + +KY+SH D FDP Y K
Sbjct: 65 FIRAHDD--AILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTKDK 122
Query: 197 S----------QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
R+A+ YL+D+E+GGET+FP NG D + GLKVKP +G
Sbjct: 123 RTLALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMKDCK--TGLKVKPEKGK 180
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
++FYS+ P+G +D S+HG+CPV KG KW A KW+
Sbjct: 181 VIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 122/208 (58%), Gaps = 17/208 (8%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE-TVDNTQGIRTSSGVFISAAEDE 144
PRA F NF T + ++ +A L+ ST+ KGE VDN IRTS G+FI D
Sbjct: 55 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSKGEGVVDN---IRTSFGMFIRRLSDP 111
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY-GPQKSQRVASF 203
+ IE++I+ T LP + E +LRY GQ Y +HYD+ ++ GP+ R+A+F
Sbjct: 112 --IIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVGPK--WRLATF 167
Query: 204 LVYLTDLEEGGETMFPFENGMNADGSYDYQ-----KCI--GLKVKPRQGDGLLFYSLLPN 256
L+YL+D+EEGGET FP +N + D + + +C + KP+ GD +LFYS LPN
Sbjct: 168 LMYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFLPN 226
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRD 284
T+DP ++H CPV+KG KW A W+ D
Sbjct: 227 NTMDPAAMHTGCPVIKGIKWAAPVWMHD 254
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 98/157 (62%), Gaps = 5/157 (3%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
+RTSSG+F+ +DE + IEE+IA T LP NGE+ IL Y+ G+KY HYD F
Sbjct: 15 VRTSSGMFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHD 72
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGD 246
+ R+A+ L+YL+D+ +GGET+FP G D + C G VKP +GD
Sbjct: 73 KNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGD 132
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LLF+SL P+ T D S+HGSCPV++G+KW ATKWI
Sbjct: 133 ALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIH 169
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 122/242 (50%), Gaps = 18/242 (7%)
Query: 53 VVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLR 112
V E + E + P G+ N+ + LS P L F E+C I+ A ++
Sbjct: 243 VSEGFRREIELAPSTITGNSK--NVTIETLSMKPLVLSISGFLADEECDYIMEKAAPTMK 300
Query: 113 PSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNIL 172
S ++L+ + RTS F++A D L IE + A +T +P + E +L
Sbjct: 301 YSGVSLKDADKGRPASDWRTSQSTFVAAMGDP--ILRDIELRTASLTRVPVTHQEFVQVL 358
Query: 173 RYKIGQKYNSHYDAFDPQEYGPQ----------KSQRVASFLVYLTDLEEGGETMFPFEN 222
RY + +KY++H+D FDP Y K R A+ YLTD+ GGET FP
Sbjct: 359 RYGVTEKYDAHHDFFDPSSYRSDPGTLQLIENGKKNRYATVFWYLTDVARGGETCFPRHG 418
Query: 223 GMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGE--KWVATK 280
G A D+ C GLKVKP++G ++FYSL +G +DP S+HG+CPV+ E KW A K
Sbjct: 419 G--APPPRDFSMCTGLKVKPQKGKVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANK 476
Query: 281 WI 282
W+
Sbjct: 477 WL 478
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 114/204 (55%), Gaps = 26/204 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
+SW PRA F NF T E+ I+ +AK ++ ST+ G +V++ IRTS G F+
Sbjct: 35 VSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVEDQ--IRTSYGTFLKRL 92
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+D + +E+++A T L + E ILRY IGQKY +HYD+ D S RV
Sbjct: 93 QDP--IVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYDSLD------NDSPRVC 144
Query: 202 SFLVYLTDL--EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+ L+YL+D+ + GGET FP G+ Y P++GD LLFYSL P+GT
Sbjct: 145 TVLLYLSDVPADGGGETAFP---GVRRQALY-----------PKKGDALLFYSLKPDGTS 190
Query: 260 DPTSIHGSCPVVKGEKWVATKWIR 283
D S+H CP++ G KW ATKWI
Sbjct: 191 DAYSLHTGCPIISGVKWTATKWIH 214
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 119/205 (58%), Gaps = 5/205 (2%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VL+W PR + NF + E+C + +A L S + + G+ + + +RTSSG+F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+A+ L+YL+D EGGET FP + + S + GL VKP +G+ +LF+S+ +G
Sbjct: 200 RIATMLMYLSDNIEGGETYFPLAG--SGECSCGGKLVKGLSVKPIKGNAVLFWSMGLDGQ 257
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIR 283
DP S+HG C V+ GEKW ATKW+R
Sbjct: 258 SDPNSVHGGCEVISGEKWSATKWMR 282
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 122/214 (57%), Gaps = 16/214 (7%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
Q + PRA YF NF T + ++ +A L+ ST+ KGE V + IRTS G+FI
Sbjct: 20 QQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKGEGVVDD--IRTSYGMFIR 77
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY-GPQKSQ 198
D + IE++I+ T LP + E ILRY GQ Y +HYD+ ++ GP+
Sbjct: 78 RLSDP--VVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVGPK--W 133
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKCI--GLKVKPRQGDGLLF 250
R+A+FL+YL+D+EEGGET FP N + AD S + C + KP+ GD +LF
Sbjct: 134 RLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLF 192
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
YS PN T+DP S+H CPV+KG KW A W+ D
Sbjct: 193 YSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMHD 226
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/218 (35%), Positives = 121/218 (55%), Gaps = 10/218 (4%)
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G +++ +I + LS P NF E+CK I A +++PS ++L +
Sbjct: 184 GLETLGSIDMKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPDTN 243
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS+ F+ + D L I+ ++ + T +P+ + E +L+Y GQ+Y +H+D D
Sbjct: 244 WRTSTTYFMPSTRDP--LLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDE 301
Query: 190 QEY----GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-GLKVKPRQ 244
+ G +K++ + F YL+D+EEGGET+FP G G D+ C GLKVKP +
Sbjct: 302 RTMRNMDGGRKNRMITVFW-YLSDVEEGGETIFPRYGGRT--GRVDFSDCTTGLKVKPVE 358
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
G +FYSL P+G D S+HG+CPV+ G+KW A KW+
Sbjct: 359 GKVAMFYSLKPDGQFDDFSLHGACPVITGQKWAANKWV 396
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 82/212 (38%), Positives = 114/212 (53%), Gaps = 23/212 (10%)
Query: 84 WM----PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
WM A F NF T E+C I+ +AK +L S + VD G IRTS
Sbjct: 33 WMQVLDAEARIFINFLTEEECDHIVALAKPHLERSGV-------VDTATGGSEISDIRTS 85
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
G+F+ D+ T+ IEE+IA+ T+LP NGE +L Y G+KY+ ++ FD
Sbjct: 86 KGMFLERGHDD--TVAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYDDYF--FDKVNGE 141
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFY 251
R A+ L+YL +EEGGET+FP D + +C L KP +G +LF+
Sbjct: 142 SNGGNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFH 201
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
S+ P+G ++ S+H +CPVVKGEKW A KWI
Sbjct: 202 SIKPSGDLERRSLHTACPVVKGEKWSAPKWIH 233
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 125/224 (55%), Gaps = 32/224 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+SW PRA + N T +C ++ +A+ +R ST+ VD+T G IRTS
Sbjct: 4 ISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTV-------VDSTTGESKVDPIRTSEQ 56
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFN-----ILRYKIGQKYNSHYDAFD-- 188
F++ + +IE+++ + TMLP NGE +L+Y GQKY++H+D +
Sbjct: 57 CFLNRGH--FPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELD 114
Query: 189 ---PQEYGPQKSQRVASFLVYLTDLEE--GGETMFPFENGMN--ADGSYDYQKCI--GLK 239
++ + RVA+ L+YL+D+++ GGET FP ++ AD + +C +
Sbjct: 115 TASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVA 174
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
VKP++GDGLLF+S+ P G ID S+H CPV+ G+ W ATKWI
Sbjct: 175 VKPKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWIH 217
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 79/217 (36%), Positives = 121/217 (55%), Gaps = 27/217 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + T QC++++ + +R ST+ VD+ G IRTS
Sbjct: 6 LSWYPRAFALRDALTEAQCEAVLRATRARVRRSTV-------VDSVTGESKVDPIRTSKQ 58
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-----AFDPQ 190
F++ E+ + I + ++ VTMLP + E +L Y++G+KY++H D + +
Sbjct: 59 TFLNRDEE---VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGR 115
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN---ADGSYDYQKCIGLKV--KPRQG 245
E +RVA+ L+YL + E GGET FP ++ A+G+ + KC +V KPR+G
Sbjct: 116 ELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGT-SWSKCAEHRVAMKPRRG 174
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
DGL+F+S+ PNG ID ++H CPVV G KW AT W+
Sbjct: 175 DGLIFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWV 211
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 118/215 (54%), Gaps = 13/215 (6%)
Query: 74 VTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTS 133
+ +I +VLSW PR + T E+C ++ A L S + ++ IRTS
Sbjct: 11 LLHILLKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTS 70
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
G+F EDE + +E ++++ +++P +GE +LRY+ G++Y H+D F
Sbjct: 71 YGMFFDRGEDE--VVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSV 128
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPF-----ENGMNADGSYDYQKCI--GLKVKPRQGD 246
R+A+ L+YL + E GGET+FP E + A Y +C GL VKPR+GD
Sbjct: 129 QNGGNRLATILMYLAEPEFGGETVFPNVKAPPEQTLEAG----YSECATQGLAVKPRKGD 184
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 281
+LF+SL GT+D S+HGSCP +KG K+ ATKW
Sbjct: 185 AVLFFSLRTEGTLDKGSLHGSCPTLKGFKFAATKW 219
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 99/155 (63%), Gaps = 13/155 (8%)
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
+S + + + IE+++A T LP N E+ +LRY+ GQKY++H+D F +
Sbjct: 10 MLSPPQPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLG 69
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSY------DYQKC--IGLKVKPRQGDGL 248
QRVA+ L+YLTD+ +GGET+FP NA+GS+ + +C GL VKP++GD L
Sbjct: 70 GQRVATVLMYLTDVNKGGETVFP-----NAEGSHLQYKDETWSECSRSGLAVKPKKGDAL 124
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LF++L N T D S+HGSCPV++GEKW ATKWI
Sbjct: 125 LFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIH 159
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 119/212 (56%), Gaps = 17/212 (8%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 140
V+SW PRA NF T +C I ++A++++R ST+ G +V RTS G FI+
Sbjct: 3 VISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
++ + +E+++A +T P + E +LRY +GQ Y+ H D+ + S R+
Sbjct: 61 Y--QTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLE------NDSPRM 112
Query: 201 ASFLVYLTDLEEGGETMFP----FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLL 254
A+ L+YL++ E GGET FP + + A + C+ + KPR+GD LLF+S+
Sbjct: 113 ATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWSVK 172
Query: 255 PNG-TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+G T DP S H CPV++G KW AT W+ Q
Sbjct: 173 PDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQ 204
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 119/205 (58%), Gaps = 5/205 (2%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VL+W PR + NF + E+C + +A L ST+ + G+ + + +RTSSG+F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
++ E + + IE++I+ + +P NGE +LRY+ Q Y +D F + Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGGQ 199
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
+A+ L+YL+D EGGET FP + + S + GL VKP +G+ +LF+S+ +G
Sbjct: 200 GIATMLMYLSDNIEGGETYFPLAG--SGECSCGGKLVKGLSVKPIKGNAVLFWSMGLDGQ 257
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIR 283
DP S+HG C V+ GEKW ATKW+R
Sbjct: 258 SDPNSVHGGCEVISGEKWSATKWLR 282
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 114/204 (55%), Gaps = 11/204 (5%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL--ALRKGETVDNTQGIRTSSGVFISAA-- 141
PRA F + +C ++ A+ N+ S + A G + N IRTS+G F+
Sbjct: 166 PRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSN---IRTSTGSFVPTVFP 222
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+ + IE +IA T +P +GE +LRY+IGQ+Y SH+D F + G K+ R+A
Sbjct: 223 LGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFFHE--GGMKNNRIA 280
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTI 259
+ L+YL+D+++GGET+FP + + C G+ V P++GD +LF+++ G +
Sbjct: 281 TVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNMKVGGDL 340
Query: 260 DPTSIHGSCPVVKGEKWVATKWIR 283
D S H CPVV GEKW ATKW+
Sbjct: 341 DGGSTHAGCPVVLGEKWTATKWLH 364
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 119/216 (55%), Gaps = 16/216 (7%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+ +S PRA + NF T E+ ++ I A+ +R S + + + + T RTSSG ++
Sbjct: 78 IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEV-VNEADGTSKTSDERTSSGGWV 136
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
S + E + IE ++A TMLPR GE ++RY+ GQ+Y +H D F + Q
Sbjct: 137 SGEDSE--VMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQ 194
Query: 199 RVASFLVYLTDLEEGGETMFPF---------ENGMNADGSYDYQKCIG----LKVKPRQG 245
R A+ L+YL+D+EEGGET+FP E G+ + G L VKPR+G
Sbjct: 195 RAATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPRRG 254
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 281
D LLF+++ NG +D + H CPVV+G KW AT+W
Sbjct: 255 DALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRW 290
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 108/203 (53%), Gaps = 13/203 (6%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 144
PRA + F T +C I A L S + GE V + IRTS G+F ED+
Sbjct: 83 PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSA--IRTSDGMFFDRGEDD 140
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ--EYGPQKSQRVAS 202
+D +E +I+ T LP NGE +LRY GQKY++H DAF + QRVA+
Sbjct: 141 --VVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGGQRVAT 198
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTID 260
L+YL D+++GGET+FP G Y C G+ VKPR+GD LLF+S+ T
Sbjct: 199 VLMYLNDVDDGGETVFPETTAKPHVGDERYSACARRGVAVKPRRGDALLFWSMDETFT-- 256
Query: 261 PTSIHGSCPV-VKGEKWVATKWI 282
S+HG CPV G KW TKWI
Sbjct: 257 -RSLHGGCPVGAGGVKWSMTKWI 278
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 121/229 (52%), Gaps = 40/229 (17%)
Query: 69 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPS--TLALRKGETV 124
AGD V +VL M PR + F N +PE+C+++I A+ + S A GE V
Sbjct: 83 AGDRRV-----EVLMAMANPRVVLFGNLLSPEECQAVIEAARTRMARSLTVQAASGGEEV 137
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
+ RTS G+F E+E+ + +EE+IA++ P NGE +L Y+ G +Y HY
Sbjct: 138 NKD---RTSDGMFFQRGENEA--VARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHY 192
Query: 185 DAFDPQEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLK 239
D FDP E G + QRVA+ ++YL D GG T FP + L+
Sbjct: 193 DYFDPAEPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFP---------------DVPLE 237
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 286
+ PRQG+ + F G P+S +HG PV++GEKW+ATKW+R++E
Sbjct: 238 IGPRQGNAVFFSY----GRAHPSSRTLHGGAPVIEGEKWIATKWLRERE 282
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/222 (33%), Positives = 127/222 (57%), Gaps = 36/222 (16%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVD------------ 125
+V+SW PRA + NF T E+C+ +I++AK ++ S + ++ G+++D
Sbjct: 80 LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVF 139
Query: 126 ----------NTQ-------GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEA 168
N++ +RTSSG F++ DE ++ IE +I+ T +P NGE
Sbjct: 140 TFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDE--IVEEIENRISDFTFIPPENGEG 197
Query: 169 FNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADG 228
+L Y++GQ+Y H+D F + + QR+A+ L+YL+D++EGGET+FP G +D
Sbjct: 198 LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDV 257
Query: 229 SY--DYQKC--IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
+ + +C GL V P++ D LLF+S+ P+ ++DP+S+HG
Sbjct: 258 PWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHG 299
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 115/222 (51%), Gaps = 24/222 (10%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRT 132
+ +SW PRA + NF T +C ++ + T + + VD+ G IRT
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIG-------TQRVSRSLVVDSQTGQSKLDDIRT 545
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 192
S G ED + IEE+IA+ T LP +GE ILRY GQKY++H+D FD +
Sbjct: 546 SYGAAFGRGEDP--VIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVH 603
Query: 193 GPQ---KSQRVASFLVYLTDLEEGGETMFPFEN--GMNADGSYDYQKC---IGLKVKPRQ 244
R A+ L+YL+++E GGET P + M+ + C +GL ++PR+
Sbjct: 604 HRSYLVDGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENPSPCAAKMGLSIRPRK 663
Query: 245 GDGLLFYSLLPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQ 285
GD LLFY + G D ++H SCP +KG KW ATKWI +
Sbjct: 664 GDALLFYDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSK 705
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 117/205 (57%), Gaps = 11/205 (5%)
Query: 87 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESG 146
R +F T E+ I+ +++ L S + G + ++ IRTS GVF+ ED
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQ--IRTSFGVFLERGEDP-- 56
Query: 147 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVY 206
+ +EE+I+ +T++P NGE +LRY+ QKY++H+D F ++ R A+ L+Y
Sbjct: 57 VVKGVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMY 116
Query: 207 LTDLEEGGETMFPFENGMNADGSYD--YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPT 262
L D EEGGET+FP + A G + + +C L KP++G +LF+S+ P G ++
Sbjct: 117 LVDTEEGGETVFP---NIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173
Query: 263 SIHGSCPVVKGEKWVATKWIRDQEQ 287
S+H +CPV+KG KW A KWI + Q
Sbjct: 174 SLHTACPVIKGIKWSAAKWIHVKPQ 198
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 64/133 (48%), Positives = 87/133 (65%), Gaps = 4/133 (3%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 210
IE++IA T +P NGE +L Y +GQK+ H+D D R A+FL+YL+D+
Sbjct: 14 IEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGPRKATFLMYLSDV 73
Query: 211 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV 270
EEGGET+FP A GS K G+ VKP+ GD LLF+S+ P+G++DP S+HG+ PV
Sbjct: 74 EEGGETVFP---NATAKGSAPSAKS-GISVKPKMGDALLFWSMKPDGSLDPKSLHGASPV 129
Query: 271 VKGEKWVATKWIR 283
+KG+KW ATKWI
Sbjct: 130 IKGDKWSATKWIH 142
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 113/212 (53%), Gaps = 18/212 (8%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+ +SW PR + NF + +C+ I A ++ S++ G +V +T IRTS G FI
Sbjct: 2 IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTNGSSVLDT--IRTSYGTFI 59
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
D ++ + ++A T P N E +LRY GQKY +H D+ S
Sbjct: 60 RRRHDP--VVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLI------DDSP 111
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSY-----DYQKCIGLKV--KPRQGDGLLFY 251
R+A+ L+YL D E GGET FP ++G D S + +C V +P++GD L+F+
Sbjct: 112 RMATVLLYLHDTEYGGETAFP-DSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFW 170
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
S+ P+GT DP S+H CPVV G KW AT W+
Sbjct: 171 SIKPDGTHDPLSLHTGCPVVTGVKWTATSWVH 202
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 115/222 (51%), Gaps = 28/222 (12%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+ +SW PR + NF T +C+ I A ++ S++ + G +V T IRTS G FI
Sbjct: 2 IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQNGSSV--TDNIRTSYGTFI 59
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFN------------ILRYKIGQKYNSHYDA 186
D ++ I ++A T P N E +LRY IGQKY +H D+
Sbjct: 60 RRRHDP--VIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDS 117
Query: 187 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG-MNADGSYD---YQKCIGLKV-- 240
S R+A+ L+YL D EEGGET FP + + D + + +C V
Sbjct: 118 L------IDDSPRMATVLLYLHDTEEGGETAFPDSSSWLTPDLATRMGPFSECAQGHVAF 171
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
+P++GD L+F+S+ P+GT DP S+H CPVVKG KW AT W+
Sbjct: 172 RPKKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWV 213
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 116/207 (56%), Gaps = 29/207 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 143
PR + F N +PE+C +II+ A+ + R T+A R G E V++ RTS+G+F +
Sbjct: 122 PRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD---RTSNGMFFQ--RE 176
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 198
E+ + +E +IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 177 ENPVVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQ 236
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RVA+ ++YL D E+GG T FP + L+V PR+G+ + F P+ +
Sbjct: 237 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 281
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQ 285
++HG PVV G+KW+ATKW+R++
Sbjct: 282 T--RTLHGGAPVVAGDKWIATKWLRER 306
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 78/218 (35%), Positives = 117/218 (53%), Gaps = 29/218 (13%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+ +SWMPRA + F TP +C +I +A L S + + +D+ IRTS I
Sbjct: 60 ETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDD---IRTSFSASIM 116
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-SQ 198
E + + IEE+IA+ + +LRY GQKY++H+D FD E S
Sbjct: 117 YGE--TSIVSSIEERIAR-----------WTVLRYVNGQKYDAHWDWFDDNEVAKAGGSN 163
Query: 199 RVASFLVYLTDLE--EGGETMFPFENGMN-----ADGSYDYQKC---IGLKVKPRQGDGL 248
R+A+ L+YL+D++ GGET P ++ DG Y +C +G+ ++PR+GD L
Sbjct: 164 RMATVLMYLSDVDPAAGGETALPLAEPLDPHKQSVDGQ-GYSQCAARMGISIRPRKGDVL 222
Query: 249 LFYSLLPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQ 285
LF+ + P G I D ++H SCP G KW ATKWI ++
Sbjct: 223 LFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNK 260
>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
Length = 1036
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/121 (57%), Positives = 82/121 (67%), Gaps = 4/121 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
Q LSW PR Y PNFAT +QC+++I+MAK L+PSTLALRK ET R+ +
Sbjct: 798 QGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRK-ETKHFQMQYRS---LHQH 853
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
EDESG L IEEKIA T P+ E+FNILRY++GQKY+SHYDAF EYGP SQR
Sbjct: 854 TDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQR 913
Query: 200 V 200
V
Sbjct: 914 V 914
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/147 (45%), Positives = 92/147 (62%), Gaps = 5/147 (3%)
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
A +DE T IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R
Sbjct: 191 ATQDEVVTR--IEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 200 VASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
+A+ L+YL+++E+GGET+FP G D ++ G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV++G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWIH 335
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 116/207 (56%), Gaps = 29/207 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 143
PR + F N +PE+C +II+ A+ + R T+A R G E V++ RTS+G+F +
Sbjct: 111 PRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD---RTSNGMFFQ--RE 165
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 198
E+ + +E +IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 166 ENPMVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQ 225
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RVA+ ++YL D E+GG T FP + L+V PR+G+ + F P+ +
Sbjct: 226 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 270
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQ 285
++HG PVV G+KW+ATKW+R++
Sbjct: 271 T--RTLHGGAPVVAGDKWIATKWLRER 295
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 119/214 (55%), Gaps = 16/214 (7%)
Query: 79 FQVLSW-MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG-IRTSSGV 136
Q++S PRA + F T E+C +I+ +K + S + ET + IRTS+G
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRMSKSGVV--DAETGGTAKSDIRTSTGS 234
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F+ ++ + +E+++A +MLP + EA +LRY++ Q+Y +HYD F + G
Sbjct: 235 FVGIGAND--LMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYFFHK--GGMA 290
Query: 197 SQRVASFLVYLTDLEEGGETMFP-----FENGMNADGSYDYQKC--IGLKVKPRQGDGLL 249
+ R+ + L+YL + E GGET+FP E G ++ +C G R+GD L+
Sbjct: 291 NNRIVTILMYLHEPEFGGETVFPNTEVPLERAEKGWGK-NFSECGNRGRAAVVRKGDALI 349
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F+S+ P G +DP S H CPVV+GEKW ATKWI
Sbjct: 350 FWSMKPGGELDPGSSHAGCPVVRGEKWTATKWIH 383
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 118/211 (55%), Gaps = 16/211 (7%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
F+ +SW PRA NFA+ E+ +I +A+ LR ST+ +GE+V RTS G+FI
Sbjct: 32 FEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSRGESV--VDNYRTSYGMFI 89
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
DE + +E+++A T + E +LRY Q+Y +H+D+ D S
Sbjct: 90 RRHHDE--VVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLD------DDSP 141
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMN-----ADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
R A+ L+YL+D+E GGET FP ++ A G + + +KP++GD ++F+SL
Sbjct: 142 RTATVLIYLSDVESGGETTFPNSEWIDPALPKALGPFSECAQGHVAMKPKRGDAIVFHSL 201
Query: 254 LPNG-TIDPTSIHGSCPVVKGEKWVATKWIR 283
P+G + D ++H +CPV+ G K+VA WI
Sbjct: 202 NPDGRSHDQHALHTACPVIVGVKYVAIFWIH 232
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 125/220 (56%), Gaps = 28/220 (12%)
Query: 67 GQAGDDSVT-NIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 123
GD VT + Q++S + P + N + E+C+S+I ++K +++ S + +
Sbjct: 21 NHIGDTIVTEDREIQIISRVEEPLIVVLENVLSDEECESLIELSKDSMKRSKIGASR--E 78
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
VDN IRTSSG F+ +E+ T+ +IE++++ + +P +GE +IL+Y GQ+Y +H
Sbjct: 79 VDN---IRTSSGTFL----EENETVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAH 131
Query: 184 YDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 243
YD F + + R+++ ++YL D+EEGGET FP + L + P+
Sbjct: 132 YDYFAEHSRAAE-NNRISTLVMYLNDVEEGGETFFP---------------KLNLSIAPK 175
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G + F + +++ ++HG PV+KGEKWVAT+W++
Sbjct: 176 KGSAVYFEYFYNDKSLNELTLHGGAPVIKGEKWVATQWMK 215
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/147 (44%), Positives = 92/147 (62%), Gaps = 5/147 (3%)
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
A +DE + IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248
Query: 200 VASFLVYLTDLEEGGETMFPFENG---MNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
+A+ L+YL+++E+GGET+FP G D ++ G VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
T D S+HGSCPV++G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWIH 335
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/140 (44%), Positives = 88/140 (62%), Gaps = 4/140 (2%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 210
IE +IA T +P NGE +L Y +G+KY HYD F + QRVA+ L+YL+D+
Sbjct: 14 IERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQRVATVLMYLSDV 73
Query: 211 EEGGETMFPFENGMNADGSY--DYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
EEGGET+FP + + D +C GL +KP+ GD LLF+S+ P+ T+D +S+HG
Sbjct: 74 EEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMRPDATLDASSLHG 133
Query: 267 SCPVVKGEKWVATKWIRDQE 286
CPV+ G KW +TKW+ +E
Sbjct: 134 GCPVIVGNKWSSTKWMHLEE 153
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 119/228 (52%), Gaps = 38/228 (16%)
Query: 69 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 125
AGD V QVL M PR + F N + E+C+ +I A++ L R T+ R G V
Sbjct: 79 AGDRQV-----QVLQTMRHPRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVL 133
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
N RTS G+F E+E + +E+++A + P GE ILRY G +Y HYD
Sbjct: 134 NVD--RTSDGMFFERGENE--IVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYD 189
Query: 186 AFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
FDP E G + QRVA+ ++YL + E+GG T FP +GL+V
Sbjct: 190 YFDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFP---------------DVGLEV 234
Query: 241 KPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 286
P +G G+ F P DP + +HG PV+ GEKWVATKW+R++E
Sbjct: 235 APVRGTGVFFSYDRP----DPVTRTLHGGAPVLAGEKWVATKWLRERE 278
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 119/228 (52%), Gaps = 38/228 (16%)
Query: 69 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 125
AGD V QVL M PR + F N +PE+C+ +I A++ L R T+ R G V
Sbjct: 78 AGDRRV-----QVLQTMRHPRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVL 132
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
N RTS G+F E++ + +E++IA + P GE ILRY G +Y HYD
Sbjct: 133 NVD--RTSEGMFFERGEND--IVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYD 188
Query: 186 AFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
FDP E G + QRVA+ ++YL + +GG T FP +GL+V
Sbjct: 189 YFDPGEPGTPTILKRGGQRVATLVMYLQEPGQGGATTFP---------------DVGLEV 233
Query: 241 KPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 286
P +G G+ F P DP + +HG PV+ GEKWVATKW+R++E
Sbjct: 234 APVRGTGVFFSYEEP----DPATRTLHGGAPVLAGEKWVATKWLRERE 277
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 116/217 (53%), Gaps = 16/217 (7%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+ +SW+PRA + F + +C +I +A L S + K + VD IRTS I
Sbjct: 38 IETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDP---IRTSYSASI 94
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGPQKS 197
+E+ + IE +IA+ T LPR + E +LRY GQKY++H+D FD + G
Sbjct: 95 --GYNETDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWFDETETGGTGGG 152
Query: 198 QRVASFLVYLTDLE--EGGETMFPFE-------NGMNADGSYDYQKCIGLKVKPRQGDGL 248
R+A+ L+YL+D+E GGET P G+ G + +G+ V+P++GD L
Sbjct: 153 NRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGRGYSECASKMGISVRPKKGDVL 212
Query: 249 LFYSLLPNG-TIDPTSIHGSCPVVKGEKWVATKWIRD 284
LF+ + P G D ++H SCP G KW ATKWI +
Sbjct: 213 LFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIHN 249
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 68/164 (41%), Positives = 99/164 (60%), Gaps = 9/164 (5%)
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T IRTS GVF+ EDE + +EE+IA T++P NGE +LRY+ QKY++H+D
Sbjct: 35 TSNIRTSYGVFLDRGEDE--IVKRVEERIAAWTLMPVGNGEGLQVLRYQKEQKYDAHWDY 92
Query: 187 FDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD--YQKCI--GLKVKP 242
F ++ R A+ L+YL D EEGGET+FP + A G + + +C L KP
Sbjct: 93 FFHKDGITNGGNRYATVLMYLVDTEEGGETVFP---NVAAPGGENVGFSECARYHLAAKP 149
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
++G +LF+S+ P G ++ S+H +CPV++G KW A KWI E
Sbjct: 150 KKGTAILFHSIKPTGELERKSLHTACPVIRGIKWSAAKWIHHAE 193
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 118/228 (51%), Gaps = 38/228 (16%)
Query: 69 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 125
AGD V QVL M PR + F N + E+C+ +I A++ L R T+ R G V
Sbjct: 79 AGDRRV-----QVLQTMRHPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVL 133
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
N RTS G+F E+E + +E++IA + P GE ILRY G +Y HYD
Sbjct: 134 NVD--RTSDGMFFERGENE--IVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYD 189
Query: 186 AFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
FDP E G + QRVA+ ++YL + E GG T FP +GL+V
Sbjct: 190 YFDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEV 234
Query: 241 KPRQGDGLLFYSLLPNGTIDPT--SIHGSCPVVKGEKWVATKWIRDQE 286
P +G G+ F P DP ++HG PV+ GEKWVATKW+R++E
Sbjct: 235 APARGCGVFFSYDRP----DPVTRTLHGGAPVLAGEKWVATKWLRERE 278
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 111/206 (53%), Gaps = 23/206 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + F NF +C +++ M++ NL PS + + + + RTS G A E+
Sbjct: 103 PNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFE-LKPSRTSGGTHF--ARGET 159
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRV 200
+ IE +IA + +P +GE IL Y + +Y HYD FDP++ G Q+ QRV
Sbjct: 160 PLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQRV 219
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
+ ++YL+D+E GG T+FP +GL+V+P++G L F + +G +D
Sbjct: 220 GTLIMYLSDVESGGATVFP---------------RVGLEVQPQKGAALFFSYVGEHGKLD 264
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQE 286
S+HG PV+ GEKW+ATKW+R E
Sbjct: 265 LQSLHGGSPVLAGEKWIATKWLRAAE 290
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 109/204 (53%), Gaps = 19/204 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P L+ F + E+C +I M++ L+PST+ K G RTS G+ E+E
Sbjct: 109 PFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATG-RTSKGMSFYLQENE- 166
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-QRVASFL 204
+ +E++IA++ P NGE +L Y IG++Y SH+D F + P+K QRV +FL
Sbjct: 167 -FIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGTFL 225
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL D+ GGET+FP G+ + P++G + F G +D S+
Sbjct: 226 IYLNDVPAGGETVFP---------------KAGVSIVPKKGSAVYFQYGNSKGEVDRMSL 270
Query: 265 HGSCPVVKGEKWVATKWIRDQEQY 288
H S PV +GEKWVATKWIR + Y
Sbjct: 271 HSSIPVSEGEKWVATKWIRQENIY 294
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 118/212 (55%), Gaps = 35/212 (16%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-RTSSGVFISA 140
+S PR P+F +P +C+ +I+++K LRP + + G+ R+ G+F+
Sbjct: 29 MSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCN---------EISSGVHRSGWGLFMKE 79
Query: 141 AEDESGTLDLIEEKIAKVTMLPRI--NGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQK- 196
E++ D++++ ++ ML + N E ++RY G++ ++HYD F+P G K
Sbjct: 80 GEEDH---DVVKKIFQRMKMLVNLTENCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKI 136
Query: 197 ---SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRV + L+YL+++EEGGET FP +G+KVKP +GD +LFY+
Sbjct: 137 GLYGQRVCTILMYLSEVEEGGETSFPE---------------VGVKVKPVKGDAVLFYNC 181
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
PNG +DP S+H PV+KG KWVA K I +
Sbjct: 182 KPNGEVDPLSLHQGDPVIKGTKWVAIKLINQK 213
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 121/217 (55%), Gaps = 27/217 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKL-NLRPSTLAL-RKGETVDNTQGIRTSSGVFIS 139
+SW PRA + F + +C +I++AK + ST+ GE+V T +RTSSG+F+
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESV--TSKVRTSSGMFLD 102
Query: 140 AAEDESGTLDLIEEKIAKVTMLP-----------------RINGEAFNILRYKIGQKYNS 182
+DE + IEE+IA TMLP NGE+ ILRY G+KY
Sbjct: 103 KKQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEP 160
Query: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKV 240
H+D ++ ++ RVA+ L+YL++++ G +++ P + ++ + C G V
Sbjct: 161 HFDYISGRQGSTREGDRVATVLMYLSNVKMG-DSLLP-QARLSQPKDETWSDCAEQGFAV 218
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 277
KP +G +LF+SL PN T+D S+HGSCPV++GEK V
Sbjct: 219 KPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKVV 255
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 113/207 (54%), Gaps = 27/207 (13%)
Query: 79 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
Q++S + P + N + E+C+++I M+K ++ S + + + T IRTSSG
Sbjct: 33 IQIISRLEEPLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR-----KTNDIRTSSGA 87
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F+ +ES IE +IA + +P +GE IL+Y +GQ+Y +HYD F +
Sbjct: 88 FL----EESEITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAAS 142
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
+ R+++ ++YL +EEGGET FP + L V P++G + F +
Sbjct: 143 NNRMSTLVMYLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQD 187
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
+I+ ++HG PV+KGEKWVAT+W+R
Sbjct: 188 ESINKLTLHGGAPVIKGEKWVATQWMR 214
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 80/237 (33%), Positives = 118/237 (49%), Gaps = 28/237 (11%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
HG D ++ + +S PRA F F T +C +I A + S E D
Sbjct: 55 HGVDAADGGSSGWIEKISDSPRAYVFREFLTDAECDRVIERAYPTMEAS-------EVTD 107
Query: 126 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 179
+ G R+S G ++S +DE + IE + + MLP GE +LRY+ GQK
Sbjct: 108 DDSGEARPDDARSSIGGWVSGDDDE--VIRNIELRASTWAMLPMNRGETMQVLRYEKGQK 165
Query: 180 YNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPF----------ENGMNADGS 229
Y++H D F + QRVA+ L+YL+D+EEGGET+FP ++G+ D +
Sbjct: 166 YDAHDDFFHDEHNVKNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNA 225
Query: 230 YDYQKCIG---LKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+ L VKPR+GD LLF++ +G +D + H CPV +G KW T+W R
Sbjct: 226 CELASQNDPRVLAVKPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHR 282
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 113/207 (54%), Gaps = 27/207 (13%)
Query: 79 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
Q++S + P + N + E+C+++I M+K ++ S + + + T IRTSSG
Sbjct: 33 IQIISRLEEPLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR-----KTNDIRTSSGA 87
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F+ +ES IE +IA + +P +GE IL+Y +GQ+Y +HYD F +
Sbjct: 88 FL----EESEITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAAS 142
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
+ R+++ ++YL +EEGGET FP + L V P++G + F +
Sbjct: 143 NNRMSTLVMYLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQD 187
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIR 283
+I+ ++HG PV+KGEKWVAT+W+R
Sbjct: 188 ESINKLTLHGGAPVIKGEKWVATQWMR 214
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 78/213 (36%), Positives = 114/213 (53%), Gaps = 25/213 (11%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 140
V+SW PRA NF T ++ I ++A++++R ST+ G +V RTS G FI+
Sbjct: 3 VISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+ + +E+++A +T +P E +LRY GQ Y+ H D+ + S R+
Sbjct: 61 YA--TPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLE------NDSPRL 112
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIG---------LKVKPRQGDGLLFY 251
A+ L+YL+D E GGET FP A D K G + KPR+GD LLF+
Sbjct: 113 ATVLLYLSDPELGGETAFPL-----AWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFW 167
Query: 252 SLLPNG-TIDPTSIHGSCPVVKGEKWVATKWIR 283
S+ P+G T DP S H CPV++G KW AT W+
Sbjct: 168 SVKPDGKTEDPLSEHEGCPVIRGVKWTATVWVH 200
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 90/149 (60%), Gaps = 5/149 (3%)
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
I +DE + IEE+I+ T LP NGEA IL Y+ G+KY HYD F +
Sbjct: 5 ILTCQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGG 62
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLL 254
R+A+ L+YL+++E+GGET+FP G D + C G VKP +GD LLF+SL
Sbjct: 63 HRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLH 122
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+ T D S+HGSCP ++G+KW ATKWI
Sbjct: 123 PDSTTDSDSLHGSCPAIEGQKWSATKWIH 151
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 64/136 (47%), Positives = 85/136 (62%), Gaps = 3/136 (2%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 210
IEE+I+ T LP NGEA IL Y+ G+KY HYD F + R+A+ L+YL+++
Sbjct: 10 IEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMYLSNV 69
Query: 211 EEGGETMFPFENGMNADGSYD-YQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
E+GGET+FP G D + C G VKP +GD LLF+SL P+ T D S+HGS
Sbjct: 70 EKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGS 129
Query: 268 CPVVKGEKWVATKWIR 283
CP ++G+KW ATKWI
Sbjct: 130 CPAIEGQKWSATKWIH 145
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 77/237 (32%), Positives = 120/237 (50%), Gaps = 25/237 (10%)
Query: 63 WMPHGQAGDDSVTNIPF-QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKG 121
W+ +++ ++ P+ + LSW PRA + NF + E+ K ++++ + + ST+ G
Sbjct: 51 WLGEVNTQEETNSSKPWIEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVV---G 107
Query: 122 ETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 181
IRTS G FI DE L+ IE++ A + +P + E +LRY+ GQKY+
Sbjct: 108 GQTGRVSDIRTSFGTFIPKKYDE--VLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYS 165
Query: 182 SHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN-------------ADG 228
H D + G +R+A+ L++L + EGGET F N + +D
Sbjct: 166 DHTDGLISENGG----KRIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDC 221
Query: 229 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
Y K G VKP+ GD +LF+S G D S+H SCP + G KW AT WI ++
Sbjct: 222 GYRSGK--GFAVKPKVGDAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHER 276
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 114/209 (54%), Gaps = 15/209 (7%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
++W PR + NF T + K +I +A ++ ST+ G++V++ RTS G F+
Sbjct: 4 VAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDN--YRTSYGTFLKRY 61
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+DE ++ IE ++A T +P + E ILRY +GQ+Y H D +E G RVA
Sbjct: 62 QDE--IVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDEEAG----VRVA 115
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGS----YDYQKCIGLKV--KPRQGDGLLFYSLLP 255
+ L+YL + + GGET FP +N + ++ C V P++GD LLF+S+ P
Sbjct: 116 TVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWSINP 175
Query: 256 NG-TIDPTSIHGSCPVVKGEKWVATKWIR 283
+G T D + H CPV+ G KW ATKWI
Sbjct: 176 DGNTEDTHASHTGCPVLSGVKWTATKWIH 204
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 107/204 (52%), Gaps = 19/204 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P L+ + E+C +I++++ L+PS L + +G + RTS + E+E
Sbjct: 96 PFVLHLDQVLSSEECDELISLSRSRLQPS-LVVDRGSGEERAGSGRTSKSMAFRLKENE- 153
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-QRVASFL 204
++ IE +IA++T P NGE IL Y +G++Y H+D F P K QRV +FL
Sbjct: 154 -LVERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKGGQRVGTFL 212
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL D+E+GGET+F GL P++G + F+ G +D S+
Sbjct: 213 IYLNDVEDGGETVFS---------------KAGLSFVPKKGAAIYFHYGNAQGQLDRLSV 257
Query: 265 HGSCPVVKGEKWVATKWIRDQEQY 288
H S PV KGEKW ATKWIR+ Y
Sbjct: 258 HSSVPVRKGEKWAATKWIRESNIY 281
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 113/208 (54%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDE 144
P F N + +C+++I A+ L R T+ +R G N RTS G+F + E+E
Sbjct: 90 PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGGEELNHD--RTSHGMFYTRGENE 147
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQR 199
+ IE +IA++ P NGE +LRY+ G +Y HYD FDP E G + QR
Sbjct: 148 --VIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAAILRRGGQR 205
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF-YSLLPNGT 258
VAS ++YL + EGG T+FP IGLKV+P+QG + F Y+L +
Sbjct: 206 VASLIMYLREPGEGGATVFP---------------DIGLKVRPQQGSAVFFSYALAHPAS 250
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+ ++HG PV GEKW+ATKW+R++E
Sbjct: 251 L---TLHGGEPVKSGEKWIATKWLRERE 275
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 118/228 (51%), Gaps = 38/228 (16%)
Query: 69 AGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVD 125
AGD V QVL M PR + F N + E+C+ +I A++ L R T+ R G V
Sbjct: 79 AGDRRV-----QVLQTMRHPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVL 133
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
N RTS G+F E+E + +E+++A + P GE ILRY G +Y HYD
Sbjct: 134 NVD--RTSDGMFFERGENE--IVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYD 189
Query: 186 AFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
FDP E G + QRVA+ ++YL + E GG T FP +GL+V
Sbjct: 190 YFDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEV 234
Query: 241 KPRQGDGLLFYSLLPNGTIDPTS--IHGSCPVVKGEKWVATKWIRDQE 286
P +G G+ F P DP + +HG PV+ GEKWVATKW+R++E
Sbjct: 235 APVRGCGVFFSYDRP----DPVTRTLHGGAPVLAGEKWVATKWLRERE 278
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 111/212 (52%), Gaps = 37/212 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 139
PR + F N + E+C +II A+ +R S TVDN G RTS+G+F
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAARPRMRRSL-------TVDNQSGGEAVNDDRTSNGMFFQ 164
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP----- 194
E+E + L+E++IA++ P NGE +L Y+ G +Y HYD F P E G
Sbjct: 165 RGENE--LISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILK 222
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
+ QRV + ++YL + GG T FP +GL+V PR+G+ + F
Sbjct: 223 RGGQRVGTLVMYLNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNR 267
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ ++HG PV++GEKW+ATKW+R++E
Sbjct: 268 PDPATK--TLHGGAPVLEGEKWIATKWLRERE 297
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P ++GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 113/221 (51%), Gaps = 25/221 (11%)
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G D I F++ S P+ F T ++C +++ +++ L S + + +N
Sbjct: 91 GGDRQVPILFRLAS--PQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDTGDENLID 147
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS G AE + IE +IA VT +P +GE IL YK G +Y H+D F+P
Sbjct: 148 ARTSMGAMFQVAEH--ALIARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNP 205
Query: 190 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 244
Q G + QR+A+ ++YL E GG T FP +GL+V P +
Sbjct: 206 QRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVGLEVAPVK 250
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G+ + F LLP+GT+D ++H PV GEKW+ATKW+R++
Sbjct: 251 GNAVYFSYLLPDGTLDERTLHAGLPVASGEKWIATKWLRER 291
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/262 (32%), Positives = 126/262 (48%), Gaps = 46/262 (17%)
Query: 44 VTAARPSARVV-ESVKDEYKWMPHGQAGDDSVT----NIPFQVLSWMPRALYFPNFATPE 98
+AA P A V ++V EY++ G + V NI + PRA+ F N + +
Sbjct: 53 ASAAAPDATTVTDAVPKEYRYEGCGLNTGNKVHIDGHNIAVLMKLEKPRAILFGNVLSHD 112
Query: 99 QCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGT---LD 149
+C +I ++K L LR G VD+ G RTSSG F GT +
Sbjct: 113 ECDQLIALSKTKL------LRSG-VVDHQTGNTKLHEHRTSSGTFF-----HRGTTPFIA 160
Query: 150 LIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFL 204
+I++++A + +P +GE IL Y++G +Y HYD F P G K QR A+ +
Sbjct: 161 MIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARGGQRTATLI 220
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL D++ GGET+FP GL + P +G + F +D S
Sbjct: 221 IYLNDVDGGGETIFPRN---------------GLSIVPAKGSAIYFSYTNAENQLDSLSF 265
Query: 265 HGSCPVVKGEKWVATKWIRDQE 286
HG PV++GEKW+ATKW+R E
Sbjct: 266 HGGSPVIEGEKWIATKWVRQNE 287
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 115/215 (53%), Gaps = 26/215 (12%)
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 130
DD V + VL P + F + ++C+ +I A L+ S L + V + I
Sbjct: 17 DDGV--VEATVLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLV---NKVVSD---I 68
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTS G+F E+ES + IE +IA++ +P + E +L Y GQ+Y +H+D F P
Sbjct: 69 RTSRGMFFE--EEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPG 126
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
++ R+++ +VYL D+EEGGET+FP +G+ +KP++G L F
Sbjct: 127 SPAA-RNNRISTLIVYLNDVEEGGETVFPL---------------LGIAMKPKRGAALYF 170
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
N ++ ++H S PVV+GEKWVAT+W+R Q
Sbjct: 171 EYFYRNQALNDLTLHSSVPVVRGEKWVATQWMRRQ 205
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDKLIELSKNNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 96/156 (61%), Gaps = 4/156 (2%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
+RTSSG+F+S+ E +S IE++I+ + +P NGE +LRY+ Q Y H+D F
Sbjct: 11 VRTSSGMFLSSEERKSPMA--IEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QRVA+ L+YL+D EGGET FP + + S + GL VKP +GD +L
Sbjct: 69 TFNLKRGGQRVATMLMYLSDNVEGGETYFPMAG--SGECSCGGKIVKGLSVKPIKGDAVL 126
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F+S+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 127 FWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQR 162
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F N +PE+C++II+ A+ + +L ++ + RTS G+F E +
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMA-RSLTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQRV 200
+ +EE+IA++ P NGE +L Y+ G +Y HYD FDP + G + QRV
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ ++YL + +GG T FP + L+V PRQG+ + F P+ +
Sbjct: 219 ATLVIYLNNPRKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQE 286
++HG V++GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 113/221 (51%), Gaps = 25/221 (11%)
Query: 69 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 128
AGD + F + S PR F P++C+++I +++ L S + + +N
Sbjct: 112 AGDGRDVPVLFAIES--PRIALFQRLLMPDECEALIALSRGRLARSPV-VNPDTGDENLI 168
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 188
RTS G E ++ +E +IA VT +P +GE IL YK G +Y HYD F+
Sbjct: 169 DARTSMGAMFQVGEHP--LIERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFN 226
Query: 189 PQEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 243
PQ G + QR+A+ ++YL D+ GG T FP +GL+V P
Sbjct: 227 PQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP---------------KLGLRVNPV 271
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
QG+ + F L +G++D ++H PV +GEKW+ATKW+R+
Sbjct: 272 QGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKWIATKWLRE 312
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 73/212 (34%), Positives = 112/212 (52%), Gaps = 37/212 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 139
PR + F N + E+C +II A RP +R+ TVDN G RTS+G+F
Sbjct: 119 PRVVVFGNLLSDEECDAIIAAA----RPR---MRRSLTVDNQSGGEAVNDDRTSNGMFFQ 171
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP----- 194
E++ + L+E++IA++ P NGE +L Y+ G +Y HYD F P E G
Sbjct: 172 RGEND--LISLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILK 229
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
+ QRV + ++YL + GG T FP +GL++ PR+G+ + F
Sbjct: 230 RGGQRVGTLVMYLNEPARGGATTFP---------------DVGLQIVPRRGNAVFFSYNR 274
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P+ ++HG PV++GEKW+ATKW+R++E
Sbjct: 275 PDPATK--TLHGGAPVLEGEKWIATKWLRERE 304
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKNNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------QLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 113/221 (51%), Gaps = 25/221 (11%)
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G D I F++ S P+ F + ++C +++ +++ L S + + +N
Sbjct: 90 GGDRRVPILFRLAS--PQVQLFQQLLSDDECDALVALSRGRLARSPV-VNPDTGDENLID 146
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS G AE + IE +IA VT +P +GE IL YK G +Y H+D F+P
Sbjct: 147 ARTSMGAMFQVAE--HALIARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNP 204
Query: 190 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 244
Q G + QR+A+ ++YL E GG T FP +GL+V P +
Sbjct: 205 QRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVGLEVAPVK 249
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G+ + F LLP+GT+D ++H PV GEKW+ATKW+R++
Sbjct: 250 GNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKWIATKWLRER 290
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKNNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 78/223 (34%), Positives = 123/223 (55%), Gaps = 17/223 (7%)
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQ 128
GD V I + LSW PRA N + ++I+ +A+ + R + + G++V N
Sbjct: 2 GDARV--IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNP- 58
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 188
IRTS F+S + + + E+++ VT LP + E +L Y G+KY++H D +
Sbjct: 59 -IRTSKQTFLSRNDP---VVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGE 114
Query: 189 P-QEYGPQKSQ----RVASFLVYLTDLEEGGETMFPFENGMNAD--GSYDYQKCIGLKV- 240
+ G Q S+ RVA+ L+YL + EEGGET FP ++ + + + KC +V
Sbjct: 115 EGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVA 174
Query: 241 -KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
KP +GDGL+F+S+ P+GTID ++H CP +G KW AT W+
Sbjct: 175 MKPTRGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWV 217
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 111/206 (53%), Gaps = 25/206 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F N +PE+C++II+ A+ + +L ++ + RTS G+F E +
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMA-RSLTVQTTTGGEEVNADRTSDGMFFQRGE--T 158
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQRV 200
+ +EE+IA++ P NGE +L Y+ G +Y HYD FDP + G + QRV
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ ++YL + +GG T FP + L+V PRQG+ + F P+ +
Sbjct: 219 ATLVIYLNNPLKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQE 286
++HG V++GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ E S
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 110/215 (51%), Gaps = 25/215 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQG 145
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEA 203
Query: 196 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 2/156 (1%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTSSG+F+S E + IE++I+ + +P NGE +LRY+ Q Y H+D F
Sbjct: 6 FRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSD 65
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QR+A+ L+YL++ EGGET FP + + S + GL VKP +GD +L
Sbjct: 66 TFNLKRGGQRIATMLMYLSENIEGGETYFP--KAGSGECSCGGKTVPGLSVKPAKGDAVL 123
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F+S+ +G DP SIHG C V+ GEKW ATKW+R +
Sbjct: 124 FWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQK 159
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 110/215 (51%), Gaps = 25/215 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQG 145
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEA 203
Query: 196 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 110/215 (51%), Gaps = 25/215 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISARTSQG 145
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEA 203
Query: 196 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 87 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 144
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 145 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEA 202
Query: 196 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 203 RQLDVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 247
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 282
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 25/215 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
IP PR + F +F + E+C +I + + L+ S + GE +N RTS G
Sbjct: 88 IPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSEG 145
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 146 AMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEA 203
Query: 196 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 204 RQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 248
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 249 VYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECGELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C+ +I ++K ++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECEELIELSKNKMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 114/197 (57%), Gaps = 31/197 (15%)
Query: 79 FQVLSWMPRALYFPNF--------ATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQG 129
+V++ PRA + NF T E+C+ +I++AK ++ S +R T +
Sbjct: 55 LEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARS--KVRNAITGLGEESS 112
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTSSG F+ D+ + IE++I++ T +P NGEA ++ Y++GQK+ H+D F
Sbjct: 113 SRTSSGTFLRKGHDK--IVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
QR+A+ L+YL+D+++GGET+FP G+ + G+ V+P++GD LL
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFPEAKGIKSKK--------GVSVRPKKGDALL 212
Query: 250 FYSLLPNGTIDPTSIHG 266
F+S+ P+G+ DP+S HG
Sbjct: 213 FWSMRPDGSQDPSSKHG 229
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + + + IRTSSG F+ ED
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFL---EDSE 90
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
TL IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 91 LTLK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/247 (31%), Positives = 121/247 (48%), Gaps = 26/247 (10%)
Query: 45 TAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSII 104
T A P+A V + E + + A S IP PR + F +F + E+C +I
Sbjct: 60 TPAAPTAPVPNPAQAEPE-AENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELI 118
Query: 105 NMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPR 163
+ + L+ S + GE +N RTS G E + IE +IA+ T +P
Sbjct: 119 ALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPV 174
Query: 164 INGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVYLTDLEEGGETMF 218
+GE F +L Y G +Y H+D F+P G + QRVA+ ++YL ++ GG T F
Sbjct: 175 EHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGF 234
Query: 219 PFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVA 278
P +GL+V P +G+ + F P+GT+D ++H PV +GEKW+A
Sbjct: 235 P---------------KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIA 279
Query: 279 TKWIRDQ 285
TKW+R++
Sbjct: 280 TKWLRER 286
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 108/212 (50%), Gaps = 31/212 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-RTSSGVFISA 140
LS PR P F T E+C +I+ +K LRP + + G+ R+ G+F+
Sbjct: 28 LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQK--- 196
E+E I K+ + + E I+RY G++ ++HYD F+P G K
Sbjct: 79 GEEEHPVTKNIFNKMKNFVNISD-SCEVMQIIRYNPGEETSAHYDYFNPLTTNGSMKIGL 137
Query: 197 -SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
QR+ + L+YL D+EEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFPE---------------VGIKVKPIRGDAVLFYNCKP 182
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NG +DP S+H PV KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVTKGTKWVAIKLINQKSK 214
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + N + E+C +II AK L S L ++ + RTSSG+F + ++
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARS-LTVQTATGGEELNADRTSSGMFFT--RGQT 165
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQRV 200
+ +E +IA++ P NGE +L Y+ G +Y HYD FDP+E G + QRV
Sbjct: 166 PEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRV 225
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ ++YL + GG T FP +GL+V P +G + F P+ T
Sbjct: 226 ATLVMYLNEPARGGGTTFP---------------DVGLEVAPVKGSAVFFSYDRPHPTTR 270
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQE 286
S+HG PV++GEKWVATKW+R++E
Sbjct: 271 --SLHGGAPVLEGEKWVATKWLRERE 294
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S + + N IRTSSG F+ E S
Sbjct: 77 PLIVVLANVLSDEECDELIELSKNKMERSKIGSSR-----NVNDIRTSSGAFLEENEFTS 131
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y + Q+Y +HYD F + + R+++ ++
Sbjct: 132 K----IEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYF-AEHSRSAANNRISTLVM 186
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 187 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 231
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 232 GGAPVTKGEKWIATQWMR 249
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 25/216 (11%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSS 134
IP PR + F +F + E+C +I + + L+ S + GE +N RTS
Sbjct: 78 EIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSE 135
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 136 GAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGE 193
Query: 195 QKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ +
Sbjct: 194 ARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVF 238
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 239 FVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 274
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 114/197 (57%), Gaps = 31/197 (15%)
Query: 79 FQVLSWMPRALYFPNF--------ATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQG 129
+V++ PRA + NF T E+C +I++AK ++ S +R T +
Sbjct: 88 LEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARS--KVRNALTGLGEESS 145
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTSSG FI + D+ + IE++I++ T +P+ NGE ++ Y++GQK+ H+D F
Sbjct: 146 SRTSSGTFIRSGHDK--IVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
QR+A+ L+YL+D+++GGET+FP G+ + G+ V+P++GD LL
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFPEAKGIKSKK--------GVSVRPKKGDALL 245
Query: 250 FYSLLPNGTIDPTSIHG 266
F+S+ P+G+ DP+S HG
Sbjct: 246 FWSMRPDGSRDPSSKHG 262
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 25/216 (11%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSS 134
IP PR + F +F + E+C +I + + L+ S + GE +N RTS
Sbjct: 87 EIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISARTSE 144
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G E + IE +IA+ T +P +GE F +L Y G +Y H+D F+P G
Sbjct: 145 GAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGE 202
Query: 195 QKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ +
Sbjct: 203 ARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVF 247
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 FVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K ++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ ED
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFL---EDSE 122
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
TL IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 123 LTLK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K ++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKNKMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ ED
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFL---EDSE 122
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
TL IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 123 LTLK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 25/216 (11%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSS 134
IP PR + F +F + E+C +I + + L+ S + GE +N RTS
Sbjct: 87 EIPILFAIETPRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGE--ENLISARTSQ 144
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 145 GAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGE 202
Query: 195 QKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QRVA+ ++YL + GG T FP +GL+V P +G+ +
Sbjct: 203 ARQLEVGGQRVATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGNAVF 247
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 FVYKRPDGTLDDKTLHAGLPVERGEKWIATKWLRER 283
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K N++ S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECAELIELSKSNMKRSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 112/221 (50%), Gaps = 25/221 (11%)
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G D I F++ S P+ F T ++C +++ +++ L S + + +N
Sbjct: 91 GGDRQVPILFRLAS--PQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDTGDENLID 147
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS G AE + IE +IA VT +P +GE IL YK G +Y H+D F+P
Sbjct: 148 ARTSMGAMFQVAEHP--LITRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNP 205
Query: 190 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 244
Q G + QR+A+ ++YL E GG T FP +GL+V P +
Sbjct: 206 QRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVGLEVAPVK 250
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G+ + F LLP+G +D ++H PV GEKW+ATKW+R++
Sbjct: 251 GNAVYFSYLLPDGALDERTLHAGLPVAFGEKWIATKWLRER 291
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMERSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/219 (33%), Positives = 115/219 (52%), Gaps = 27/219 (12%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+ LSW PRA + NF T ++C+ +I + + L ST+ KG+ D RTS G FI
Sbjct: 91 WTTLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGD-VHSARTSFGTFI 149
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ + TL +E+++A+ + +P + E +LRY+ GQ+Y + +
Sbjct: 150 T--RRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGN-------------GEK 194
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGS-----------YDYQKCIGLKVKPRQGDG 247
R+A+ L++L + E GGET FP + A S + + G V PR+GD
Sbjct: 195 RIATVLMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEGRGFSVIPRKGDA 254
Query: 248 LLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+LF+S NGT D + H SCP ++G K+ ATKWI ++E
Sbjct: 255 ILFFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKE 293
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 113/208 (54%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 143
PR + F N +PE+C ++I A+ L R T+A + G E +++ RTS G+F +
Sbjct: 115 PRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDD---RTSDGMFFQRGQ- 170
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 198
S + IEE+IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 171 -SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQ 229
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RV + ++YL E+GG T FP + L+V P++G+ + F P+ +
Sbjct: 230 RVGTLVMYLNTPEKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPS 274
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
++HG PV+ GEKW+ATKW+R++E
Sbjct: 275 T--RTLHGGAPVIAGEKWIATKWLRERE 300
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 110/212 (51%), Gaps = 31/212 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-RTSSGVFISA 140
LS PR P F T E+C+ +I+ +K LRP + + G+ R+ G+F+
Sbjct: 28 LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQK--- 196
E++ I K+ + + E ++RY G++ +SH+D F+P G K
Sbjct: 79 GEEDHQITKNIFNKMKSFVNISE-SCEVMQVIRYNQGEETSSHFDYFNPLTTNGSMKIGL 137
Query: 197 -SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
QRV + L+YL D+EEGGET FP +G+KVKP +GD +LFY+ P
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFPE---------------VGIKVKPIKGDAVLFYNCKP 182
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NG +DP S+H PV+KG KWVA K I + +
Sbjct: 183 NGDVDPLSLHQGDPVLKGNKWVAIKLINQKSK 214
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G V KGEKW+AT+W+R
Sbjct: 194 GGASVTKGEKWIATQWVR 211
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 111/206 (53%), Gaps = 27/206 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDE 144
PR + F + + ++C+ +I +AK L R T+A + G N RTSSG+F E+E
Sbjct: 99 PRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNED--RTSSGMFFQRGENE 156
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQR 199
+ IE +IA++ P NGE +L Y+ G +Y HYD FDP E G + QR
Sbjct: 157 --LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQR 214
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
V + ++YL + E+GG T FP + L+V P++G G+ F P+ +
Sbjct: 215 VGTLVMYLGEPEKGGGTTFP---------------DVHLEVAPKRGHGVFFSYERPHPST 259
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQ 285
++HG PV+ GEKW+ATKW+R++
Sbjct: 260 --RTLHGGAPVLAGEKWIATKWLRER 283
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMERSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K S LA K + + IRTSSG F+ ED
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL---EDNE 90
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
T+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 91 LTVK-IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 25/216 (11%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSS 134
IP PR + F +F + E+C +I + + L+ S + GE +N RTS
Sbjct: 87 EIPILFAIETPRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGE--ENLISARTSQ 144
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G E + IE +IA+ T +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 145 GAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGE 202
Query: 195 QKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QRVA+ ++YL + GG T FP +GL+V P +G+ +
Sbjct: 203 ARQLEVGGQRVATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGNAVF 247
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 FVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ ST+ + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G V KGEKW+AT+W+R
Sbjct: 194 GGASVTKGEKWIATQWVR 211
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +I+ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDELIELSKSKMERSKVGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ +T +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + ++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQLLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDGLIELSKNKIKRSKIGSSR-----DVNDIRTSSGAFLEENELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKWVAT+W+R
Sbjct: 194 GGAPVTKGEKWVATQWVR 211
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 19/225 (8%)
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI 130
DD + + LSW PR NF + E+C+ +I + + L ST+ +
Sbjct: 28 DDVERSKVVETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVV--NSDESGAVSTA 85
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTS G F++ E TL +E+++AK + +P + E +LRY+ GQ+Y +H+D +
Sbjct: 86 RTSFGTFVTRRLTE--TLQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISE 143
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFP-----------FENGMNADGSYDYQKCIGLK 239
G +R+A+ L++L + GGET FP F + + G
Sbjct: 144 NGG----KRIATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDKLSECGWNDGNGFS 199
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
V P++G+ +LF+S NGT DP + H SCP + G K+ ATKWI +
Sbjct: 200 VIPKKGEAVLFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHE 244
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 88/267 (32%), Positives = 128/267 (47%), Gaps = 51/267 (19%)
Query: 40 LSQDVTAARPSARVVESVKDEYKWMPHGQ-------AGDDSVTNIPFQVLSWM--PRALY 90
L Q AARP A V + + P Q AGD V VL M PR +
Sbjct: 62 LPQAFAAARPEAGVNVQIHESGVPEPDLQTDPCCINAGDREV-----HVLMNMRHPRVVV 116
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDE 144
F N + E+C +II A ++ S TVDN G RTS+G+F E++
Sbjct: 117 FGNLLSDEECDAIIAAAGPRMQRSL-------TVDNQSGGEAVNDDRTSNGMFFQRGEND 169
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQR 199
+ +E++IA++ P NGE +L Y+ G +Y HYD F P E G + QR
Sbjct: 170 --LICRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQR 227
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
V + ++YL + GG T FP +GL+V PR+G+ + F P+
Sbjct: 228 VGTLVMYLNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNRPDPAT 272
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQE 286
++HG PV++GEKW+ATKW+R++E
Sbjct: 273 K--TLHGGAPVLEGEKWIATKWLRERE 297
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVISDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K S LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ +
Sbjct: 90 ELTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVX 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGXAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K S LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 106/206 (51%), Gaps = 26/206 (12%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 140
VL P + F T ++C+ +I A LR S L + IRTS G+F
Sbjct: 25 VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKV------VSEIRTSRGMFFE- 77
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ-R 199
E+E+ + IE++I+ + +P + E +L Y GQ+Y +HYD F P P S R
Sbjct: 78 -EEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN--SPSASNNR 134
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+++ ++YL D+E GGET+FP + L+VKP +G L F +
Sbjct: 135 ISTLIIYLNDVEAGGETVFPL---------------LDLEVKPERGSALYFEYFYRQQEL 179
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQ 285
+ ++H S PVV+GEKWVAT+W+R Q
Sbjct: 180 NNLTLHSSVPVVRGEKWVATQWMRRQ 205
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAVNNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECGELIEMSKNKMKRSKVGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 119/247 (48%), Gaps = 26/247 (10%)
Query: 45 TAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSII 104
T A P+A V + E + A S IP PR + F +F + E+C +I
Sbjct: 51 TPAAPTAAVPNPAQAE-PGAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELI 109
Query: 105 NMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPR 163
+ + L+ S + GE +N RTS G E + IE +IA+ T +P
Sbjct: 110 ALGRYRLKRSPVVNPETGE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPV 165
Query: 164 INGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVYLTDLEEGGETMF 218
+GE F +L Y G +Y H+D F+P G + QRVA+ ++YL ++ GG T F
Sbjct: 166 EHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEARQLEVGGQRVATLVIYLNSVQAGGATGF 225
Query: 219 PFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVA 278
P +GL+V P +G+ + F P+G +D ++H PV +GEKW+A
Sbjct: 226 P---------------KLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEKWIA 270
Query: 279 TKWIRDQ 285
TKW+R++
Sbjct: 271 TKWLRER 277
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDKLIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 107/198 (54%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECGELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +I+ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 144
P + NF T E+C +I +A+ + +T+ GE V + RTS + AE
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQD--RTSMNAAFARAEHP 148
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QR 199
+ +E +IA P NGE +LRY+ G +Y +H+D FD Q G +K+ QR
Sbjct: 149 --LIARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQR 206
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
V +FLVYL D++ GG T FP + +++P++G L F + LPNG
Sbjct: 207 VGTFLVYLCDVDAGGATRFP---------------ALNFEIRPKKGMALFFANTLPNGEG 251
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQ 285
+P ++H PVV G K++A+KW+R++
Sbjct: 252 NPLTLHAGVPVVSGVKYLASKWLREK 277
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K S LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 103/206 (50%), Gaps = 25/206 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-NTQGIRTSSGVFISAAEDE 144
PR L N +C +++ +A+ L+ S + +T D N RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVV--NPDTGDENLIDARTSMGAMFQVGE-- 156
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQR 199
L IE +IA VT P +GE F +L YK G +Y H+D F+P+ G + QR
Sbjct: 157 HALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQR 216
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
VA+ ++YL GG T FP IGL+V P +G+ +LF LP+G +
Sbjct: 217 VATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGAL 261
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQ 285
D ++H PV GEKW+ATKW+R+
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREH 287
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 103/206 (50%), Gaps = 25/206 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-NTQGIRTSSGVFISAAEDE 144
PR L N +C +++ +A+ L+ S + +T D N RTS G E
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVV--NPDTGDENLIDARTSMGAMFQVGE-- 156
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQR 199
L IE +IA VT P +GE F +L YK G +Y H+D F+P+ G + QR
Sbjct: 157 HALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQR 216
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
VA+ ++YL GG T FP IGL+V P +G+ +LF LP+G +
Sbjct: 217 VATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGAL 261
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQ 285
D ++H PV GEKW+ATKW+R+
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREH 287
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVI 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K ++ S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMKRSKV----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAVNNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNEFTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEMSKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 109
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 110 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K ++ S + + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSR-----DVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 76/192 (39%), Positives = 109/192 (56%), Gaps = 15/192 (7%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PRA F NF T + ++ +A L+ ST+ GE V + IRTS G+FI D
Sbjct: 61 PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGNDGEGVVDE--IRTSYGMFIRRLADP- 117
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGPQKSQRVASFL 204
+ IE++I+ T LP + E +LRY GQ Y +HYD+ D E GP+ R+A+FL
Sbjct: 118 -VITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPK--WRLATFL 174
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQ-----KCI--GLKVKPRQGDGLLFYSLLPNG 257
+YL+D+EEGGET FP +N + D + + +C + KP+ GD +LFYS PN
Sbjct: 175 MYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFYPNL 233
Query: 258 TIDPTSIHGSCP 269
T+DP ++H CP
Sbjct: 234 TMDPAAMHTGCP 245
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/231 (32%), Positives = 121/231 (52%), Gaps = 23/231 (9%)
Query: 73 SVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI-- 130
S + + LS PR NF + E+C+ II A L PST+ L++G+ + + +
Sbjct: 14 SSRTVEVKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTV-LKQGDQSNGEEKVKD 72
Query: 131 --RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 188
RTS ++ + + + I +++ ++ +P E +L+Y Q Y+ HYD FD
Sbjct: 73 EVRTSETAWL--MDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFD 130
Query: 189 PQEYGPQKS---QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK---C-----IG 237
P+ Y + S R+ + YLT +E+GGET+FPF N +A+ + Q C
Sbjct: 131 PKMYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPFGN-TSAEEHHKIQSWGPCENAVESS 189
Query: 238 LKVKPRQGDGLLFYSLLPNG----TIDPTSIHGSCPVVKGEKWVATKWIRD 284
+KVKP +G ++FY + P+G +D TS+HG C + GEKW A WIR+
Sbjct: 190 IKVKPVRGSAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWIRN 240
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K ++ S + + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSR-----DVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 110/207 (53%), Gaps = 27/207 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDE 144
PR + F N +PE+C ++I A+ + R T+A + G N RTS G+F +
Sbjct: 116 PRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINAD--RTSDGMFFQRGQ-- 171
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQR 199
S + IEE+IA++ P NGE +L Y+ G +Y HYD FDP E G + QR
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQR 231
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
V + ++YL ++GG T FP + L+V P++G+ + F P+ +
Sbjct: 232 VGTLVMYLNTPDKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 276
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQE 286
++HG PV+ G+KW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVIAGDKWIATKWLRERE 301
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDGLIELSKNKIERSKIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKWVAT+W+R
Sbjct: 194 GGAPVTKGEKWVATQWVR 211
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 126/260 (48%), Gaps = 47/260 (18%)
Query: 45 TAARPSARVVESVKDEYKWMPH-------GQAGDDSVTNIPFQVLSWMPRALYFPNFATP 97
AARP A V + + P AGD V ++ V PR + F N +
Sbjct: 67 AAARPEAEVAVQIHESGVPEPDLLTDPCCVHAGDRDV-HVLMSVRH--PRVVVFGNLLSN 123
Query: 98 EQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLI 151
E+C +II A RP +++ TVDN G RTS+G+F E++ + +
Sbjct: 124 EECDAIIAAA----RPR---MQRSLTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISRV 174
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQRVASFLVY 206
E++IA++ P NGE +L Y+ G +Y HYD F P E G + QRV + ++Y
Sbjct: 175 EQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMY 234
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L + GG T FP +GL+V PR+G+ + F P ++HG
Sbjct: 235 LNEPARGGATTFP---------------DVGLQVVPRRGNAVFFSYNRPEPATK--TLHG 277
Query: 267 SCPVVKGEKWVATKWIRDQE 286
PV++GEKW+ATKW+R++E
Sbjct: 278 GAPVLEGEKWIATKWLRERE 297
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 103/198 (52%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 39 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 93
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 94 K----IEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQGQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 102/201 (50%), Gaps = 24/201 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSGVFISAAEDE 144
PR +FP F + E+C + A+ L PS L G + + IRTS G I +E
Sbjct: 140 PRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHP--IRTSDGGAIGPT-NE 196
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 204
+ + I +IA T GE+ +LRY GQ+Y H D E +QR+A+F+
Sbjct: 197 NLVVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAE-----NQRIATFI 251
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
VYL D EGGET FP N ++V+PR GD + F ++ P+GT DP +
Sbjct: 252 VYLNDGFEGGETHFPLLN---------------IQVRPRIGDAIRFDTIRPDGTPDPRLV 296
Query: 265 HGSCPVVKGEKWVATKWIRDQ 285
H PV G KW+AT+WIR +
Sbjct: 297 HAGQPVRNGVKWIATRWIRRE 317
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVI 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 110/209 (52%), Gaps = 15/209 (7%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
++W PR + NF T + K +I +A ++ ST+ G++V+++ ++GV
Sbjct: 4 VAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDSYRTLYTAGV----R 59
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+ ++ IE ++A T + ++ E ILRY IGQ+Y H D E G RVA
Sbjct: 60 RYQDDVVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDDEAG----VRVA 115
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGS----YDYQKCIGLKV--KPRQGDGLLFYSLLP 255
+ L+YL + E GGET FP +N + ++ C V P++GD LLF+S+ P
Sbjct: 116 TVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWSIGP 175
Query: 256 NGTI-DPTSIHGSCPVVKGEKWVATKWIR 283
+GT D + H CPV+ G KW ATKWI
Sbjct: 176 DGTTEDYHASHTGCPVLSGVKWTATKWIH 204
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 90/157 (57%), Gaps = 12/157 (7%)
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+FI +D + IE+KIA T LP+ NGE +LRY+ G+KY+ H+D F + +
Sbjct: 1 MFIPKGKD--AIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVR 58
Query: 196 KSQRVASFLVYLTDLEEGGETMFP---------FENGMNADGSYDYQKCIGLKVKPRQGD 246
RVA+ L+YLTD+ +GGET+FP + + D D K G VKP++GD
Sbjct: 59 GGHRVATVLMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAK-RGTAVKPKRGD 117
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
LLF+SL D S+H CPV++GEKW TKWI
Sbjct: 118 ALLFFSLTTQAKPDTRSLHAGCPVIEGEKWSVTKWIH 154
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + + + IRTSSG F+ E S
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFLEDNELTS 109
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 110 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWMR 227
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I M+K + S + + + IRTSSG F+ ED
Sbjct: 55 PLIVVLANVLSDEECDELIEMSKNKMERSKIGSSR-----DVNDIRTSSGAFL---EDNK 106
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
T IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 107 LT-SKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K S LA K + + IRTS G F+ D++
Sbjct: 52 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFL----DDN 102
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 103 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 161
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 162 YLNDVEEGGETFFP---------------KLNLSVNPRKGMAVYFEYFYQDQSLNELTLH 206
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 207 GGAPVTKGEKWIATQWVR 224
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S + + + IRTSSG F+ +E+
Sbjct: 39 PLIVVLANVLSDEECDGLIELSKNKIERSKIGSSR-----DVNDIRTSSGAFL----EEN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKWVAT+W+R
Sbjct: 194 GGAPVTKGEKWVATQWMR 211
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 119/226 (52%), Gaps = 36/226 (15%)
Query: 70 GDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVD 125
GD SV +VL M PR + F N +PE+C+++I A + R T+A + G E V+
Sbjct: 118 GDRSV-----EVLLTMAHPRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVN 172
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+ RTS G+F ES + IEE+IA + P NGE +L Y+ G +Y HYD
Sbjct: 173 DD---RTSHGMFFQ--RGESPLVQRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYD 227
Query: 186 AFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
FDP E G + QRV + ++YL E+GG T FP D Q ++V
Sbjct: 228 YFDPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFP-----------DAQ----IEV 272
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
P++G+ F P T ++HG PV+ G+KW+ATKW+R++E
Sbjct: 273 APQRGNAAFFSYERP--TPSTRTLHGGAPVLAGDKWIATKWLRERE 316
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 117/217 (53%), Gaps = 31/217 (14%)
Query: 79 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSS 134
QVL M PR + F N + E+C +II A+ + R T+A + G E +++ RTS+
Sbjct: 98 VQVLVSMRNPRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDD---RTSN 154
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G+F E +G + +EE+IA++ P +GE +L Y G +Y H+D F P E G
Sbjct: 155 GMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGT 212
Query: 195 -----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QRV + ++YL + E GG T+FP + L+V PR+G+ +
Sbjct: 213 PTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVPRRGNAVF 257
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F P+ + ++HG PV+ GEKW+ATKW+R++E
Sbjct: 258 FSYERPDPST--RTLHGGAPVLAGEKWIATKWLRERE 292
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDRSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/246 (34%), Positives = 126/246 (51%), Gaps = 36/246 (14%)
Query: 49 PSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAK 108
P+ RV E + DE AGD +V NI + PR + F N + E+C ++I A
Sbjct: 80 PAVRVPEPLLDESP--AQIDAGDRAV-NILLAIAK--PRIVVFGNLLSAEECDALIAAAA 134
Query: 109 LNL-RPSTLALRKG-ETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRING 166
+ R T+A + G E V++ RTS G+F E+ + IEE+IA++ P NG
Sbjct: 135 PRMARSLTVATKTGGEEVNDD---RTSDGMFFQRGENP--VVQRIEERIARLLDWPIENG 189
Query: 167 EAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFE 221
E +L Y+ G +Y HYD FDP E G + QRV + ++YL E+GG T FP
Sbjct: 190 EGLQVLHYRPGAEYKPHYDYFDPGEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFP-- 247
Query: 222 NGMNADGSYDYQKCIGLKVKPRQGDGLLF-YSLLPNGTIDPTSIHGSCPVVKGEKWVATK 280
+ ++V P++G+ + F Y T ++HG PV+ GEKW+ATK
Sbjct: 248 -------------DVHVEVAPQRGNAVFFSYERAHPAT---RTLHGGAPVIAGEKWIATK 291
Query: 281 WIRDQE 286
W+R++E
Sbjct: 292 WLRERE 297
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFHQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 112/208 (53%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSII-NMAKLNLRPSTLALRKG-ETVDNTQGIRTSSGVFISAAED 143
PR + F N +PE+C ++I + A R T+A + G E +++ RTS G+F +
Sbjct: 116 PRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDD---RTSDGMFFQRGQ- 171
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 198
S + IEE+IA++ P NGE +L Y+ G +Y HYD FDP E G + Q
Sbjct: 172 -SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQ 230
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RV + ++YL E+GG T FP + ++V P++G+ + F P+ +
Sbjct: 231 RVGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQRGNAVFFSYERPHPS 275
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
++HG PV+ GEKW+ATKW+R++E
Sbjct: 276 T--RTLHGGAPVLAGEKWIATKWLRERE 301
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D+
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDD 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 4/127 (3%)
Query: 161 LPRI-NGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFP 219
+P I NGE+ IL Y+ G+KY HYD F + R+A+ L+YL+D+ +GGET+FP
Sbjct: 3 IPSIENGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFP 62
Query: 220 -FENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKW 276
E+ ++ + +C G VKPR+GD LLF+SL N T D S+HGSCPV++GEKW
Sbjct: 63 NAESKLSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKW 122
Query: 277 VATKWIR 283
ATKWI
Sbjct: 123 SATKWIH 129
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 117/217 (53%), Gaps = 31/217 (14%)
Query: 79 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKG-ETVDNTQGIRTSS 134
QVL M PR + F N + E+C +II A+ + R T+A + G E +++ RTS+
Sbjct: 98 VQVLVSMRNPRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDD---RTSN 154
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
G+F E +G + +EE+IA++ P +GE +L Y G +Y H+D F P E G
Sbjct: 155 GMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGT 212
Query: 195 -----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ QRV + ++YL + E GG T+FP + L+V PR+G+ +
Sbjct: 213 PTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVPRRGNAVF 257
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
F P+ + ++HG PV+ GEKW+ATKW+R++E
Sbjct: 258 FSYERPDPST--RTLHGGAPVLAGEKWIATKWLRERE 292
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K S LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECGELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTTKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+ T+W+R
Sbjct: 210 GGAPVTKGEKWITTQWVR 227
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K S LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 106/227 (46%), Gaps = 29/227 (12%)
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
G D + IP + P+ + F N ++C +I + L ST ET
Sbjct: 77 GNVIDAADRRIPVLIRCERPQIVVFGNVLDQDECDEMIQRSMHKLEQSTTV--NAET--G 132
Query: 127 TQGI---RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
TQ + RTS G + ED + IE ++A + P NGE +LRY G +Y SH
Sbjct: 133 TQEVIRHRTSHGTWFQNGED--ALIRRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSH 190
Query: 184 YDAFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 238
YD F P G QRVA+ +VYL D+ GGET+FP G+
Sbjct: 191 YDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFPEA---------------GI 235
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
V PR+GD + F + +DP ++H PV GEKW+ TKW+R++
Sbjct: 236 SVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKWIMTKWVRER 282
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWMR 211
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 75/219 (34%), Positives = 113/219 (51%), Gaps = 22/219 (10%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSGVF 137
+ LS P+A + F + E+C +I + +L+ ST+ K +T +D+ +RTS G F
Sbjct: 2 EKLSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDTGVLDD---VRTSFGTF 58
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
+ D+ L IE ++ + + N E +L+Y GQ+Y H D P
Sbjct: 59 LPKKYDD--VLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGL----TSPNGG 112
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNA-----DGSYD------YQKCIGLKVKPRQGD 246
+R+A+ L++L + E+GGET FP + A G D ++ GL VKPR+GD
Sbjct: 113 RRIATVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDELSDCAWRDGRGLAVKPRRGD 172
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+LF+S NG D S H SCP V G KW ATKWI ++
Sbjct: 173 AVLFFSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEK 211
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S + G + D IRTSSG F+ E S
Sbjct: 71 PLIVVLANVLSDEECDELIEISKNKMERSKI----GSSRD-VNDIRTSSGAFLEDNELTS 125
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 126 K----IEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 180
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 181 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 225
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 226 GGAPVTKGEKWIATQWVR 243
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 106/208 (50%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI---RTSSGVFISAAE 142
P+ + F N +PE+C +I ++ L+ ST+ + +G+ RTS G++ E
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIV----DPATGQEGVIRNRTSEGIWYQRGE 166
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 197
D ++ ++++IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 167 D--AFIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGG 224
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ +VYL D+ +GGET+FP GL V +QG + F +
Sbjct: 225 QRVATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQR 269
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP ++HG PV G+KW+ TKW+R++
Sbjct: 270 QLDPLTLHGGAPVHAGDKWIMTKWMRER 297
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 109/215 (50%), Gaps = 23/215 (10%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
+IP PR + F +F + ++C +I + + L+ S + + +N RTS G
Sbjct: 86 DIPILFAIETPRIVLFQHFLSDQECDELIAIGRNRLKRSPV-VNPDTGEENLISARTSQG 144
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
E + IE +IA+ +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 145 GMFQVGEHP--LIAKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEA 202
Query: 196 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 203 RQLEVGGQRVATMVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 247
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 VYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 109/208 (52%), Gaps = 27/208 (12%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAED 143
+P + F N + +C++++ +A+ L R T+ ++ G N RTS G+F A
Sbjct: 89 LPDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRD--RTSQGMFF--ARG 144
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 198
E+ + +E +IA++ P GE +LRY+ G +Y HYD FDP E G + Q
Sbjct: 145 ENPLVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQ 204
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RVA+ ++YL + E+GG T+FP IGL+V PR+G + F P
Sbjct: 205 RVATLIMYLNEPEQGGATVFP---------------DIGLQVTPRRGTAVFFS--YPAAN 247
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+ HG PV GEKW+ATKW+R++E
Sbjct: 248 PASLTRHGGEPVKAGEKWIATKWLRERE 275
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 110/219 (50%), Gaps = 27/219 (12%)
Query: 72 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 131
D I F+ P+ + F + + E+C +I A+ L+ ST + + D Q +R
Sbjct: 83 DVTVRIRFE----RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQ-LR 137
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE 191
TS G + ED ++ ++ +I+ + P +GE IL Y+ G +Y H+D F P +
Sbjct: 138 TSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQ 195
Query: 192 YG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
G + QRVA+ +VYL+D+E GGET+FP GL V RQG
Sbjct: 196 NGSVLHTARGGQRVATLIVYLSDVEGGGETVFP---------------DAGLAVMARQGG 240
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+ F + +DP ++HG PV G+KW+ TKW+R++
Sbjct: 241 AIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 279
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTSSG F+ D++
Sbjct: 39 PLIVVLGNVLSDEECGELIELSK-----NKLARSKVGSSRDVNDIRTSSGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWMR 211
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 110/219 (50%), Gaps = 27/219 (12%)
Query: 72 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 131
D I F+ P+ + F + + E+C +I A+ L+ ST + + D Q +R
Sbjct: 86 DVTVRIRFE----RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQ-LR 140
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE 191
TS G + ED ++ ++ +I+ + P +GE IL Y+ G +Y H+D F P +
Sbjct: 141 TSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQ 198
Query: 192 YG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
G + QRVA+ +VYL+D+E GGET+FP GL V RQG
Sbjct: 199 NGSVLHTARGGQRVATLIVYLSDVEGGGETVFP---------------DAGLAVMARQGG 243
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+ F + +DP ++HG PV G+KW+ TKW+R++
Sbjct: 244 AIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 282
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 105/208 (50%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI---RTSSGVFISAAE 142
P+ + F N +PE+C +I ++ L+ ST+ + +G+ RTS G++ E
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIV----DPATGQEGVIRNRTSEGIWYQRGE 166
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 197
D ++ ++ +IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 167 D--AFIERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGG 224
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ +VYL D+ +GGET+FP GL V +QG + F +
Sbjct: 225 QRVATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQR 269
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP ++HG PV G+KW+ TKW+R++
Sbjct: 270 QLDPLTLHGGAPVRAGDKWIMTKWMRER 297
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 55 PLIVVLGNVLSDEECDELIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 105
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 106 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 165 YLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 210 GGAPVTKGEKWIATQWVR 227
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + LA K + + IRTS G F+ D++
Sbjct: 39 PLIVVLGNVLSDEECDKLIELSK-----NKLARSKVGSSRDVNDIRTSKGAFL----DDN 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE++I+ + +P +GE +IL Y++ Q+Y +HYD F + + R+++ ++
Sbjct: 90 ELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF-AEHSRSAANNRISTLVM 148
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + L V PR+G + F + +++ ++H
Sbjct: 149 YLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTLH 193
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV KGEKW+AT+W+R
Sbjct: 194 GGAPVTKGEKWIATQWVR 211
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 108/208 (51%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 142
P+ + F N +P++C +I ++ L+ ST+ A + + + N RTS G++ E
Sbjct: 124 PQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRN----RTSEGIWYQRGE 179
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 197
D ++ ++++IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 180 D--ALIERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGG 237
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ +VYL D+ +GGET+FP GL V +QG + F +
Sbjct: 238 QRVATLVVYLNDVPDGGETIFPE---------------AGLSVAAQQGGAVYFRYMNGRR 282
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 283 QLDPLTLHGGAPVLSGDKWIMTKWVRER 310
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/215 (32%), Positives = 110/215 (51%), Gaps = 23/215 (10%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
++ +V P+ + F + + ++C +I A+ L+ ST + D Q +RTS G
Sbjct: 106 DVAVRVRFERPQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQ-LRTSEG 164
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP- 194
+ ED ++ ++ +I+ + P +GE IL Y G +Y H+D F P + G
Sbjct: 165 FWFQRCED--AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSV 222
Query: 195 ----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ +VYL+D+ GGET+FP NA GL V RQG + F
Sbjct: 223 LHTSRGGQRVATLIVYLSDVAGGGETVFP-----NA----------GLAVMARQGGAIYF 267
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
L + +DP ++HG PV GEKW+ TKW+R++
Sbjct: 268 RYLNGHRQLDPLTLHGGAPVTNGEKWIMTKWMRER 302
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 112/219 (51%), Gaps = 17/219 (7%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
++ + LS P F ++ I+ ++ +L+PST+ L G RTS+
Sbjct: 105 DVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTT 164
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+S+++ LD I++++A +T +P + E +LRY+ QKY+ H D F P E+
Sbjct: 165 YFLSSSK--HSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYF-PVEHHKN 221
Query: 196 KSQ-----------RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC-IGLKVKPR 243
R+ + Y++D+ +GG T+FP G A + C GLKV P+
Sbjct: 222 SPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGG--APRPQSMKDCSTGLKVSPK 279
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
+ ++FYS+LPNG DP S+HG CPV G K+ KW+
Sbjct: 280 KRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 95/154 (61%), Gaps = 7/154 (4%)
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQG 129
DDS +++SW PRA + NF E+CK +I +AK ++ ST+ K G++ D+
Sbjct: 70 DDSKNERWVEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 125
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
+RTSSG F++ D+ T+ IE++I+ T +P +GE +L Y+IGQKY HYD F
Sbjct: 126 VRTSSGTFLARGRDK--TIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 183
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENG 223
+ QR+A+ L+YL+D+EEGGET+FP G
Sbjct: 184 EYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKG 217
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 108/215 (50%), Gaps = 23/215 (10%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
+IP PR + F +F + +C +I + + L+ S + + +N RTS G
Sbjct: 86 DIPILFAIETPRIVLFQHFLSDAECDELIAIGRNRLKRSPV-VNPDTGEENLISARTSQG 144
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
E + IE +IA+ +P +GE F +L Y+ G +Y H+D F+P G
Sbjct: 145 GMFQVGEHP--LIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEA 202
Query: 196 KS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ QRVA+ ++YL ++ GG T FP +GL+V P +G+ + F
Sbjct: 203 RQLEVGGQRVATMVIYLNSVQAGGATGFP---------------KLGLEVAPVKGNAVFF 247
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+GT+D ++H PV +GEKW+ATKW+R++
Sbjct: 248 VYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 112/220 (50%), Gaps = 32/220 (14%)
Query: 73 SVTNIPFQV-LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI- 130
S NIP + +S PR P F T E+C+ +I +K L+P + + G+
Sbjct: 50 STDNIPKLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCN---------EISSGVH 100
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP- 189
R+ G+F+ E++ I ++ L + E ++RY G++ ++H+D F+P
Sbjct: 101 RSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLTE-SSEVMQVIRYNPGEETSAHFDYFNPL 159
Query: 190 QEYGPQK----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 245
G K QR+ + L+YL D+EEGGET FP + +KVKP +G
Sbjct: 160 TTNGAMKIGLYGQRICTILMYLADVEEGGETSFPE---------------VNVKVKPIKG 204
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
D +LFY+ PNG +DP S+H PV+KG KW+A K + +
Sbjct: 205 DAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVNQK 244
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 105/201 (52%), Gaps = 27/201 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C ++I ++K L+ S + N +RTSS F+ E ES
Sbjct: 37 PLIVVLGNVLSDEECDALIGLSKDKLKRSKIG-----NTRNENDMRTSSSTFME--EGES 89
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ +E++I+++ +P NGE IL YKIGQ+Y +H+D F + R+++ ++
Sbjct: 90 EVVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFK-----NASNPRISTLVM 144
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + V P++G + F N ++ ++H
Sbjct: 145 YLNDVEEGGETYFP---------------KLNFSVSPQKGMAVYFEYFYDNQELNDLTLH 189
Query: 266 GSCPVVKGEKWVATKWIRDQE 286
G PV+ G+KW AT+W+R ++
Sbjct: 190 GGAPVIIGDKWAATQWMRRKQ 210
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 106/200 (53%), Gaps = 24/200 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I +A ++ S + + E +RTSS +FI +DE+
Sbjct: 33 PLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTTREEN-----ELRTSSSMFIE--DDEN 85
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ ++++I+ + +P +GE ILRY GQ+Y +H+D F + R+++ ++
Sbjct: 86 LIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSD--SKITNNRISTLVM 143
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+E+GGET FP + V PR+G + F + T++ ++H
Sbjct: 144 YLNDVEQGGETFFPH---------------LKFSVSPRKGMAVYFEYFYSDQTLNDFTLH 188
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
G PVV+GEKWVAT+W+R Q
Sbjct: 189 GGAPVVEGEKWVATQWMRKQ 208
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 104/191 (54%), Gaps = 25/191 (13%)
Query: 99 QCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAK 157
+C +I + + ++ R S + G+ + T R S G F++A+ D ++ I+ +IA+
Sbjct: 107 ECDRLIEIGREHVQRSSVVDPDSGKEI--TIEERRSEGAFVNASTD--ALVETIDRRIAE 162
Query: 158 VTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVYLTDLEE 212
+ P NGE +ILRY +G +Y HYD F ++ G + QR+A+ ++YL ++E+
Sbjct: 163 LFRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQ 222
Query: 213 GGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVK 272
GG+T FP IGL + PR+G L F + G DP ++H PV K
Sbjct: 223 GGDTTFP---------------DIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEK 267
Query: 273 GEKWVATKWIR 283
GEKW+ATKWIR
Sbjct: 268 GEKWIATKWIR 278
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 142
P+ + F N +PE+C +I ++ L+ ST+ A + + + N RTS G++ E
Sbjct: 118 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRN----RTSEGIWYQRGE 173
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 197
D ++ ++++IA + P NGE IL Y +Y H+D F P + G +
Sbjct: 174 D--AFIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGG 231
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ ++YL D+ +GGET+FP GL V +QG + F +
Sbjct: 232 QRVATLVIYLNDVPDGGETIFPEA---------------GLSVAAKQGGAVYFRYMNGQR 276
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP ++HG PV G+KW+ TKW+R++
Sbjct: 277 QLDPLTLHGGAPVRAGDKWIMTKWMRER 304
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 115/212 (54%), Gaps = 21/212 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ +++ P + F + +P++ + N+A+ L+ +T+ + G+ V ++ +RTS G
Sbjct: 308 LKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTVHV-NGKYV--SRRVRTSKGA 364
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-QEYGPQ 195
++ D + IE ++ +T L EA+NI+ Y +G Y +HYD F+ ++ +
Sbjct: 365 WLE--RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSE 422
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R+A+ L YL+D+E+GG T+FP + L V P +G L +Y+LL
Sbjct: 423 TGDRIATVLFYLSDVEQGGATVFP---------------NLKLAVSPERGMALFWYNLLD 467
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NGT D ++HG CPV+ G KWV T WI ++ Q
Sbjct: 468 NGTGDTRTLHGGCPVLVGSKWVMTLWIHERAQ 499
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 113/210 (53%), Gaps = 24/210 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I +VL P + F N + E+C+++I+ A L S LA ++ IRTSSG+
Sbjct: 21 ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKE------ISSIRTSSGM 74
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F E+E+ + IE++I+ + LP + E +L Y+ GQ++ +H+D F P +
Sbjct: 75 FFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGPN-HPSSS 131
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
+ R+++ +VYL D+EEGG T FP +G+ P++G + F +
Sbjct: 132 NNRISTLVVYLNDVEEGGVTTFP---------------NLGIVNVPKKGTAVYFEYFYND 176
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
++ ++H PV++GEKWVAT+W+R ++
Sbjct: 177 QKLNELTLHSGEPVIQGEKWVATQWMRKKQ 206
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 106/202 (52%), Gaps = 25/202 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P L N + E+C +I ++K ++ S + + IRTSSG+F +E+E
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAER-----EVNSIRTSSGMFFEESENE- 92
Query: 146 GTLDLIEEKIAKVTMLPRIN-GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 204
+ IE +++K+ M P I E IL+Y Q+Y +H+D F K+ R+++ +
Sbjct: 93 -LVHQIERRLSKI-MGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKAS-KNNRISTLV 149
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL D+EEGGET FP +GL + P +G + F + ++ ++
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSISPTKGMAVYFEYFYSDAELNDRTL 194
Query: 265 HGSCPVVKGEKWVATKWIRDQE 286
HG PV+KGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 106/202 (52%), Gaps = 25/202 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P L N + E+C +I ++K ++ S + + IRTSSG+F +E+E
Sbjct: 39 PLVLVLGNVLSNEECDELIRLSKDKMQRSKIGAAR-----EVNSIRTSSGMFFDESENE- 92
Query: 146 GTLDLIEEKIAKVTMLPRIN-GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 204
+ IE +++K+ M P I E IL+Y Q+Y +H+D F K+ R+++ +
Sbjct: 93 -LVHQIERRLSKI-MGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKAS-KNNRISTLV 149
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL D+EEGGET FP +GL V P +G + F + ++ ++
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 265 HGSCPVVKGEKWVATKWIRDQE 286
HG PV+KGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 106/202 (52%), Gaps = 25/202 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P L N + E+C +I ++K ++ S + + IRTSSG+F +E+E
Sbjct: 39 PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAAR-----EVNSIRTSSGMFFEESENE- 92
Query: 146 GTLDLIEEKIAKVTMLPRIN-GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 204
+ IE +++K+ M P I E +L+Y Q+Y +H+D F K+ R+++ +
Sbjct: 93 -LVHQIERRLSKI-MGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKAS-KNNRISTLV 149
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL D+EEGGET FP +GL V P +G + F + ++ ++
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194
Query: 265 HGSCPVVKGEKWVATKWIRDQE 286
HG PV+KGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 112/214 (52%), Gaps = 27/214 (12%)
Query: 80 QVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGV 136
+V+S M P A+ +F + +C+ +I++A+ L ST+ G V G R+S G+
Sbjct: 94 RVISRMQRPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNV--VAGHRSSDGM 151
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDP--QE 191
F E + + +E +IA++T LP NGE +L Y++G + H D A +P QE
Sbjct: 152 FFRLGE--TPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQE 209
Query: 192 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+ QRV + L+YL D+E GGETMFP G V PR+G L F
Sbjct: 210 SIARSGQRVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFE 254
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G DP+S+H S P+ GEKWVATKWIR +
Sbjct: 255 YGNRFGLADPSSLHTSTPLRVGEKWVATKWIRTR 288
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 103/209 (49%), Gaps = 35/209 (16%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 139
P + +P +C +I + + +R S++ VD G R S G F++
Sbjct: 91 PVVALLADVLSPRECDRLIEIGRERVRRSSV-------VDPDSGGEVLIDARKSEGAFVN 143
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK--- 196
+ D + I+ +IA++ P NGE +ILRY G +Y H+D F ++ G +
Sbjct: 144 GSTDP--LVATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQ 201
Query: 197 --SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
QR+A+ ++YL +EEGG+T FP IGL + PR+G L F +
Sbjct: 202 RGGQRIATLILYLNQVEEGGDTTFPD---------------IGLTIHPRRGAALYFEYVN 246
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G DP ++H PV +GEKW+ATKW+R
Sbjct: 247 ALGQTDPRTLHAGMPVERGEKWIATKWMR 275
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 110/227 (48%), Gaps = 32/227 (14%)
Query: 68 QAGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
QAGD V QVL+ + PR + F N E+C ++I +A+ ++ S + D
Sbjct: 81 QAGDRQV-----QVLASLLHPRVIVFGNLLAAEECDALIALARRQIKRSPV-FDPDTGQD 134
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
RTS G+F + +E +IA + P NGE +LRY G +Y HYD
Sbjct: 135 QQHQARTSEGMFFGRGANP--LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYD 192
Query: 186 AFDPQEYGPQKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
FDP G + + QRVAS ++YL +GG T FP + L+V
Sbjct: 193 YFDPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAH---------------LEV 237
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
P +G+ + F P+ ++HG PVV+GEKWVATKW+R++
Sbjct: 238 APIKGNAVYFSYDRPHPMTG--TLHGGAPVVEGEKWVATKWLRERRH 282
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 105/211 (49%), Gaps = 37/211 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + E+C ++ +A RP L + ETVDN+ G RTS G+F
Sbjct: 91 LPRVVVFGGLLSDEECDELVALA----RPR---LARSETVDNSTGGSEVNAARTSDGMFF 143
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
E ++ IE +IA++ P GE +LRY+ G +Y H+D FDP G
Sbjct: 144 ERGEKP--LIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANIL 201
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
+ QRV + ++YL GG T FP +GL+V+P +G+ + F
Sbjct: 202 RRGGQRVGTVVMYLNTPAGGGATTFP---------------EVGLEVQPVKGNAVFFSYE 246
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
P + ++HG PV+ GEKWVATKW+R+
Sbjct: 247 RPLAST--RTLHGGAPVLDGEKWVATKWMRE 275
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 67/225 (29%), Positives = 114/225 (50%), Gaps = 17/225 (7%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
++ + LS P F ++ I+N++ +L+PS + L G RTS+
Sbjct: 198 DVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTT 257
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+ + D +D I+++++ +T +P + E +LRY+ QKY+ H D F P E+
Sbjct: 258 YFLPS--DAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYF-PVEHHKN 314
Query: 196 KSQ-----------RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-GLKVKPR 243
R+ + Y++D+ +GG T+FP G A + C GL V P+
Sbjct: 315 APHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGG--APRPTSMKDCTTGLNVPPK 372
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
+ ++FYS+LPNG DP S+HG CPV +G K+ KW+ ++ +Y
Sbjct: 373 KRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWVWNKARY 417
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 106/195 (54%), Gaps = 23/195 (11%)
Query: 69 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVD 125
A D + + +V+SW PR + F NF + E+C + +A+ L ST+ A KG D
Sbjct: 50 AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD 109
Query: 126 NTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
+RTSSG+F+++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D
Sbjct: 110 ----VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHD 165
Query: 186 AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-------GL 238
F + QRVA+ L+YLTD EGGET FP DG +CI GL
Sbjct: 166 YFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP----QAGDG-----ECICGGRLVRGL 216
Query: 239 KVKPRQGDGLLFYSL 253
VKP +GD +LF+S+
Sbjct: 217 CVKPNKGDAVLFWSM 231
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 111/210 (52%), Gaps = 24/210 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I +VL P + F N + E+C+++I+ A L S LA ++ IRTSSG+
Sbjct: 21 ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKE------ISSIRTSSGM 74
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F E+E+ + IE++I+ + LP + E +L Y+ GQ++ H+D F P +
Sbjct: 75 FFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGPN-HPSSS 131
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
+ R+ + +VYL D+EEGG T FP +G+ P++G + F +
Sbjct: 132 NNRICTLVVYLNDVEEGGVTTFP---------------NLGIVNVPKKGTAVYFEYFYND 176
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
++ ++H PV++GEKWVAT+W+R ++
Sbjct: 177 QKLNELTLHSGEPVIQGEKWVATQWMRKKQ 206
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 104/207 (50%), Gaps = 27/207 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSSGVFISAAED 143
P+ + F + + +C +I ++ L+ ST L E V RTS GV+ ED
Sbjct: 97 PQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRN---RTSEGVWYRRGED 153
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-----QKSQ 198
+ + +E +IA +T P NGE +L Y +Y+ H+D F P + G Q Q
Sbjct: 154 Q--LIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQ 211
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
RVA+ ++YL D+ +GGET+FP GL V + G + F +
Sbjct: 212 RVATLIIYLNDVADGGETVFP---------------TAGLSVAAQAGGAVYFRYMNAERQ 256
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP+++HG PV+ G+KW+ TKW+R++
Sbjct: 257 LDPSTLHGGAPVLAGDKWIMTKWMRER 283
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 107/198 (54%), Gaps = 24/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I ++K + S +A VDN +RTSS FI E+E+
Sbjct: 39 PLIVVLGNVLSDEECDELIRLSKDRINRSKIA---NANVDN---MRTSSSTFIE--ENEN 90
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE++I+++ +P GE IL Y++GQ+Y SH+D F + + R+++ ++
Sbjct: 91 IIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFS-SPHNAINNPRISTLVM 149
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL+D+E+GGET FP + V P++G + F + T++ ++H
Sbjct: 150 YLSDVEQGGETYFP---------------KLHFSVSPQKGMAVYFEYFYNDQTLNELTLH 194
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV+ G+KW AT+W+R
Sbjct: 195 GGAPVIVGDKWAATQWMR 212
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 89/150 (59%), Gaps = 2/150 (1%)
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+F++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F +
Sbjct: 1 MFLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKR 60
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
QR+A+ L+YL+D EGGET FP N + S + GL VKP +G+ +LF+S+
Sbjct: 61 GGQRIATMLMYLSDNVEGGETYFP--NIGSGQCSCGGKTVEGLSVKPTKGNAVLFWSMGL 118
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+G DP S+HG C V+ GEKW ATKW+R +
Sbjct: 119 DGQSDPLSVHGGCEVLAGEKWSATKWMRQK 148
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 108/208 (51%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 142
P+ + F + +P++C +I ++ L+ ST A K + + N RTS G++ E
Sbjct: 103 PQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRN----RTSEGIWYQRGE 158
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 197
D ++ ++ +I+ + P NGE +LRY +Y H+D F P + G Q
Sbjct: 159 DP--FIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGG 216
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ ++YL D+ +GGET+FP E GM+ S QG + F +
Sbjct: 217 QRVATLVIYLNDVPDGGETIFP-EAGMSVAAS--------------QGGAVYFRYMNGRR 261
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 262 QLDPLTLHGGAPVLSGDKWIMTKWMRER 289
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 87/144 (60%), Gaps = 15/144 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LSW PRA + F + ++C ++N+AK + S +A DN G +RTSSG
Sbjct: 40 LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVA-------DNDSGKSIMSQVRTSSG 92
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+S ED+ + IE+++A T LP N E+ IL Y++GQKY++H+D F + +
Sbjct: 93 TFLSKHEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKR 150
Query: 196 KSQRVASFLVYLTDLEEGGETMFP 219
RVA+ L+YLTD+++GGET+FP
Sbjct: 151 GGHRVATVLMYLTDVKKGGETVFP 174
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 102/209 (48%), Gaps = 35/209 (16%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFIS 139
P + F + E+C+ +I ++ L PS + VD G R+S G +
Sbjct: 96 PDIVVVDEFMSGEECEQLIEQSRRKLTPSAI-------VDPQTGKFQVIADRSSEGTYFQ 148
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----P 194
E S + ++ +I+++ P +GE IL Y +G +Y H+D F E G
Sbjct: 149 RGE--SPLISRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMT 206
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
Q QRVA+ ++YL ++ EGGET+FP +G+ + P++G F
Sbjct: 207 QSGQRVATLVMYLNEVTEGGETVFPD---------------VGISITPKRGSAAYFAYCN 251
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
G +DP ++HG PV+ GEKW+ATKW+R
Sbjct: 252 SLGQVDPATLHGGAPVLTGEKWIATKWMR 280
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 105/208 (50%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA---LRKGETVDNTQGIRTSSGVFISAAE 142
P+ + F + +P++C +I ++ L+ ST K + + N RTS G++ E
Sbjct: 106 PQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRN----RTSEGIWYQRGE 161
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 197
D ++ ++ +I+ + P NGE IL Y +Y H+D F P + G Q
Sbjct: 162 D--AFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGG 219
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ ++YL D+ +GGET+FP G+ V RQG + F +
Sbjct: 220 QRVATLVIYLNDVPDGGETIFPE---------------AGISVAARQGGAVYFRYMNGQR 264
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 265 QLDPLTLHGGAPVLGGDKWIMTKWMRER 292
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 74/216 (34%), Positives = 106/216 (49%), Gaps = 40/216 (18%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 139
PR + F F + ++C +++ +A+ L + ETVDN G RTS G+F
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLA-------RSETVDNDTGGSEVNEARTSQGMFFM 152
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP----- 194
E E + IE +IA + P NGE +L Y+ G +Y HYD FDP + G
Sbjct: 153 RGEGE--LISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILK 210
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF-YSL 253
+ QRV + ++YL E GG T FP + L+V P +G+ + F Y
Sbjct: 211 RGGQRVGTLVMYLNTPERGGGTTFP---------------DVNLEVAPIKGNAVFFSYER 255
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
T S+HG PV+ GEKWVATKW+R Q ++D
Sbjct: 256 AHPST---RSLHGGAPVLAGEKWVATKWLR-QARFD 287
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/295 (29%), Positives = 129/295 (43%), Gaps = 53/295 (17%)
Query: 31 LAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALY 90
L G LG + T R + R D+ G D V + + LS PRA
Sbjct: 40 LIGWLGEVPRADATTTTREAPRA-----DDASASSLGPTRDIGVGDARVEKLSDSPRAYL 94
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRK---GETVDNTQGIRTSSGVFISAAEDESGT 147
F F T E+C +I ++ +L+ ST+ GE RTS+G F+ D+
Sbjct: 95 FREFLTKEECAHLIEISTPHLKRSTVVGDDALLGEADGRRSDYRTSTGAFLPKLYDD--V 152
Query: 148 LDLIEEKIAKVTMLPRINGE---AFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 204
+ +E ++ + LP N E A ++LRY++GQ+Y H D F + G +RVA+ L
Sbjct: 153 VTRVERRVEAFSRLPFENQEQLQARSLLRYELGQEYRDHVDGFATENGG----KRVATVL 208
Query: 205 VYLTDLEEGGETMFPFENGMNADGSY----------DYQKCI-----------------G 237
++L + EEGGET FP NG ++ + C G
Sbjct: 209 MFLAEPEEGGETAFP--NGEPSEAVAARVAAQRARGELSDCAWRGGGGGTAGGGRGNLRG 266
Query: 238 LKVKPRQGDGLLFYSLLPNGT-------IDPTSIHGSCPVVKGEKWVATKWIRDQ 285
VKPR GD +LF+S + + S H SCP +G KW ATKWI ++
Sbjct: 267 FAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTHASCPTTRGVKWTATKWIHER 321
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 107/208 (51%), Gaps = 29/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAE 142
P+ + F + +P++C +I ++ L+ ST A K + + N RTS G++ E
Sbjct: 103 PQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRN----RTSEGIWYQRGE 158
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKS 197
D ++ ++ +I+ + P NGE IL Y +Y H+D F P + G Q
Sbjct: 159 DP--FIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGG 216
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
QRVA+ ++YL D+ +GGET+FP E GM+ S QG + F +
Sbjct: 217 QRVATLVIYLNDVPDGGETIFP-EAGMSVAAS--------------QGGAVYFRYMNDRR 261
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+DP ++HG PV+ G+KW+ TKW+R++
Sbjct: 262 QLDPLTLHGGAPVLAGDKWIMTKWMRER 289
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 98/203 (48%), Gaps = 23/203 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P A+ F T +C +I +A+ L ST+ + D G R+S G F AE +
Sbjct: 102 PAAVLLDEFLTGSECDQLIALARPRLSRSTV-VDPVTGRDVAAGHRSSDGTFFRLAE--T 158
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRV 200
+ +E +IA +T L NGE +LRY+ G + H D +E + QRV
Sbjct: 159 PLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQRV 218
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
+ L+YL D+E GGET+FP +G V PR+G L F G D
Sbjct: 219 GTLLMYLNDVEGGGETVFP---------------QVGCSVVPRRGQALYFEYCNRAGVCD 263
Query: 261 PTSIHGSCPVVKGEKWVATKWIR 283
P S+H S P+ GEKWVATKWIR
Sbjct: 264 PASLHASTPLRSGEKWVATKWIR 286
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 113/230 (49%), Gaps = 44/230 (19%)
Query: 68 QAGDDSVTNIPFQVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD 125
+AGD V VL+ M PR + F + ++C ++ +A+ L LR ETVD
Sbjct: 22 RAGDREV-----HVLATMALPRVVVFGGLLSEQECDELVALAQPRL------LRS-ETVD 69
Query: 126 NTQG------IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQK 179
N+ G RTS G+F E+ ++ IE +IA++ P GE +L Y+ G +
Sbjct: 70 NSTGGSEVNAARTSDGMFFE--RGETPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQ 127
Query: 180 YNSHYDAFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQK 234
Y H+D FDP G + QRV + ++YL GG T FP
Sbjct: 128 YKPHHDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFP--------------- 172
Query: 235 CIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
+GL+V+P +G+ + F P + ++HG PV+ GEKWVATKW+R+
Sbjct: 173 EVGLEVQPIKGNAVFFSYERPLASTR--TLHGGAPVLDGEKWVATKWLRE 220
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 103/212 (48%), Gaps = 35/212 (16%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFI 138
+PR + N + ++C +I M++ ST T+DN GI RTS I
Sbjct: 91 IPRIVVLGNVLSDDECDAIAAMSRTRFARST-------TIDNASGINRFDDSRTSESAHI 143
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS- 197
E E + I+ ++A ++ P +GE + +Y+ G +Y H+D FDP G K
Sbjct: 144 QRGETE--LIARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHL 201
Query: 198 ----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QR+A+ ++YLTD+EEGG T FP IGL V P++G L F +
Sbjct: 202 EKSGQRLATIILYLTDVEEGGGTSFP---------------GIGLDVHPQKGGALFFRNT 246
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P G D + H PV KG K +A KW+R++
Sbjct: 247 TPYGVPDRKTQHAGLPVEKGTKIIANKWLREK 278
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 111/213 (52%), Gaps = 22/213 (10%)
Query: 78 PF--QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF ++LS P + + + TP + ++ N++K +++ + K + RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 371
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGP 194
V++++ E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ PQ G
Sbjct: 372 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQHRG- 428
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
R+A+ L YL+D+ +GG T+FP N + V+PRQGD LL+Y+L
Sbjct: 429 -GGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 472
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G + ++H SCP+++G KW KWI + Q
Sbjct: 473 DRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 505
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 101/191 (52%), Gaps = 22/191 (11%)
Query: 114 STLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILR 173
ST+A E + RTSS ++S D + I ++A++ LP E +L
Sbjct: 4 STIAEAGNEAKNGVGSARTSSTAWLSKTADP--LVAKIRTRVAELVKLPMELAEDMQVLH 61
Query: 174 YKIGQKYNSHYDAFDPQEY-----GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADG 228
Y Q Y +H+D FDP Y P +++ + F YL+D+EEGGET+FPF NG +
Sbjct: 62 YSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFF-YLSDVEEGGETVFPFANGDDRRV 120
Query: 229 SYDYQKCI-GLKVKPRQGDGLLFYSLLPN------------GTIDPTSIHGSCPVVKGEK 275
+ D+ C GLKVKP+ G+ ++FYS+L +D S+HG C V+KG+K
Sbjct: 121 T-DFADCSRGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIKGDK 179
Query: 276 WVATKWIRDQE 286
W A WI +++
Sbjct: 180 WAANYWIANKK 190
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 104/198 (52%), Gaps = 23/198 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C+ +I M++ L+ S + TVD+ IRTSS +F E+E
Sbjct: 39 PLIVILGNVLSDEECEGLIRMSEDKLKRSKIG--NTRTVDD---IRTSSSMFFEEGENE- 92
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE +++++ +P +GE +L Y IGQ+Y +H+D F + R+++ ++
Sbjct: 93 -LVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFF-SSSSRAASNPRISTLVM 150
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + V P++G + F N ++ ++H
Sbjct: 151 YLNDVEEGGETYFP---------------KLNFSVNPQKGSAVYFEYFYDNQDLNDLTLH 195
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV+KG KW AT+W+R
Sbjct: 196 GGAPVIKGSKWAATQWMR 213
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 110/213 (51%), Gaps = 20/213 (9%)
Query: 78 PF--QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF ++LS P + + + TP + ++ N++K +++ + K + RTS+
Sbjct: 375 PFKTELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 434
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGP 194
V++++ E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ PQ
Sbjct: 435 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQLEHR 492
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
R+A+ L YL+D+ +GG T+FP N + V+PRQGD LL+Y+L
Sbjct: 493 GGGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 537
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G + ++H SCP++KG KW KWI + Q
Sbjct: 538 DRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 570
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 110/214 (51%), Gaps = 27/214 (12%)
Query: 80 QVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGV 136
+V+S M P A+ +F + +C+ +I +A+ L ST+ G V G R+S G+
Sbjct: 94 RVISRMQRPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNV--VAGHRSSDGM 151
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDP--QE 191
F E + + +E +IA++T LP NGE +L Y+ G + H D A +P +E
Sbjct: 152 FFRLGE--TPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRE 209
Query: 192 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+ QRV + L+YL D+E GGETMFP G V PR+G L F
Sbjct: 210 SIARSGQRVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFE 254
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G DP+S+H S P+ GEKWVATKWIR +
Sbjct: 255 YGNRFGLADPSSLHTSTPLRAGEKWVATKWIRTR 288
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 100/207 (48%), Gaps = 25/207 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 144
P A+ F + +C+ +I +A+ L ST+ G + G R+S G+F E
Sbjct: 102 PAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNI--VAGHRSSDGMFFRLGE-- 157
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQR 199
+ + IE++IA +T P NGE +L Y+ G + H D P E + QR
Sbjct: 158 TPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQR 217
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
V + L+YL D+E GGET+FP +G V PR+G F +G
Sbjct: 218 VGTLLMYLNDVESGGETLFP---------------QVGCSVVPRRGQAFYFEYGNGSGRS 262
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQE 286
DP S+H S P+ G+KWVATKWIR +
Sbjct: 263 DPASLHASSPIGSGDKWVATKWIRTRR 289
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P+ N + E+C+++I ++K + S + + + IRTSS F+ E
Sbjct: 34 PKIAILGNVVSEEECEALIRLSKDKVNRSKIG-----SDHDVSDIRTSSSAFLPDDE--- 85
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
IE+++A++ +P +GE +IL YK GQ+Y +H+D F K+ R+++ ++
Sbjct: 86 -LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAA-KNPRISTLVL 143
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP N L V P +G + F + I+ ++H
Sbjct: 144 YLNDVEEGGETYFPEMN---------------LTVSPHKGMAVYFEYFYNDPAINERTLH 188
Query: 266 GSCPVVKGEKWVATKWIRDQE 286
G PV GEKW AT W+R Q+
Sbjct: 189 GGSPVTAGEKWAATMWVRRQQ 209
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/200 (36%), Positives = 99/200 (49%), Gaps = 32/200 (16%)
Query: 96 TPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS----AAEDESGTLDLI 151
TPE C+++I + + LRP+T+ D G + G +S D+ L +
Sbjct: 73 TPENCQNLIAIGQSLLRPATV-------TDEQTGQEVAHGERVSEMAWPKRDDYPILQSL 125
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---QKSQRVASFLVYLT 208
E IA++T +P E IL Y+ G +Y HYDAF P Q R A+ ++YL
Sbjct: 126 AEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAAD--APTLRQGGNRQATLILYLN 183
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
+EEGGET FP +GL+V P G G+ F +L G P S+H
Sbjct: 184 AVEEGGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGL 228
Query: 269 PVVKGEKWVATKWIRDQEQY 288
PV KGEKW+AT+WIR QE Y
Sbjct: 229 PVRKGEKWIATQWIR-QEAY 247
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P N + E+C +I+++K + S +A + IRTS+ VF+ ED S
Sbjct: 33 PFVAVLGNVLSDEECDELISLSKDRMNRSKIA------GNQENDIRTSTSVFL--PEDAS 84
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ +E++I+++ +P +GE +L Y+IGQ+Y +H+D F P++ ++ R+++ ++
Sbjct: 85 EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKL--IENPRISTLVL 142
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGG+T FP + L V P +G + F + ++ ++H
Sbjct: 143 YLNDVEEGGDTYFP---------------NLKLSVSPHKGMAVYFEYFYDDPMLNELTLH 187
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV G+KW AT W+R
Sbjct: 188 GGAPVTIGDKWAATMWMR 205
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 113/223 (50%), Gaps = 22/223 (9%)
Query: 68 QAGDDSVTNIP---FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETV 124
Q+ S++ +P Q +SW PRA+ + NF + ++ + II++A ++ ST+ K E V
Sbjct: 27 QSHFQSLSQLPTCRIQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGV 86
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
IRTS G F+ A+D + IEE++A + +P + E +LRY KY H
Sbjct: 87 --VDDIRTSYGTFLRRAQDP--VIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHI 142
Query: 185 DAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFE--NGMNADGSYDYQKCIG-LKVK 241
D +RVA+ L+YL E G + P M A+ S G + K
Sbjct: 143 DGL----------ERVATVLMYLVG-ESPGPDLAPVSACECMYAEQSNPSACAKGHVAYK 191
Query: 242 PRQGDGLLFYSLLPN-GTIDPTSIHGSCPVVKGEKWVATKWIR 283
P++GD L+F+ + P+ T D S+H CPVV G KW A KWI
Sbjct: 192 PKRGDALMFFDVKPDYTTTDGHSMHTGCPVVAGVKWNAVKWIH 234
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 103/196 (52%), Gaps = 26/196 (13%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
N + E+C+ +I ++K + S + + E D IRTSS F+ E + IE
Sbjct: 41 NVVSEEECEELIFLSKNKMNRSKIG-SQHEVSD----IRTSSSTFLP----EDDLTNRIE 91
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEE 212
+++A++ +P +GE +IL YK GQ+Y +HYD F + + R+++ ++YL D+EE
Sbjct: 92 KRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLNDVEE 150
Query: 213 GGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVK 272
GGET FP N L + P +G + F + I+ ++HG PV
Sbjct: 151 GGETYFPHMN---------------LSISPHKGMAVYFEYFYSDPLINERTLHGGSPVTS 195
Query: 273 GEKWVATKWIRDQEQY 288
GEKW AT W+R ++QY
Sbjct: 196 GEKWAATMWVR-RKQY 210
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 100/204 (49%), Gaps = 21/204 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR N T E+C+S+ ++ + L + G RT++ ++ +
Sbjct: 88 PRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESSTRTNTAAWLEY--HQG 145
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----SQRVA 201
+ +E +AKVT NGE IL Y+ Q++ H+D FDP P+ R+A
Sbjct: 146 PVVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFDPATDPPENFEPGGNRLA 205
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ ++YL + EEGGET D+ K I KVKP G +LFY L P+G++D
Sbjct: 206 TAIIYLQNAEEGGET--------------DFMK-IDTKVKPEAGSAVLFYDLKPDGSVDK 250
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQ 285
+IH P GEKWVATKWI ++
Sbjct: 251 LTIHSGNPPKGGEKWVATKWIHER 274
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 102/200 (51%), Gaps = 25/200 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I +K L+ S + GE Q IRTSSGVF +E+
Sbjct: 36 PLIVILGNVLSNEECDELIEHSKERLQRSKI----GEERSVNQ-IRTSSGVFC----EEN 86
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
T+ IE++I+++ +P +G+ +L Y GQ+Y H+D F + R+++ ++
Sbjct: 87 ETVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRA-SANNRISTLVM 145
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP N L V P +G + F N ++ ++H
Sbjct: 146 YLNDVEEGGETTFPMLN---------------LSVFPSKGMAVYFEYFYSNHELNERTLH 190
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
PV KGEKWVAT W+R Q
Sbjct: 191 AGAPVRKGEKWVATMWMRRQ 210
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 102/198 (51%), Gaps = 23/198 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I +K ++ S +A VD +RTSS F E+E
Sbjct: 38 PLIVILGNVLSDEECDQLIQQSKDRMQRSKVA--NSLEVDE---LRTSSSTFFHEGENE- 91
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ IE++I+++ +P +GE IL YKIGQ+Y +H+D F + R+++ ++
Sbjct: 92 -IVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAA-SNPRISTLVM 149
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+E+GGET FP + V P++G + F + ++ ++H
Sbjct: 150 YLNDVEQGGETYFP---------------KLNFSVSPQKGMAVYFEYFYNDQNLNDLTLH 194
Query: 266 GSCPVVKGEKWVATKWIR 283
G PVV G+KW AT+W+R
Sbjct: 195 GGAPVVMGDKWAATQWMR 212
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 98/200 (49%), Gaps = 32/200 (16%)
Query: 96 TPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS----AAEDESGTLDLI 151
TPE C+++I + + LRP+T+ D G + G +S D+ L +
Sbjct: 21 TPENCQNLIAIGQSLLRPATV-------TDEQTGQEVAHGERVSEMAWPKRDDHPILQSL 73
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---QKSQRVASFLVYLT 208
E IA++T +P E IL Y+ G +Y HYDAF P Q R + ++YL
Sbjct: 74 AEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAAD--APTLRQGGNRQGTLILYLN 131
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
+EEGGET FP +GL+V P G G+ F +L G P S+H
Sbjct: 132 AVEEGGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGL 176
Query: 269 PVVKGEKWVATKWIRDQEQY 288
PV KGEKW+AT+WIR QE Y
Sbjct: 177 PVRKGEKWIATQWIR-QEAY 195
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 99/199 (49%), Gaps = 35/199 (17%)
Query: 98 EQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLI 151
E+C +I R S L++ TVD G R+S G F D+ + +
Sbjct: 109 EECDELI-------RRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADD--FIARL 159
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVY 206
+ +IA++ P NGE +L Y G +Y H+D F P + G + QRV++ L+Y
Sbjct: 160 DRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIY 219
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L D+ +GG T+FP +GL+V PR+G + F +G +DP ++HG
Sbjct: 220 LNDVAQGGATVFP---------------TLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHG 264
Query: 267 SCPVVKGEKWVATKWIRDQ 285
PV KGEKW+ TKW+R +
Sbjct: 265 GEPVEKGEKWIITKWMRQR 283
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 104/198 (52%), Gaps = 24/198 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + E+C +I+++K + S ++ + + +RTSS +F AE++
Sbjct: 44 PLIVLLGNVLSEEECDQLISLSKDRIERSKISNK------SVHDLRTSSSMFFDDAEND- 96
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ +E++++++ +P +GE IL Y IGQ+Y +HYD F + R+++ ++
Sbjct: 97 -VVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGN-SKVNNPRISTLVM 154
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+E GGET FP + V P++G + F + T++ ++H
Sbjct: 155 YLNDVEAGGETYFP---------------KLNFYVAPKKGMAVYFEYFYNDTTLNELTLH 199
Query: 266 GSCPVVKGEKWVATKWIR 283
G PVV G+KW AT+W+R
Sbjct: 200 GGAPVVIGDKWAATQWMR 217
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 100/205 (48%), Gaps = 23/205 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + +C +++ +A+ L S + + +N RTS G E
Sbjct: 90 PSIRLYQHLLSDAECDALVELARGRLARSPV-INPDTGDENLIDARTSMGAMFQVGEHT- 147
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRV 200
+ IE++IA V +P +GE IL YK G +Y H+D F+P+ G + QR
Sbjct: 148 -LIQRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRT 206
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ ++YL + GG T FP IGL+V P +G+ + F L P+G +D
Sbjct: 207 ATLVIYLNTPQAGGATAFP---------------RIGLEVAPVKGNAVYFSYLQPDGKLD 251
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQ 285
++H PV GEKW+ATKW+R+
Sbjct: 252 ERTLHAGLPVQSGEKWIATKWLREH 276
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 98/199 (49%), Gaps = 23/199 (11%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
N +CK++I MAK L PSTL + D R S G+F E++ + ++
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTL-VDPMSGRDVVSDKRASWGMFFRLCEND--LVARLD 163
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVYL 207
+++ + LP NGE ++L Y G H+D P ++S QRV++ + YL
Sbjct: 164 RRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTYL 223
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
D EGG+T+FP +GL V P +G+ F NG +D S+H S
Sbjct: 224 NDAPEGGQTVFPQ---------------LGLAVSPIRGNACYFEYCDGNGRVDARSLHAS 268
Query: 268 CPVVKGEKWVATKWIRDQE 286
PV +G+KWV TKW+R++
Sbjct: 269 APVTRGDKWVMTKWMRERR 287
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 107/202 (52%), Gaps = 20/202 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LSW PR + F + +C +++M + N+ S+LA T G R SS I
Sbjct: 63 LSWHPRIFLYEGFLSDMECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI--- 110
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
E + IE++I+ + LP+ NGE+ +L+Y + + + + + R+A
Sbjct: 111 --EDIVVSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLA 163
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDY-QKCIGLKVKPRQGDGLLFYSLLPNGTID 260
+ L+YL+D+++GGET+FP +A +C G V+P +G+ +L ++L P+G D
Sbjct: 164 TILMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETD 223
Query: 261 PTSIHGSCPVVKGEKWVATKWI 282
S + CPV++GEKW+A K I
Sbjct: 224 KDSQYEECPVLEGEKWLAIKHI 245
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 99/204 (48%), Gaps = 28/204 (13%)
Query: 87 RALYFPNFATPEQCKSIINMAKLNLRPSTL----ALRKGETVDNTQGIRTSSGVFISAAE 142
R F NFA+ ++C + + L + A R E R S+ ++
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKLSRAVAWTDGAFRPVE-------FRISTAAWLQPDH 357
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVAS 202
D+ T + +IA T L EA + Y IG Y +HYD +E + R+A+
Sbjct: 358 DDVVTN--LHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRIAT 415
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 262
F++YL +E+GG T FP +G V+P GD + +Y+LLP+G D
Sbjct: 416 FMIYLNQVEQGGYTAFPR---------------LGAAVEPGHGDAVFWYNLLPDGESDNN 460
Query: 263 SIHGSCPVVKGEKWVATKWIRDQE 286
++HG+CPV++G KWVA KWI +++
Sbjct: 461 TLHGACPVLQGSKWVANKWIHEKK 484
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 100/200 (50%), Gaps = 25/200 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + +C +I ++ L+ S + + + IRTSSGVF E
Sbjct: 36 PLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDR-----SVNSIRTSSGVFCEQTE--- 87
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
T+ IE++I+++ +P +G+ +LRY GQ+Y HYD F + + R+++ ++
Sbjct: 88 -TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFF-AETSRASTNNRISTLVM 145
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+E+GGET+FP + L V P +G + F N ++ ++H
Sbjct: 146 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYRNQEVNEFTLH 190
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
V+ GEKWVAT W+R Q
Sbjct: 191 AGAQVIHGEKWVATMWMRRQ 210
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 101/200 (50%), Gaps = 25/200 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + N + +C +I ++ L+ S + GE + IRTSSGVF E
Sbjct: 41 PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-GSVNSIRTSSGVFCEQTE--- 92
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
T+ IE++I+++ +P +G+ +LRY GQ+Y HYD F + + R+++ ++
Sbjct: 93 -TITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFF-AETSRASTNNRISTLVM 150
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+E+GGET+FP + L V P +G + F N ++ ++H
Sbjct: 151 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYSNQELNDFTLH 195
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
V+ GEKWVAT W+R Q
Sbjct: 196 AGTQVIHGEKWVATMWMRRQ 215
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 102/199 (51%), Gaps = 30/199 (15%)
Query: 98 EQCKSIINMAKLNLRPSTLA---LRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEK 154
E+C +I A L+ ST+ K ET+ + R+S G F D+ + ++ +
Sbjct: 107 EECDELIRRAAAKLQRSTIVDPTTGKHETIAD----RSSEGTFFEINADD--FIARLDRR 160
Query: 155 IAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQRVASFLVYLTD 209
I+ + LP +GE IL Y G +Y H+D F P + G QRV++ ++YL +
Sbjct: 161 ISALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVSTLVMYLNE 220
Query: 210 LEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 269
+E+GG T+FP +GL V P++G + F G +DP ++HG P
Sbjct: 221 VEDGGATIFP---------------ELGLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAP 265
Query: 270 VVKGEKWVATKWIRDQEQY 288
V++GEKW+ TKW+R Q +Y
Sbjct: 266 VLRGEKWIVTKWMR-QRRY 283
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/213 (33%), Positives = 105/213 (49%), Gaps = 39/213 (18%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFIS 139
PR + F + E+C +++++A RP L + ETV N G RTS G+F
Sbjct: 108 PRVMVFGGLLSDEECDAMVDLA----RPR---LARSETVHNGSGGSEVNAARTSDGMFFD 160
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP----- 194
E IE++IA + P NGE +LRY+ G +Y +H+D FDP + G
Sbjct: 161 --RGEFPLCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILK 218
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF-YSL 253
+ QRV + ++YL GG T FP +GL+V P +G+ + F Y
Sbjct: 219 RGGQRVGTVVMYLNHPIRGGGTAFP---------------DVGLEVAPFKGNAVFFSYDR 263
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
T ++H PV++GEKWVATKW+R+ E
Sbjct: 264 AHPMT---RTLHAGTPVLEGEKWVATKWVREGE 293
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 112/232 (48%), Gaps = 29/232 (12%)
Query: 64 MPHGQAGDDSV----TNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-L 118
+PH ++ V + + + P +F +PE+C ++I+ A L+ S +
Sbjct: 53 IPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQKLKASRVVDP 112
Query: 119 RKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQ 178
G V+++ TS+G E + IE +IA + P +GE +LRY+ G
Sbjct: 113 EDGSFVEHSARTSTSTGYH----RGEIDIIKTIEARIADLINWPVDHGEGLQVLRYEDGG 168
Query: 179 KYNSHYDAFDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 233
+Y H+D FDP + Q QRV +FL+YL++++ GG T FP
Sbjct: 169 EYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFP-------------- 214
Query: 234 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+ +++P +G L F + I+P ++H PV +G K++ATKW+R++
Sbjct: 215 -NLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKYLATKWLREK 265
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/212 (33%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + +C ++I +A RP L + TVDN G RTS G+ +
Sbjct: 95 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHLVHAARTSDGMCL 147
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILL 205
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 109/215 (50%), Gaps = 27/215 (12%)
Query: 79 FQVLSWM--PRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSG 135
+VLS + P A++ NF + ++C+ +I +A+ L R + + G V T R+S G
Sbjct: 93 VRVLSRLQRPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATH--RSSHG 150
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQ 190
+F E + + IE +IA++T P NGE +L Y+ G + H D +
Sbjct: 151 MFFRLGE--TPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANR 208
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
E + QR+ + L+YL D+E GGET+FP +G + P++G L F
Sbjct: 209 ESIARSGQRMGTLLMYLKDVEGGGETVFP---------------QVGWSIVPQRGHALYF 253
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G DP+S+H S P+ G+KWVATKWIR +
Sbjct: 254 EYGNRYGMCDPSSLHASTPLRTGDKWVATKWIRTR 288
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 95/209 (45%), Gaps = 37/209 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFIS 139
PR + F + +C I+ +A L + TVD G RTS G+F +
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLA-------RSHTVDTATGASEVNAARTSDGMFFT 154
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP----- 194
E E +IA + P NGE +L Y+ G +Y HYD FDP + G
Sbjct: 155 RGEHP--VCARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLR 212
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
+ QRVA+ + YL GG T FP IGL+V P +G + F
Sbjct: 213 RGGQRVATLVTYLNTPTRGGGTTFP---------------DIGLEVTPLKGHAVFFSYDR 257
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
P+ + S+HG PV++G+KWVATKW+R
Sbjct: 258 PHPST--RSLHGGAPVLEGDKWVATKWLR 284
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 90/185 (48%), Gaps = 22/185 (11%)
Query: 99 QCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKV 158
+C + M+ LRPST+ L +RTS G +S E E + ++ +IA
Sbjct: 142 ECAYLQQMSAPRLRPSTI-LDPQTGARRPDPVRTSVGAALSPVE-EDLVVGMLNRRIAAA 199
Query: 159 TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMF 218
T R+ GE +ILRY Q+Y H+DA E +QR + +VYLT EGGET F
Sbjct: 200 TGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE-----NQRSHTLIVYLTADYEGGETAF 254
Query: 219 PFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVA 278
P +G +++ RQGD LLF +L +G D H P G KW+A
Sbjct: 255 PE---------------LGFRLRGRQGDALLFANLREDGRPDLRMRHAGLPATSGAKWIA 299
Query: 279 TKWIR 283
T+WIR
Sbjct: 300 TRWIR 304
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 98/179 (54%), Gaps = 20/179 (11%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
LS PRA + F + +C ++++AK ++ S +A DN G RTSSG
Sbjct: 38 LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVA-------DNDSGKSVASQARTSSG 90
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F++ EDE + IE+++A T LP N E+ +LRY+ GQKY++H+D F +
Sbjct: 91 TFLAKREDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKL 148
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
QRVA+ L+YLTD+++GGE +FP +A+GS+ K R G +F+S L
Sbjct: 149 GGQRVATVLMYLTDVKKGGEAVFP-----DAEGSHLQYKDETWSDCSRSGLAGIFFSEL 202
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 109/208 (52%), Gaps = 19/208 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
L+W PR + F + +C ++ +A+LN+ S L + TQ T + A
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNI--TQN-STDARFKFQLA 119
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---Q 198
+ + + IE++I+ + +P+ +GE+ IL+Y Q D + G Q S
Sbjct: 120 DSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS--------DHNKDGTQSSSGGN 171
Query: 199 RVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R+ + L+YL+D+++GGET+FP ++ +G+ +C G VKP +GD +L ++L P
Sbjct: 172 RLVTILMYLSDVKQGGETVFPRSELKDTQAKEGAL--SECAGYAVKPVKGDAILLFNLRP 229
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G D S + C V++GEKW+A K +
Sbjct: 230 DGVTDSDSHYEDCSVLEGEKWLAIKHLH 257
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + E+C ++I +A RP L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGFLSDEECDALIALA----RPH---LARSRTVDNANGEHVVHAARTSDSMCL 147
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLV 205
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ G+KWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 103/199 (51%), Gaps = 31/199 (15%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P+ + + + + + +++ ++A RP + G V IRTS VF+ +E
Sbjct: 308 PKIIRYHDVISDTEIETLKDIA----RPELTRSQTGWGV--ISDIRTSQSVFL----EEV 357
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
GT+ I ++IA +T L + E ++ Y IG +Y H+D G + ++R A+FL+
Sbjct: 358 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 411
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
Y++D+E GG T+F +G+ VKP +G + +Y+L NG +D + H
Sbjct: 412 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 456
Query: 266 GSCPVVKGEKWVATKWIRD 284
CPV+ G KWVA KWI +
Sbjct: 457 AGCPVLVGNKWVANKWIHE 475
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 86/143 (60%), Gaps = 5/143 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISA 140
LSW PRA F + +C +I +AK L S +A + G++V + +RTSSG+F+
Sbjct: 39 LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+DE + IEE+I+ T LP NGE+ IL Y+ G+KY HYD F ++ R+
Sbjct: 97 KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154
Query: 201 ASFLVYLTDLEEGGETMFPFENG 223
A+ L+YL+++E+GGET+FP G
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEG 177
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 87/145 (60%), Gaps = 7/145 (4%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 136
+V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG D +RTSSG+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD----VRTSSGM 113
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+D F +
Sbjct: 114 FVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRG 173
Query: 197 SQRVASFLVYLTDLEEGGETMFPFE 221
QRVA+ L+YLTD GGET FP E
Sbjct: 174 GQRVATMLMYLTDGVVGGETHFPQE 198
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 103/199 (51%), Gaps = 31/199 (15%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P+ + + + + + +++ ++A RP + G V IRTS VF+ +E
Sbjct: 326 PKIIRYHDVISDTEIETLKDIA----RPELTRSQTGWGV--ISDIRTSQSVFL----EEV 375
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
GT+ I ++IA +T L + E ++ Y IG +Y H+D G + ++R A+FL+
Sbjct: 376 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 429
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
Y++D+E GG T+F +G+ VKP +G + +Y+L NG +D + H
Sbjct: 430 YMSDVEVGGATVFT---------------NVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 474
Query: 266 GSCPVVKGEKWVATKWIRD 284
CPV+ G KWVA KWI +
Sbjct: 475 AGCPVLVGNKWVANKWIHE 493
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + E+C ++I +A RP L + TVDN G RTS + +
Sbjct: 105 LPRVVVLGGFLSDEECDALIALA----RPR---LARSRTVDNANGEHVVHAARTSDSMCL 157
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 158 RLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLV 215
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 216 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 260
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ G+KWVATKW+R++
Sbjct: 261 RPHPMT--RSLHAGAPVLAGDKWVATKWLRER 290
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 105/200 (52%), Gaps = 25/200 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P L N + +C +I++A ++ + + G + D ++ +RTSS +F +E+E
Sbjct: 32 PLILILDNVLSWAECDLLIDLASARMQRAKI----GSSHDVSE-VRTSSSMFFEESENE- 85
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ +E ++A++ +P + E +LRY+ G++Y+ H+D F G + R+++ ++
Sbjct: 86 -CIGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQ---GSSMNNRISTLVM 141
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+EEGGET FP + V P++G + F + ++ ++H
Sbjct: 142 YLNDVEEGGETYFP---------------SLHFSVTPKKGSAVYFEYFYNDTRLNELTLH 186
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
PV GEKWVAT+W+R Q
Sbjct: 187 AGHPVEAGEKWVATQWMRRQ 206
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 99/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + E+C ++I +A+ +L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGFLSDEECDALIALAQPHLA-------RSRTVDNANGEHVVHAARTSDSMCL 147
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLV 205
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ G+KWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGDKWVATKWLRER 280
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 101/201 (50%), Gaps = 35/201 (17%)
Query: 97 PEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFISAAEDESGTLDL 150
P++C+ +I +A+ L PST TVD G R+S G+F E+ +
Sbjct: 110 PQECEELIALARPRLAPST-------TVDPLSGRDLVGEQRSSLGMFFRLREN--AFIAR 160
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLV 205
++++++++ LP NGE +L Y G + H+D P + S QRV++ +
Sbjct: 161 LDQRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVS 220
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL ++EEGGET+FP +C G V PR+G + F G +D S+H
Sbjct: 221 YLNEVEEGGETIFP--------------EC-GWSVPPRRGSAVYFEYCNSLGQVDHASLH 265
Query: 266 GSCPVVKGEKWVATKWIRDQE 286
PV+ GEKWVATKW+R +
Sbjct: 266 AGGPVLHGEKWVATKWMRQRR 286
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 99/200 (49%), Gaps = 35/200 (17%)
Query: 98 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFISAAEDESGTLDLI 151
E+C+++I +A+ L PST +VD G R+S G+F E+ + +
Sbjct: 111 EECEALIALARPRLAPST-------SVDPLTGRNRLGAQRSSLGMFFRLREN--AFVARL 161
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRVASFLVY 206
+E+++++ LP NGE +L Y G + H+D P Q S QRV++ + Y
Sbjct: 162 DERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAY 221
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L ++EEGGET+FP G V P++G + F G +D S+H
Sbjct: 222 LNEVEEGGETVFPE---------------TGWSVSPQRGGAVYFEYCNSLGQVDHASLHA 266
Query: 267 SCPVVKGEKWVATKWIRDQE 286
PV+ GEKWVATKW+R +
Sbjct: 267 GAPVLSGEKWVATKWMRQRR 286
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 25 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHLVHAARTSDSMCL 77
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 78 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILL 135
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 136 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 180
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 181 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 210
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 105/220 (47%), Gaps = 25/220 (11%)
Query: 69 AGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ 128
A D V + F VL P+ + N + E+C +II ST+ + +
Sbjct: 68 AAPDRVAEVLF-VLK-QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHE 125
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD 188
G RTS FI E E + IE ++A + P E F + +Y Q+Y HYD D
Sbjct: 126 G-RTSEMAFIQRGEAE--VAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLD 182
Query: 189 PQEYGPQKS-----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 243
P G + QR+A+F++YL+D+E+GG T+FP +GL+V P+
Sbjct: 183 PDSSGHRSHLARGGQRLATFILYLSDVEQGGGTVFP---------------GLGLEVYPK 227
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G L F + N D ++HG PVV+G K +A KW+R
Sbjct: 228 KGSALWFLNTDINHQPDKRTLHGGAPVVRGTKIIANKWLR 267
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 110/217 (50%), Gaps = 26/217 (11%)
Query: 78 PF--QVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVDNTQGIRTSS 134
PF ++LS P + + + TP + ++ N++K L R + + + + RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 371
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD----PQ 190
V++ A E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ P+
Sbjct: 372 SVWL--ASHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQAPE 429
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
G R+A+ L YL+D+ +GG T+FP N + V+PRQGD LL+
Sbjct: 430 HRG--GGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLW 472
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y+L G + ++H SCP+++G KW KWI + Q
Sbjct: 473 YNLNDRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 509
>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
Length = 235
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 104/217 (47%), Gaps = 26/217 (11%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P +L P + T +C +I++A+ L+PS + G + T +RTS +
Sbjct: 39 PSNLLHDNPFIGCYEKVVTQTECHQLIDLARHGLQPSKVI---GNSEQKTSAVRTSDTIG 95
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEY 192
E TL + + +IA + LP E I RY++G K+N+H+D F+P + Y
Sbjct: 96 FQHHLTEL-TLQICK-RIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMY 153
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
+ QR+ + L+YL ++ GGET FP N ++V P +G L+F +
Sbjct: 154 LSENGQRIITALLYLNNVSAGGETSFPLLN---------------IQVAPSEGTLLVFEN 198
Query: 253 LLPNGT-IDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
N SIH C V +GEKW+AT W ++ QY
Sbjct: 199 CKKNSNERHALSIHEGCAVHEGEKWIATLWFHEKSQY 235
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/270 (28%), Positives = 122/270 (45%), Gaps = 29/270 (10%)
Query: 27 LFFFLAGLLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMP 86
LF + G S+L Q + P + + S + G + + + +S P
Sbjct: 23 LFLVIVGYAVLSMLLQSLWMTGPKSDALLSKAPSLERRSMTNLGGMAKKSTWIETVSVDP 82
Query: 87 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESG 146
R N T E+C ++++A ++L G RT+ ++ +D+
Sbjct: 83 RIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVESTTRTNKQAWLDFQQDD-- 140
Query: 147 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ----RVAS 202
+ +E+KIAK+T GE +L Y Q++ H+D FDP P+ + R+ +
Sbjct: 141 VVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPENYEKGGNRLIT 200
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT--ID 260
+VYL EEGGET F N LK+ +GD ++FY+ L +G ID
Sbjct: 201 VIVYLQAAEEGGETHFGAAN---------------LKLTAAKGDAVMFYN-LKHGCDGID 244
Query: 261 PT-----SIHGSCPVVKGEKWVATKWIRDQ 285
PT ++H P +KGEKWVATKWI ++
Sbjct: 245 PTCVDKQTLHAGLPPIKGEKWVATKWIHER 274
>gi|410637601|ref|ZP_11348175.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
gi|410142794|dbj|GAC15380.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
Length = 280
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 100/197 (50%), Gaps = 25/197 (12%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
NF T ++C++++ + K LRPS + R+G D +G RTSS + +D I+
Sbjct: 92 NFLTAQECEALVALTKSKLRPSEIPEREG---DQYKGFRTSSTCDLPFTKDPLA--HEID 146
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
+KI L E Y IGQ++ +H D F P + Y QR +F++YL
Sbjct: 147 QKIVDALGLGVGEKEVIQAQHYAIGQEFKAHCDYFVPGSKDFKTYSKDGGQRTWTFMIYL 206
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
+L EGGET F +G+K KP+QG L++ +L +G+I+ ++H +
Sbjct: 207 NELCEGGETEFV---------------KLGIKFKPKQGTALVWNNLHEDGSINEDTLHHA 251
Query: 268 CPVVKGEKWVATKWIRD 284
P+ GEK V TKW R+
Sbjct: 252 HPIESGEKVVITKWFRE 268
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHMVHAARTSDSMCL 147
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILL 205
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 250
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 101/224 (45%), Gaps = 27/224 (12%)
Query: 68 QAGDDSV----TNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET 123
QAG ++ IP PR F T +C +++ +A+ L S + +
Sbjct: 110 QAGRNAAHFAGREIPILFTLAAPRVTLFQQLLTDAECDALVALARGRLARSPV-INPDTG 168
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
+N RTS G E ++ IE+ IA VT + GE IL YK G +Y H
Sbjct: 169 DENLIEARTSLGAMFQVGEHP--LIERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPH 226
Query: 184 YDAFDPQEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 238
YD F+PQ G + QRV + ++YL GG T FP +GL
Sbjct: 227 YDFFNPQRPGEARQLKVGGQRVGTLVIYLNSPLAGGATAFPK---------------LGL 271
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
+V P +G+ + F +G +D ++H PV GEKW+ATKW+
Sbjct: 272 EVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKWIATKWL 315
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 105/202 (51%), Gaps = 20/202 (9%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LSW PR + F + +C +++ + N+ S+LA T G R SS I
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNM-DSSLAF--------TDGDRNSSYNNI--- 356
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
E + IE++I+ + LP+ NGE +L+Y + ++ + + +A
Sbjct: 357 --EDIVVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLA 409
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDY-QKCIGLKVKPRQGDGLLFYSLLPNGTID 260
+ L+YL+D+++GGET+FP +A +C G V+P +G+ LL ++L P+G ID
Sbjct: 410 TILIYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNALLLFNLRPDGEID 469
Query: 261 PTSIHGSCPVVKGEKWVATKWI 282
S + CPV++GEKW+A K I
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHI 491
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 108/215 (50%), Gaps = 27/215 (12%)
Query: 80 QVLSWM--PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGV 136
+VLS + P A++ +F + ++C+ +I +A+ L ST+ G V G R+S G+
Sbjct: 94 RVLSRLQRPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNV--VAGHRSSHGM 151
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQE 191
F E + + IE +IA +T P NGE +L Y+ G + H D +E
Sbjct: 152 FFRLGE--TPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRE 209
Query: 192 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+ QR+ + L+YL D+E GGET+FP IG V P++G L F
Sbjct: 210 SIARSGQRMGTLLMYLKDVEGGGETVFP---------------QIGWSVAPQRGHALYFE 254
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
G DP+S+H S P+ G+KWVATKWIR +
Sbjct: 255 YGNRFGLCDPSSLHASTPLRVGDKWVATKWIRTRR 289
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 115 LPRVVVLGGFLSDGECDALIALA----RPR---LARSRTVDNANGEHMVHAARTSDSMCL 167
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 168 RVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILL 225
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 226 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 270
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 271 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 300
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFI 138
+PR + F + +C ++I +A+ L + TVDN G RTS + +
Sbjct: 95 LPRVMVLGGFLSDAECDAMIALAQPRLA-------RSRTVDNANGAHVVHAARTSDSMCL 147
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P NGE +LRY G +Y HYD FDP G
Sbjct: 148 QLGQD--ALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLL 205
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL + GG T FP + L + +G+ + F
Sbjct: 206 QAGGQRVASLVMYLNTPDRGGATRFPD---------------VHLDIAAIKGNAVFFSYD 250
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 84/155 (54%), Gaps = 25/155 (16%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
IRTS VF+ DE GT+ I ++IA +T L + E ++ Y IG +Y H+DA
Sbjct: 91 IRTSQSVFL----DEVGTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDA--- 143
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
G ++R A+FL+Y++D+E GG T+F +G+ VKP +G +
Sbjct: 144 ---GGDVNERTATFLIYMSDVEVGGATVFT---------------NVGVAVKPEKGSAVF 185
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
+ +L NG +D + H CPV+ G KWVA KWI +
Sbjct: 186 WNNLHKNGELDLKTKHAGCPVLVGNKWVANKWIHE 220
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 104/199 (52%), Gaps = 30/199 (15%)
Query: 93 NFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 151
N + ++C+ +I +A+ L R T+ + VD RTS G+F + +E + I
Sbjct: 102 NILSTQECEELIALARPRLQRALTVDSEGRQQVDRR---RTSEGMFFTL--NEVPLVGRI 156
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE--YGPQKS---QRVASFLVY 206
E+++A + +P +GE IL Y GQ+Y H+D FDP++ YG + QR+AS ++Y
Sbjct: 157 EQRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMY 216
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L GG T FP +GL V R+G + F G DP+S+H
Sbjct: 217 LNTPARGGGTAFP---------------ELGLTVTARRGSAVYFAY---EGG-DPSSLHA 257
Query: 267 SCPVVKGEKWVATKWIRDQ 285
PV+ GEKW+ATKW+R++
Sbjct: 258 GLPVLDGEKWIATKWLRER 276
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 106/208 (50%), Gaps = 30/208 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
L+W PR + F + +C ++ +A+LN+ S L G R I+
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLV---------NAGARN-----ITQN 108
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---Q 198
+ + IE++I+ + +P+ +GE+ IL+Y Q D + G Q S
Sbjct: 109 STDDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS--------DHNKDGTQSSSGGN 160
Query: 199 RVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R+ + L+YL+D+++GGET+FP ++ +G+ +C G VKP +GD +L ++L P
Sbjct: 161 RLVTILMYLSDVKQGGETVFPRSELKDTQAKEGAL--SECAGYAVKPVKGDAILLFNLRP 218
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G D S + C V++GEKW+A K +
Sbjct: 219 DGVTDSDSHYEDCSVLEGEKWLAIKHLH 246
>gi|90022913|ref|YP_528740.1| hypothetical protein Sde_3273 [Saccharophagus degradans 2-40]
gi|89952513|gb|ABD82528.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 478
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 108/221 (48%), Gaps = 41/221 (18%)
Query: 86 PRALYFPN----------------FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
PR ++ PN F T E+C+ II + LRPS L+ ++ + +
Sbjct: 86 PRKIFIPNALKLNSDKLEMYALGEFLTTEECERIIANIRSKLRPSELSSQESD-----KT 140
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS + +D + ++ +I K+ + E Y++GQ++ +H D F+
Sbjct: 141 YRTSRTCDLGTIDDP--FIHYVDSRICKLVGIDPSYSEVIQGQLYEVGQEFKAHTDYFEI 198
Query: 190 QE---YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
+E +G QR + ++YL D+EEGGET FP ADG+ +KPR G
Sbjct: 199 KEMPEHGAVMGQRTYTVMIYLNDVEEGGETDFP-----AADGA----------IKPRAGL 243
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L++ SL NG +P S+H + PV+KG K V TKW R Q +
Sbjct: 244 ALIWNSLQSNGAPNPHSMHQAYPVLKGHKAVITKWFRSQSR 284
>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
Length = 239
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 109/208 (52%), Gaps = 23/208 (11%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LS P + +FA + C+ +I A+ +L + + R+G IR +S +++A
Sbjct: 31 LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEVQTRRGSAART--AIRRASSCWLAAR 88
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYK--IGQKYNSHYDAFDPQEYGPQKS-Q 198
DE+ L+ +E+ I P E F+++RY+ G++Y +H DAF+ ++ Q
Sbjct: 89 GDEA--LEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQ 146
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+ + L+YL+D+ GG T+FP +GL V PR+G L+F ++ + T
Sbjct: 147 RLTTALLYLSDVGAGGATVFP---------------ALGLSVAPRRGRLLVFANVADDTT 191
Query: 259 IDPTSIHGSCPVV-KGEKWVATKWIRDQ 285
+D ++H P+ EKW+A KW+R++
Sbjct: 192 VDARTVHAGEPIAGDTEKWIANKWVRER 219
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFI 138
+PR + +F + +C ++I +A+ L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGDFLSDAECDALIALAQPRLA-------RSRTVDNDNGAQIVHAARTSDSMCL 147
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 148 QLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLL 205
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QR+AS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QAGGQRLASLVMYLNTPERGGATRFPD---------------VHLDVAAVKGNAVFFSYD 250
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 110/217 (50%), Gaps = 24/217 (11%)
Query: 77 IPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG-IRTS 133
+P+ + +S P + + N + + +S+ +A L+P+ + N +G R +
Sbjct: 93 MPYAIEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIA 152
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
FI ++ES I +++ VT L E ++ Y I +Y HYD F P + G
Sbjct: 153 KMAFI--LDEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTF-PAKSG 209
Query: 194 PQKS---QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ R+A+ ++YL+D+E GG T+F I ++V PR+G+ +++
Sbjct: 210 DRSHPSHDRLATAILYLSDVERGGATVF---------------TNINVRVLPRKGNVIIW 254
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y+ LP+G + P ++H CPV+ G KW+A KWI+ + Q
Sbjct: 255 YNYLPDGNLHPGTLHAGCPVLVGSKWIANKWIQSKGQ 291
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 102/208 (49%), Gaps = 26/208 (12%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF--- 137
+LS P + F NF + E+ +I++ AK ST + + RTSS +
Sbjct: 64 MLSEDPPVIQFNNFISQERIDAILHFAKPKFARSTSGIER-----EVSNYRTSSTAWMLP 118
Query: 138 -ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
+ + L +EE+IA++ LP N E F +L+Y+ Q Y H D + Q P
Sbjct: 119 DVLGNDPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDYIEEQRQQP-C 177
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
RVA+F +YL D+EEGG T FP + L V+P +G+ +L+YS PN
Sbjct: 178 GIRVATFFLYLNDVEEGGGTRFP---------------NLNLTVQPAKGNAVLWYSAYPN 222
Query: 257 GT-IDPTSIHGSCPVVKGEKWVATKWIR 283
T +D + H + PV KG K+ A KWI
Sbjct: 223 TTRMDSRTDHEAMPVAKGMKYGANKWIH 250
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 108/211 (51%), Gaps = 20/211 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG---IRTSSGVFISAAE 142
PR Y NF + ++ ++ ++ PST K + RTS F
Sbjct: 2 PRVFYVHNFLSADEADELV---AFSMAPSTGGTHKAWNQGGSNAKLTTRTSMNAF-DITT 57
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE-----YGPQK- 196
S + ++ ++ + ILRY++GQ Y +H+D F ++ + P K
Sbjct: 58 KLSFRIKRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPSKG 117
Query: 197 -SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYD---YQKCIG-LKVKPRQGDGLLFY 251
S R A+ +YL+D+E GG+T+ + G++A GS++ +C L V PR+GD +LFY
Sbjct: 118 GSNRFATIFLYLSDVEVGGQTLEK-DAGVDA-GSWEDKLVDQCYSKLAVPPRRGDAILFY 175
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
S P+G +DP S+HG+CP++KG KW A W+
Sbjct: 176 SQYPDGHLDPNSLHGACPILKGTKWGANLWV 206
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 105/221 (47%), Gaps = 32/221 (14%)
Query: 71 DDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQG 129
D IP V + P + +C +I +A+ L R T+ + +D
Sbjct: 80 DGRTIGIPLSVDA--PALRVLDGLLSERECADLIELARPRLQRALTVDSDGKQQIDQR-- 135
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS G+F A E + + IE+++A++ +P +GE IL Y GQ+Y HYD FDP
Sbjct: 136 -RTSEGMFFRAGE--TPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDP 192
Query: 190 QEYGPQK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 244
G K QR+AS ++YL E GG T FP IGL V R+
Sbjct: 193 ALPGYDKLTARAGQRIASVVMYLNTPERGGGTAFP---------------EIGLTVTARR 237
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G + F G D +S+H PV++GEKW+AT W+R++
Sbjct: 238 GAAVYF--AYEGG--DQSSLHAGLPVLQGEKWIATHWLRER 274
>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 105/213 (49%), Gaps = 15/213 (7%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVD---NTQGIRT 132
+ +VLS PRA NF + + I+ + + L ST A T D +T+ RT
Sbjct: 1 MTLKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRT 60
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTMLPR-INGEAFNILRYKIGQKYNSHYDAFDPQE 191
S ++ ++S +D I + A + ++ + EA ++ Y +GQ+Y +H+D P
Sbjct: 61 SLNTWVY--REKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDI 118
Query: 192 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+ R + L+YL + EGG T FP +NA+ GL V+P+ G +LFY
Sbjct: 119 DNEYQPARYCTLLLYLNEGMEGGATQFP--RWVNAETRN------GLDVEPKIGKAVLFY 170
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
S LP+G +D S H + PV GEKW+ W D
Sbjct: 171 SQLPDGNMDDWSHHAAMPVRVGEKWLMNLWTWD 203
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 71/227 (31%), Positives = 106/227 (46%), Gaps = 33/227 (14%)
Query: 65 PHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGET 123
PH G+ SV + + P F + ++C ++I +AK L R T+A +
Sbjct: 77 PHAVIGERSVR---VMLAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDGAQQ 133
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
+D RTS G+F E ++ IE +IA + +P +GE +L Y GQ+Y H
Sbjct: 134 IDEH---RTSDGMFFGLGEQP--LIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPH 188
Query: 184 YDAFDPQEYG-----PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 238
D FDP + G QR+AS ++YL + GG T FP IGL
Sbjct: 189 QDWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFPE---------------IGL 233
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
V +G + F + D S+H PV +GEKW+ATKW+R++
Sbjct: 234 TVTALRGSAVCFT----YESGDVFSLHAGLPVTRGEKWIATKWLRER 276
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 97/211 (45%), Gaps = 37/211 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + + E+C +++ +++ LR ST TVD G RTS G F
Sbjct: 101 LPRVVVLGGLLSDEECDALVELSRPRLRRST-------TVDAQTGGSQVHADRTSRGTFF 153
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
IE +IA++ P NGE +L Y G ++ HYD FDP E G
Sbjct: 154 ERGAHP--VCATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLL 211
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
Q QRVA+ ++YL GG T FP + L+V +G+ + F
Sbjct: 212 RQGGQRVATVVMYLNTPARGGATTFPDAH---------------LEVAAVKGNAVFFSYD 256
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
P+ ++HG PV +GEKW+ATKW+R+
Sbjct: 257 RPHPMT--RTLHGGAPVTEGEKWIATKWLRE 285
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 112/232 (48%), Gaps = 43/232 (18%)
Query: 78 PFQVLSWMPR------------ALYFPNFA------TPEQCKSIINMAKLNLRPSTLALR 119
P +++S +PR + PN A + E+C +I +++ ++ S + R
Sbjct: 95 PIRLISQLPRFTVADREVELAAVMSNPNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDR 154
Query: 120 K-GETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQ 178
+ G + +++ +R S G E+E + IE +++ + LP GE IL Y G
Sbjct: 155 ESGGSYESS--VRKSEGSHFERGENE--LVRRIEARLSALVDLPVNRGEPLQILHYGPGG 210
Query: 179 KYNSHYDAFDPQEYGPQ-----KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 233
+Y +H D F+P++ G QR+ + ++YL D+ EGGET FP
Sbjct: 211 EYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFP-------------- 256
Query: 234 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
IG KP +G + F +G +D +H PV++G+KW+ TKW+R++
Sbjct: 257 -DIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKWIMTKWLRER 307
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 69/199 (34%), Positives = 100/199 (50%), Gaps = 30/199 (15%)
Query: 93 NFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 151
N E+C+ +I +A+ L R T+A VD RTS G+F + +E + I
Sbjct: 102 NLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR---RTSEGMFFTL--NELPLVGRI 156
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG-----PQKSQRVASFLVY 206
E+++A + +P +GE IL Y GQ+Y H+D FDPQ+ G QRVAS ++Y
Sbjct: 157 EQRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRVASVVMY 216
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L +GG T FP +GL V R+G + F G D S+H
Sbjct: 217 LNTPAQGGGTAFP---------------ELGLTVTARRGAAVYF--AYEGG--DQQSLHA 257
Query: 267 SCPVVKGEKWVATKWIRDQ 285
PV +GEKW+ATKW+R++
Sbjct: 258 GLPVQRGEKWIATKWLRER 276
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 55/136 (40%), Positives = 81/136 (59%), Gaps = 5/136 (3%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSSGVFISA 140
LSW PR + F + E+C I +AK L S +A GE+V++ +RTSSG+F+S
Sbjct: 59 LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D+ + +E K+A T LP NGE+ IL Y+ GQKY H+D F Q R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174
Query: 201 ASFLVYLTDLEEGGET 216
A+ L+YL+++E+GGET
Sbjct: 175 ATVLMYLSNVEKGGET 190
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 98/212 (46%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 95 LPRVVVLGGLLSDDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 147
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 148 QPGQD--ALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 205
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 206 QHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAAVKGNAVFFSYD 250
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 251 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 280
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 108/204 (52%), Gaps = 21/204 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LSW PR + F + +C +I+MA + S+L + G +N+QG A
Sbjct: 62 LSWHPRVFLYEGFLSDMECDHLISMAH-GKKQSSLVV-GGSAGNNSQG-----------A 108
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
E + IE++I+ + LP+ GE+ IL+Y++ + S Y+ ++ Q R+
Sbjct: 109 SIEDTIVSTIEDRISVWSFLPKDFGESMQILKYEVNK---SDYNNYESQSSSGH--DRLV 163
Query: 202 SFLVYLTDLEEGGETMFPFEN--GMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+ L+YL+D++ GGET FP G + + +C G V+P +G+ +L ++L P+G I
Sbjct: 164 TVLMYLSDVKRGGETAFPRSELKGTKVELAAP-SECAGYAVQPVRGNAILLFNLKPDGVI 222
Query: 260 DPTSIHGSCPVVKGEKWVATKWIR 283
D S + C V++GE+W+A K I
Sbjct: 223 DKDSQYEMCSVLEGEEWLAIKHIH 246
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+P + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 105 LPCVVVLGGFLSGGECDALIALA----RPR---LARSRTVDNANGEHVVHAARTSDSMCL 157
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 158 RVGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLL 215
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 216 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 260
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 261 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 100/212 (47%), Gaps = 21/212 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I ++ P + F + +P + + +A+ L +T+ K D+ RTS G
Sbjct: 294 IKMEIRLLNPFIIVFHDVLSPREIDELQKLARPLLERTTVVKFKKYEKDSR---RTSKGT 350
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQ 195
+I D + IE +I + L E F ++ Y +G Y +H D D +
Sbjct: 351 WIE--RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDTWADKKE 408
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
+ R+A+ L YLTD+E+GG T+F N V P++G L +Y+L
Sbjct: 409 EDDRIATVLFYLTDVEQGGATVFTILNQ---------------AVSPKRGTALFWYNLHR 453
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NGT D ++HG CPV+ G KW+ T WIR++ Q
Sbjct: 454 NGTGDTRTLHGGCPVLVGSKWIMTLWIRERMQ 485
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+P + F + +C ++I +A RP L + TVDN G RTS + +
Sbjct: 105 LPCVVVLGGFLSGGECDALIALA----RPR---LARSRTVDNANGEHVVHAARTSDSMCL 157
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD FDP G
Sbjct: 158 RVGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLL 215
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 216 QAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYD 260
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ S+H PV+ GEKWVATKW+R++
Sbjct: 261 RPHPMT--RSLHAGAPVLAGEKWVATKWLRER 290
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 117 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 169
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 170 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 227
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 228 QHGGQRVASLVMYLNTPERGGATRFPD---------------VHLDVAAVKGNAVFFSYD 272
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 273 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 302
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 97 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 149
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 150 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 207
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T FP + L V +G+ + F
Sbjct: 208 QHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAAVKGNAVFFSYD 252
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 253 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 282
>gi|332187533|ref|ZP_08389270.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
gi|332012462|gb|EGI54530.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
Length = 228
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 28/200 (14%)
Query: 90 YFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLD 149
Y +F TP QC ++I M N RPSTL + + G RTS ++ E +
Sbjct: 47 YQADFLTPAQCDALIAMIDANRRPSTLLSDRPDY-----GFRTSESCDMNRWSPE---VQ 98
Query: 150 LIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRVASFL 204
I+E IA++ +P GE RY GQ++ +H+D F E +K QR + +
Sbjct: 99 PIDESIAQLLGIPPEQGETMQGQRYAPGQQFRAHHDYFHESESYWEKVKVHGGQRTWTAM 158
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL D+ EGG T FP G++V PR+G L + ++L +G+ + ++
Sbjct: 159 IYLNDVPEGGATWFP---------------QAGIRVAPRRGLLLAWNNMLLDGSPNDATL 203
Query: 265 HGSCPVVKGEKWVATKWIRD 284
H PVV+G K+V TKW R+
Sbjct: 204 HEGMPVVEGVKYVITKWFRE 223
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 100/202 (49%), Gaps = 29/202 (14%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTS-SGVFISAAEDESGTL 148
F F T ++C +I+ + L P+ + R G + + +RTS G+F A ED +
Sbjct: 126 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--VRTSDGGIFGPAREDL--VI 181
Query: 149 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVASFLVYL 207
I +IA + GE +LRY +GQ+Y H+D P ++QR + L+YL
Sbjct: 182 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 235
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
+ GGET+FP +GL VK R+GD LLF + G ++H
Sbjct: 236 NEGYAGGETIFPR---------------LGLSVKGRKGDALLFRNTDAQGQAAEAAVHLG 280
Query: 268 CPVVKGEKWVATKWIRDQEQYD 289
PV+ G+KW+ T+WIR +++D
Sbjct: 281 APVMAGQKWLCTRWIR-HDRHD 301
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 105/210 (50%), Gaps = 27/210 (12%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG-IRTSSG 135
I +VL P + F + + + + +A+ +L S + + N QG R S+G
Sbjct: 310 IKMEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERSMVVKYRA----NVQGKHRISAG 365
Query: 136 VFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
++ E + L IE +IA + L E F ++ Y IG +Y +H+D F
Sbjct: 366 TWV---ERKYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTV-- 420
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
+ R+A+ L Y+ D+E+GG T+FP +G V+ ++G+ L +Y++
Sbjct: 421 -EDNRLATVLFYMNDVEQGGATVFP---------------RLGQTVRAKRGNALFWYNMQ 464
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
NGT+D ++HG CP++ G KW+ T+WI D
Sbjct: 465 HNGTVDDRTLHGGCPILVGSKWIFTQWISD 494
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 98/199 (49%), Gaps = 30/199 (15%)
Query: 93 NFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 151
N + +C +I +A+ L R T+ + VD RTS G+F + DE + I
Sbjct: 102 NILSARECDELIALARPRLQRALTVDSEGRQQVDRR---RTSEGMFFTL--DEVPLVGRI 156
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK-----SQRVASFLVY 206
E ++A + +P +GE IL Y GQ Y H+D FDP + G + QR+AS ++Y
Sbjct: 157 ERRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASVVMY 216
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L GG T FP +GL V R+G + F G D +S+H
Sbjct: 217 LNTPARGGGTAFP---------------ALGLTVTARRGAAVYF--AYEGG--DCSSLHA 257
Query: 267 SCPVVKGEKWVATKWIRDQ 285
PV++GEKW+ATKW+R++
Sbjct: 258 GLPVLEGEKWIATKWLRER 276
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 98/216 (45%), Gaps = 23/216 (10%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSS 134
+ Q +S PRA + + ++C ++I A+ L S + G+ V N S
Sbjct: 122 QVSVQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCSC 181
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
F E+ S I E+ A++ P + E RY G+++ H D F
Sbjct: 182 ASF--PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLNN 239
Query: 195 QK-----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
K R+A+ L+YL ++E GG T FP G +V+P++G L
Sbjct: 240 DKIMGSSGHRIATVLLYLNEVEAGGATFFPNP---------------GFEVRPQKGGALY 284
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F +G++DPTS+H C V +GEKW+AT W R++
Sbjct: 285 FAYQQADGSMDPTSLHEGCAVTQGEKWIATLWFRER 320
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 84/145 (57%), Gaps = 5/145 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 138
+ +SW PR + F + +C ++ +AK ++ S +A + G++V + +RTSSG+F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+D + IEE+IA T LP+ N E +LRY+ GQKY H+D F + +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 199 RVASFLVYLTDLEEGGETMFPFENG 223
R A+ L+YL+ + EGGET+FP G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 113/213 (53%), Gaps = 24/213 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ + P+ + F + + + + + +AK L +T+A ++ + ++ + S
Sbjct: 319 PFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSS 378
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F ++ T+ I +++A +T L E ++ Y +G +Y+ H+D F +G
Sbjct: 379 WF---PDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFF---HWGKL 432
Query: 196 KS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
K R+A+ L Y++D+ GG T+FP +G+ ++ R+G +Y+L
Sbjct: 433 KEVNRIATVLFYMSDVSIGGATVFP---------------KLGVTLEARKGTAAFWYNLH 477
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G +D +++HG+CPV+ GEKWVA KWIR++ Q
Sbjct: 478 SSGELDYSTLHGACPVLIGEKWVANKWIRERGQ 510
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 100/198 (50%), Gaps = 29/198 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
NF T E+C ++I+ AK ++ P+ + G RTSS ++ A ED L +
Sbjct: 8 NFLTEEECDALIDSAKDHMTPAPVV---GPGNGEVSVSRTSSTCYL-ARED----LPSVC 59
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
K+ +T P + E + RY+ G+ Y HYDAFD + + QRVA+ LVYL
Sbjct: 60 TKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATVLVYL 119
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
D+E GGET F +G+++KPR+G+ L+F+ +G +D +H +
Sbjct: 120 NDVERGGETSF---------------SKLGVRIKPRKGNALIFFPATLDGVLDQNYLHAA 164
Query: 268 CPVVKGEKWVATKWIRDQ 285
P V KWV+ WIR +
Sbjct: 165 EPAVD-PKWVSQIWIRQR 181
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 100/202 (49%), Gaps = 29/202 (14%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTS-SGVFISAAEDESGTL 148
F F T ++C +I+ + L P+ + R G + + IRTS G+F A ED +
Sbjct: 143 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--IRTSDGGIFGPAREDL--VI 198
Query: 149 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVASFLVYL 207
I +IA + GE +LRY +GQ+Y H+D P ++QR + L+YL
Sbjct: 199 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 252
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
+ GGET+FP +GL VK R+G+ LLF + G ++H
Sbjct: 253 NEGYAGGETIFPR---------------LGLSVKGRKGNALLFRNTDAQGQAAEAAVHLG 297
Query: 268 CPVVKGEKWVATKWIRDQEQYD 289
PV+ G+KW+ T+WIR +++D
Sbjct: 298 APVMAGQKWLCTRWIR-HDRHD 318
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 55/144 (38%), Positives = 81/144 (56%), Gaps = 15/144 (10%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSG 135
+S PR + +F + ++ +I++A+ L+ S +A DN G +RTSSG
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKSTLSEVRTSSG 106
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
F+ +D ++ IE+KIA T LP+ NGE +LRYK G+KY HYD F +
Sbjct: 107 TFLRKGQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVR 164
Query: 196 KSQRVASFLVYLTDLEEGGETMFP 219
R A+ L+YLTD+ EGGET+FP
Sbjct: 165 GGHRYATVLLYLTDVPEGGETVFP 188
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 102/215 (47%), Gaps = 24/215 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I ++ S PR + + N T E+ ++ +A+ LR ST+ + T+ R +
Sbjct: 333 IKMELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTK-YRIAKAA 391
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
F+ +E + + + +I VT L E + Y IG Y HYD E QK
Sbjct: 392 FLQNSEHDH--IVKMTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEV--QK 447
Query: 197 S----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
R+A+++ Y++D+E GG T+FP I L + P++G +++
Sbjct: 448 DFGWGNRIATWMFYMSDVEAGGATVFP---------------QINLALWPQKGSAAFWFN 492
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L PNG D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 493 LHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQ 527
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 96/212 (45%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 97 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 149
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 150 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 207
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T P + L V +G+ + F
Sbjct: 208 QHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAAVKGNAVFFSYD 252
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 253 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 282
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 83/145 (57%), Gaps = 5/145 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFI 138
+ +SW PR + F + +C ++ +AK + S +A + G++V + +RTSSG+F+
Sbjct: 46 KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSE--VRTSSGMFL 103
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+D + IEE+IA T LP+ N E +LRY+ GQKY H+D F + +
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161
Query: 199 RVASFLVYLTDLEEGGETMFPFENG 223
R A+ L+YL+ + EGGET+FP G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 96/212 (45%), Gaps = 37/212 (17%)
Query: 85 MPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFI 138
+PR + ++C ++I +A RP L + TVDN G RTS + +
Sbjct: 117 LPRVVVLGGLLADDECDALIALA----RPQ---LARSRTVDNRDGSEIVHAARTSHSMAL 169
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---- 194
+D IE +IA++ P +GE +LRY G +Y HYD F+P G
Sbjct: 170 QPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLL 227
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QRVAS ++YL E GG T P + L V +G+ + F
Sbjct: 228 QHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAAVKGNAVFFSYD 272
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
P+ ++H PV+ GEKWVATKW+R++
Sbjct: 273 RPHPMT--RTLHAGAPVLAGEKWVATKWLRER 302
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 115/249 (46%), Gaps = 45/249 (18%)
Query: 49 PSARVVESVKDEYKWMPHGQAGDDSVTNIPF-QVLSWMPRALYFPNFATPEQCKSIINMA 107
PS R +ES+K +N+P+ + ++ PR N T ++C+ ++ +A
Sbjct: 42 PSLRGLESLK----------------SNLPYLEQINASPRVFRIRNLLTKQECEHLMLLA 85
Query: 108 KLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGE 167
T+ + G RT+ G ++ +D+ + +EE + K+T GE
Sbjct: 86 FRKGLSKTMIMPYGTHKLVESTTRTNDGAWLDFLQDD--VVRRLEETLGKLTKTTPQQGE 143
Query: 168 AFNILRYKIG-QKYNSHYDAFDPQEYGP----QKSQRVASFLVYLTDLEEGGETMFPFEN 222
+L Y G Q + HYD FDP P Q R + +VYL EGGET FP
Sbjct: 144 NLQVLHYSNGAQFFQEHYDYFDPARDPPESFEQGGNRYITVIVYLEAALEGGETHFP--- 200
Query: 223 GMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN--GT----IDPTSIHGSCPVVKGEKW 276
+GLK+ + GD L+FY+L + GT ++ +IH + P V+GEKW
Sbjct: 201 ------------ELGLKLTAQPGDALMFYNLKEHCSGTDPDCVEKKTIHAALPPVRGEKW 248
Query: 277 VATKWIRDQ 285
VA KWI ++
Sbjct: 249 VAVKWIHEK 257
>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
Length = 201
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 106/217 (48%), Gaps = 25/217 (11%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P Q+L+ P +P+ + E C+S+IN+A+ L P+T+ + G V + +R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE----YG 193
+E + I ++IA++ P E + Y G K+ +H D +D QE +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFL 118
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QR+ + ++YL D+ GGET FP + ++V P G L+F +
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163
Query: 254 LPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
P+ +I D S+HGS + GEKW+ T W ++ QY
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 108/219 (49%), Gaps = 34/219 (15%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKG--ETVDNTQGIRTS 133
+I ++ PR F + + +C ++I ++ L+ S + +G E VD+T RTS
Sbjct: 116 DIKVVMVCTAPRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEFVDDT---RTS 172
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
G + + E+ + I+ +IA++T P + E IL Y +G +Y H+D F+PQ+ G
Sbjct: 173 YGAYFNKGENS--LVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPG 230
Query: 194 ---PQKS--QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 248
P +S QR+A+ ++YL D+E GG T+FP N L+ +PR+G +
Sbjct: 231 LPSPLESGGQRIATVVMYLNDVEAGGGTIFPHLN---------------LETRPRKGGAI 275
Query: 249 LFYSLLPNGTIDPTSIHGSCPV---VKGEKWVATKWIRD 284
F L SI C + KW+AT+W RD
Sbjct: 276 YFSYQLAVA----RSIRSRCMAARRIARRKWIATQWFRD 310
>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
Length = 201
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 106/217 (48%), Gaps = 25/217 (11%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P Q+L+ P +P+ + E C+S+IN+A+ L P+T+ + G V + +R S +
Sbjct: 4 PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE----YG 193
+E + I ++IA++ P E + Y G K+ +H D +D QE +
Sbjct: 61 FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFL 118
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
QR+ + ++YL D+ GGET FP + ++V P G L+F +
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163
Query: 254 LPNGTI-DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
P+ +I D S+HGS + GEKW+ T W ++ QY
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 98/200 (49%), Gaps = 24/200 (12%)
Query: 87 RALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
R F FA+PE+C+ + + K L R + + V+ R S+ ++ D
Sbjct: 339 RLQVFRQFASPEECRHLQHAGKRRLERAVAWTDGRFQPVE----FRISTAAWLQP--DHD 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+ I +I T + EA I Y +G Y H+D + P +R+A+F++
Sbjct: 393 AIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFD-HSSRGTNP-DGERLATFMI 450
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL +++GG T FP +G V+P GD + +Y+L P+G DP ++H
Sbjct: 451 YLNPVKQGGFTAFPR---------------LGAAVQPGYGDAVFWYNLQPSGVGDPLTLH 495
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
G+CPV++G KWVA KWI ++
Sbjct: 496 GACPVLRGSKWVANKWIHER 515
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 109/217 (50%), Gaps = 26/217 (11%)
Query: 79 FQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI 138
+ LS P +Y F P +C+++I++A+ ++ + ++L V +QG RT S ++
Sbjct: 50 METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGSSGV--SQG-RTGSNCWL 106
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---PQ--EYG 193
E+ I E++AK P E ++ Y Q+Y HYDA+D P+
Sbjct: 107 RYQEEPLARR--IGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCT 164
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
Q QR+ + L+YL ++EEGG T FP NA G++V PR+G +F ++
Sbjct: 165 RQGGQRMVTALLYLNEVEEGGATAFP-----NA----------GVEVAPRKGRIAIFNNV 209
Query: 254 LPN-GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+ G P S+HG PV GEKW A+ W R + ++
Sbjct: 210 GADPGRPHPRSLHGGMPVKSGEKWAASIWFRARPAHE 246
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 104/211 (49%), Gaps = 21/211 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ ++LS P + F + + + SI N+AK L + + G ++ RT+ G
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKDGSYEEDPA--RTTKGT 331
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
++ + S + + + +T L + + F +L Y IG Y +H+D E G
Sbjct: 332 WLV---ENSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADTEMG-NF 387
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
S R+A+ + YL+D+ +GG T+FP +GL V P++G LL+Y+L
Sbjct: 388 SNRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 432
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G D + H +CP + G +WV TKWI ++EQ
Sbjct: 433 GDGDNRTAHSACPTIVGSRWVMTKWINEREQ 463
>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 508
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/225 (30%), Positives = 108/225 (48%), Gaps = 26/225 (11%)
Query: 72 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMA-KLNLRPSTLALRKGETVDNTQGI 130
D N+ VLS +PR +F + + + ++N+A K L+ ST+
Sbjct: 274 DPTMNMTMTVLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDT 333
Query: 131 RTSSGVFISAAED--------ESGTLDLIEEKI---AKVTMLPRIN------GEAFNILR 173
RTS+ +I +D + L ++E + + + +P E ++
Sbjct: 334 RTSTNDWIPRHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVN 393
Query: 174 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 233
Y++GQ+Y H+D P Q S R A+ L YL D +GGET FP ++AD +
Sbjct: 394 YQVGQQYTPHHDFTMPGLVNMQPS-RFATLLFYLNDDMDGGETAFP--RWLHAD-----E 445
Query: 234 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVA 278
+ LKVKP +G +LFY+LLP+G D S H + PV +GEKW+
Sbjct: 446 EGGSLKVKPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKWLT 490
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 101/216 (46%), Gaps = 24/216 (11%)
Query: 70 GDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
G D+V + F+ L+ PR + NF + E+C + A+ P+T+ + V
Sbjct: 83 GADAV--VTFEQLA--PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAH- 137
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
R++ + AA E + +E +I ++T P E + RY GQ Y HYD F
Sbjct: 138 FRSNDSAQLPAAGSE--LVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQ 195
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
Q QR+A+ ++YL E GG T F +G+++ PR+G L
Sbjct: 196 DMVEAQGGQRLATLILYLRAPEAGGATYF---------------ANLGMRIAPRKGSALF 240
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
F P+ + ++HG V+ GEKW+AT+W RD+
Sbjct: 241 F--TYPDPGNNSGTLHGGEAVLAGEKWIATQWFRDR 274
>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
Length = 239
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 93/199 (46%), Gaps = 30/199 (15%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
NF E+CK I+N + L S + K + IR S ++S + + +
Sbjct: 61 NFINKEKCKEIMNNTQNKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYL 207
+KI++ +P N E ++RY GQ YN H+DA E+ + QR + LVYL
Sbjct: 112 QKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVYL 171
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-IDPTSIHG 266
+ EGG T F K + LKVKP GD ++FY L N + P S+H
Sbjct: 172 NNEFEGGHTFF---------------KNLNLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 267 SCPVVKGEKWVATKWIRDQ 285
PV GEKW+A W R++
Sbjct: 217 GMPVTSGEKWIANLWFRER 235
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ ++LS P + F + + + SI N+AK L + + G D RT+ G
Sbjct: 310 LKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQDGN--DKEDPARTTKGT 367
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
++ + S + + + +T + + F +L Y IG Y +H+D + E G
Sbjct: 368 WLV---ENSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFDFLEDTEMG-HF 423
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
S R+A+ + YL+D+ +GG T FP +GL V P +G LL+Y+L
Sbjct: 424 SDRIATAVFYLSDVPQGGATTFP---------------DLGLSVFPEKGAALLWYNLDHK 468
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G D + H +CP + G +WV TKWI ++EQ
Sbjct: 469 GVGDNRTAHSACPTIVGSRWVMTKWINEREQ 499
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 77/134 (57%), Gaps = 15/134 (11%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+VLSW PRA + NF + E+C +I++AK +++ ST+ VD+ G +RTS
Sbjct: 97 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ +D+ + IE++IA T +P GE +L Y++GQKY H+D F
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207
Query: 194 PQKSQRVASFLVYL 207
QR+A+ L+YL
Sbjct: 208 KNGGQRIATLLMYL 221
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 84/146 (57%), Gaps = 5/146 (3%)
Query: 66 HGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETV 124
H DD +V+SW PRA + NF T E+C+ +I++AK N+ S++ G++
Sbjct: 66 HTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSK 125
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
D+ +RTSSG F++ D+ + IE++IA + +P +GE +L Y++GQKY HY
Sbjct: 126 DSR--VRTSSGTFLARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHY 181
Query: 185 DAFDPQEYGPQKSQRVASFLVYLTDL 210
D F QR+A+ L+YLTD+
Sbjct: 182 DYFLDDFNTKNGGQRIATVLMYLTDV 207
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 97/209 (46%), Gaps = 26/209 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P NF + ++C++ + M K + + + + D ++ + + F S
Sbjct: 17 PIVYVVNNFLSDDECEAFVEMGKGKMERAKVI-----SDDESEFHASRTNDFCWLEHSAS 71
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRV 200
+ + ++ + + +P N E F ++ Y G +Y H+DAFD Q + QR+
Sbjct: 72 DVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGGQRM 131
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-I 259
+ L YL D+EEGG T FP I + VKP +GD ++F++ + T I
Sbjct: 132 VTALAYLNDVEEGGATDFP---------------KINVSVKPNKGDVVVFHNCIEGTTEI 176
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
+P ++HG PVV GEKW W R+ Y
Sbjct: 177 NPQALHGGSPVVAGEKWAVNLWFRESAIY 205
>gi|451927223|gb|AGF85101.1| 4-hydroxylase [Moumouvirus goulette]
Length = 239
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 93/199 (46%), Gaps = 30/199 (15%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
NF E+C I+N + L S + K + IR S ++S + + +
Sbjct: 61 NFINKEKCGEIMNNTQSKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYL 207
+KI++ +P N E ++RY GQ YN H+DA E+ + QR + L+YL
Sbjct: 112 QKISQQFNIPIQNAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLIYL 171
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-IDPTSIHG 266
+ EGG T F K +GLKVKP GD ++FY L N + P S+H
Sbjct: 172 NNEFEGGHTFF---------------KNLGLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216
Query: 267 SCPVVKGEKWVATKWIRDQ 285
PV GEKW+A W R++
Sbjct: 217 GMPVTNGEKWIANLWFRER 235
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 106/211 (50%), Gaps = 23/211 (10%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+VL+ P + + A+ + +I +AK + +T+ V N RTS ++
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRISRATIRDDGEPQVSNA---RTSQNAWLD 359
Query: 140 AAEDESGTLDLIEEKIAKVTM-LPRINGEAFNILRYKIGQKYNSHYD-AFDPQEY-GPQK 196
A +D T ++ ++ +T L + + E + Y +G Y +H+D A + Y G +
Sbjct: 360 AGDDRVVTT--LDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLRV 417
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ + YL+D+E GG T+FP +GL V PR+G +L+Y+L N
Sbjct: 418 GNRIATVMFYLSDVEIGGATVFP---------------QLGLAVFPRKGSAILWYNLYRN 462
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G D ++H +CPV+ G KWVA +WI + Q
Sbjct: 463 GKGDRRTLHAACPVLSGSKWVANQWIHEYHQ 493
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 104/207 (50%), Gaps = 25/207 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + N + ++ + II ++K L+ S + + V N RTS +++ + E
Sbjct: 13 PLIVIYHNAISDKEIEQIIQVSKPMLKRSMVGESFSKEVSNE---RTSQNAWLADYDFE- 68
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-EYGPQKS----QRV 200
+ ++ + +T L R + E+ + Y IG Y H+D P K R+
Sbjct: 69 -LVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPYKDMGLGNRI 127
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ + YL+D+E+GG T+FP IG+ V P++G + +Y+LLP+GT D
Sbjct: 128 ATLMYYLSDVEQGGATVFP---------------QIGVGVFPKKGSAIFWYNLLPDGTGD 172
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQEQ 287
++HG+CPV+ G KWVA KWI Q
Sbjct: 173 ERTLHGACPVLLGSKWVANKWIHQYHQ 199
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR +++ N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 208 PRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 264
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 265 KHVAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 369
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/187 (29%), Positives = 89/187 (47%), Gaps = 38/187 (20%)
Query: 124 VDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP--------RING--------- 166
+ QG+ T S + + SG ++++ + +KV P R+N
Sbjct: 134 IKELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFN 193
Query: 167 ----EAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLTDLEEGGETMFPF 220
E ++ Y +G Y+ HYD F+ R+A+ L YLTD+E+GG T+FP
Sbjct: 194 LYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP- 252
Query: 221 ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATK 280
I V P++G +++Y+L NG ID ++H +CPV+ G KWV K
Sbjct: 253 --------------NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNK 298
Query: 281 WIRDQEQ 287
WIR++EQ
Sbjct: 299 WIREREQ 305
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 111/227 (48%), Gaps = 35/227 (15%)
Query: 75 TNIPF--------QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK--GETV 124
TN PF ++L+ P + + + TP + + + +A L+ +T+ +K TV
Sbjct: 310 TNSPFLRLAPLKTELLALDPYMVLYHDVITPSEIRELQYLAVPTLKRATVFNQKMGRNTV 369
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH 183
T RTS +++ D L + + +I+ +T E ++ Y +G Y+ H
Sbjct: 370 VKT---RTSKVTWLT---DSLNPLTVRLNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLH 423
Query: 184 YDAFDP---QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
+D F+ ++ R+A+ L YLTD+E+GG T+FP I +
Sbjct: 424 FDYFNATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFP---------------NIKQAI 468
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
P++G +++Y+L N DP ++H +CPV+ G KWV KWIR+ +Q
Sbjct: 469 FPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWIREHQQ 515
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 103/206 (50%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + + E+ + +AK LR +T++ ++ Q R + ++S ED
Sbjct: 327 PRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQ-YRITKSAWLSGYEDP- 384
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQEYGPQKS-QRVA 201
+ + +I VT L E + Y IG +Y H+D ++P + + RVA
Sbjct: 385 -VVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVA 443
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+E GG T+FP +G V P++G + +Y+LL +G D
Sbjct: 444 TWLFYMSDVEAGGATVFPE---------------VGAAVYPKKGTAVFWYNLLESGEGDY 488
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 489 STRHAACPVLVGNKWVSNKWIHERGQ 514
>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
Length = 226
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 98/201 (48%), Gaps = 28/201 (13%)
Query: 89 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 148
Y P+F C ++ + N R ST+ ++ Q RTS + + S +
Sbjct: 43 FYHPDFLDAATCDRLVALIDANRRRSTVLAE-----ESVQDFRTSDSCDM---DRWSPDV 94
Query: 149 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASF 203
+E IA + + ++GE RY +GQ + +H+D F + Q Y P+ QR +
Sbjct: 95 RPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWTA 154
Query: 204 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 263
++YL D+EEGG T FP IG++V P++G L + ++ P+G + +
Sbjct: 155 MIYLNDVEEGGATWFP---------------TIGIRVAPKKGLLLTWNNMKPDGDRNTAT 199
Query: 264 IHGSCPVVKGEKWVATKWIRD 284
+H PVV+G K++ TKW R+
Sbjct: 200 LHEGMPVVQGTKYIVTKWFRE 220
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 99/185 (53%), Gaps = 20/185 (10%)
Query: 99 QCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKV 158
+C +++M + N+ S+LA T G R SS I E + IE++I+
Sbjct: 2 ECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI-----EDIVVSKIEDRISLW 47
Query: 159 TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMF 218
+ LP+ NGE+ +L+Y + + + + + R+A+ L+YL+D+++GGET+F
Sbjct: 48 SFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVF 102
Query: 219 PFENGMNADGSYDY-QKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWV 277
P +A +C G V+P +G+ +L ++L P+G D S + CPV++GEKW+
Sbjct: 103 PRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLEGEKWL 162
Query: 278 ATKWI 282
A K I
Sbjct: 163 AIKHI 167
>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
Length = 421
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 106/230 (46%), Gaps = 39/230 (16%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+V S PR L +F TPE+C +I+ AK + ST++ V + RTSS ++
Sbjct: 195 KVRSISPRVLEVEDFLTPEECHELISSAKPLMSRSTVSAEGDSAVSLQESSRTSSTAWLP 254
Query: 140 AAE--------DESGTL---DLIEEKIAKVTMLPRINGE------AFNILRYKIGQKYNS 182
D +L D + + V L I+ A+ +LRY++ Q Y+
Sbjct: 255 PHSHTLANKLYDRVSSLVGIDFRKHEHVVVEDLQAIDKRGGSSVTAWQVLRYEVNQHYHI 314
Query: 183 HYDAFDPQEY-----GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKC-I 236
H+D FDP + G +++ + +F YLTD+E G DY C
Sbjct: 315 HHDYFDPVLHRGFLQGDGRNRFITAFF-YLTDVERGDPRPIT-----------DYSDCNR 362
Query: 237 GLKVKPRQGDGLLFYSLLPNGT----IDPTSIHGSCPVVKGEKWVATKWI 282
GL+V P++G ++FYSLL +G +D S HG C V G KW A WI
Sbjct: 363 GLRVPPKRGKAIIFYSLLADGQRSGGLDVASWHGGCDVHNGTKWAANYWI 412
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + T ++ + I ++K LR +T++ V T R S +++A E
Sbjct: 342 PRIIRYHEIITEQEIEKIKELSKPRLRRATIS-NPITGVLETAHYRISKSAWLAAYE--H 398
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+D I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 399 PVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 458
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G VKP +G + +Y+L P+G D
Sbjct: 459 TWLFYMSDVAAGGATVFP---------------EVGAAVKPLKGTAVFWYNLFPSGEGDY 503
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 504 STRHAACPVLVGNKWVSNKWIHERGQ 529
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 107/224 (47%), Gaps = 30/224 (13%)
Query: 75 TNIPFQVLSWM--------PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
T PF +L+ + P + + + +P++ K + MA L+ +T+ + +
Sbjct: 310 TTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIKELQGMATPGLKRATV-YQASSGRNE 368
Query: 127 TQGIRTSSGVFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD 185
RTS ++ D L + + +I+ +T E ++ Y +G Y+ HYD
Sbjct: 369 VVKTRTSK---VAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYD 425
Query: 186 AFDP--QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 243
F+ R+A+ L YLTD+E+GG T+FP I V P+
Sbjct: 426 FFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP---------------NIRKAVFPQ 470
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G +++Y+L NG ID ++H +CPV+ G KWV KWIR++EQ
Sbjct: 471 RGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQ 514
>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
Length = 197
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 97/198 (48%), Gaps = 28/198 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR L N + ++CK I N+A L+ ST++ K +D + IR S ++ A+ED
Sbjct: 24 PRVL--KNVLSEDECKHIQNIASKKLQTSTVS--KSRDID--ESIRKSETAWLKASED-- 75
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
+D + K +T P N E +L+YK G Y H D F P + K++R+ +F++
Sbjct: 76 PVVDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDTF-PDD----KNKRMYTFII 130
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
L D EGGET FP N SY +K GD L F +L I ++H
Sbjct: 131 ALNDEYEGGETEFP-----NIKKSYRLEK----------GDALFFNTLNNYECITKKALH 175
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV GEKWV W+R
Sbjct: 176 GGTPVKSGEKWVCNLWVR 193
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 73/139 (52%), Gaps = 17/139 (12%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLT 208
+ +I+ +T E ++ Y +G Y+ HYD F+ R+A+ L YLT
Sbjct: 391 LNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLT 450
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
D+E+GG T+FP I V P++G +++Y+L NG ID ++H +C
Sbjct: 451 DVEQGGATVFP---------------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAAC 495
Query: 269 PVVKGEKWVATKWIRDQEQ 287
PV+ G KWV KWIR++EQ
Sbjct: 496 PVIVGSKWVCNKWIREREQ 514
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 105/206 (50%), Gaps = 25/206 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LSW PR + F T E+C +I++A + + +G + + + ++++
Sbjct: 61 LSWRPRVFLYKGFLTDEECDRLISLA-----------HGAKEISKGKGDGSRNNIQLASS 109
Query: 142 EDESGTLD----LIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
E S D IEE+I+ T +P+ N + ++ Y I ++ H+D FD + S
Sbjct: 110 ESRSHIYDDLLARIEERISAWTFIPKENSKPLQVMHYGI-EEAREHFDYFDNKTLISNVS 168
Query: 198 QRVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
+A+ ++YL+++ GGE +FP ++ + +D + D ++P +G+ +L ++
Sbjct: 169 L-MATLVLYLSNVTRGGEILFPKSELKDKVWSDCTKDSSI-----LRPVKGNAVLIFNAH 222
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATK 280
N + D S HG CPV++GE W ATK
Sbjct: 223 LNASADSRSTHGRCPVLEGEMWCATK 248
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 17/139 (12%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLT 208
+ +IA +T E ++ Y +G Y+ HYD F+ R+A+ L YLT
Sbjct: 391 LNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNTINSNLTAMSGDRIATVLFYLT 450
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
D+E+GG T+FP I V P++G +++Y+L NG D ++H +C
Sbjct: 451 DVEQGGATVFP---------------NIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAAC 495
Query: 269 PVVKGEKWVATKWIRDQEQ 287
PV+ G KWV KWIR++EQ
Sbjct: 496 PVIVGSKWVCNKWIREREQ 514
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 104/215 (48%), Gaps = 23/215 (10%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ +++S P + + + +P + + ++A L+ +T+ + ++ RTS
Sbjct: 321 LKMELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATV-FNQQSMRNHVVKTRTSKVT 379
Query: 137 FISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ---EY 192
++ D L + + +I +T E ++ Y +G Y+ HYD F+ +
Sbjct: 380 WLL---DTLNQLTIRLNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADL 436
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
R+A+ L YLTD+E+GG T+FP I V P+ G +++Y+
Sbjct: 437 TRLNGDRIATVLFYLTDVEQGGATVFP---------------NIEKAVFPKSGTAVVWYN 481
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L +G DP ++H +CPV+ G KWV KWIR+++Q
Sbjct: 482 LRHDGNGDPQTLHAACPVIVGSKWVCNKWIRERQQ 516
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 73/139 (52%), Gaps = 17/139 (12%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLT 208
+ +I+ +T E ++ Y +G Y+ HYD F+ R+A+ L YLT
Sbjct: 391 LNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGDRIATVLFYLT 450
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
D+E+GG T+FP I V P++G +++Y+L NG ID ++H +C
Sbjct: 451 DVEQGGATVFP---------------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAAC 495
Query: 269 PVVKGEKWVATKWIRDQEQ 287
PV+ G KWV KWIR++EQ
Sbjct: 496 PVIVGSKWVCNKWIREREQ 514
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 101/203 (49%), Gaps = 13/203 (6%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 140
+SW PR + F T E+C +I++A+ S E D+ G + +F S+
Sbjct: 60 TVSWQPRVFVYKGFLTDEECDHLISLAQGTKETS-------EGKDDDSGRIERNRLFASS 112
Query: 141 AE---DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
+ L IEE+++ T+LP+ N + ++ Y I N ++D F +
Sbjct: 113 TSLLNMDDNILSRIEERVSAWTLLPKENSKPLQVMHYGIEDAKN-YFDYFGNKSAIISSE 171
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
+A+ + YL+++ +GGE FP N S D K I ++P +G+ +LF+++ PN
Sbjct: 172 PLMATLVFYLSNVTQGGEIFFPKSEVKNKIWS-DCTK-ISDSLRPIKGNAILFFTVHPNT 229
Query: 258 TIDPTSIHGSCPVVKGEKWVATK 280
+ D S H CPV++GE W ATK
Sbjct: 230 SPDMGSSHSRCPVLEGEMWYATK 252
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 70/137 (51%), Gaps = 15/137 (10%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 210
+ ++I ++ E ++ Y +G Y SHYD + R+A+ + YL+D+
Sbjct: 383 LNKRIEDMSGFTMYGSEMLQVMNYGLGGHYASHYDFLNATSKTRLNGDRIATVMFYLSDV 442
Query: 211 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV 270
E+GG T+FP I V P++G +++Y+L NG D +IH +CPV
Sbjct: 443 EQGGATVFP---------------KIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPV 487
Query: 271 VKGEKWVATKWIRDQEQ 287
+ G KWV KWIR+ EQ
Sbjct: 488 IVGSKWVCNKWIRENEQ 504
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 50/129 (38%), Positives = 79/129 (61%), Gaps = 5/129 (3%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+V+SW PRA + NF T E+C+ +I++AK ++ ST+ G++ D+ +RTSSG F+
Sbjct: 79 EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSR--VRTSSGTFL 136
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+ D+ + IE+KIA T +P +GE +L Y++GQKY HYD F + Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194
Query: 199 RVASFLVYL 207
R+A+ L+YL
Sbjct: 195 RIATVLMYL 203
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 269 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 325
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 326 KHVAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 385
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + PR+G +Y+L PNG D
Sbjct: 386 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 430
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 431 KTRHAACPVLTGSKWVANKWLHERGQ 456
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 29/211 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + +I+ A N++ + + ET T RT+ G ++ +E
Sbjct: 327 PYVVLYHEVLSAREISMLISKAAQNMKNTRV---HRETKPKTNRGRTAKGHWLKKESNE- 382
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---PQEYGPQKSQ---- 198
I +I +T + E F ++ Y IG Y H D FD GP+ Q
Sbjct: 383 -LTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVL 441
Query: 199 --RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ L YL+D+E+GG T+F +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLSDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDTD 486
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G DP + H SCPV+ G KWV T+WIR+ Q
Sbjct: 487 GNGDPLTRHASCPVIVGSKWVMTEWIRESRQ 517
>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 193
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 99/208 (47%), Gaps = 21/208 (10%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+ LS PRA NF T + I+ + + +++ T + RTSS +++
Sbjct: 2 KALSCAPRAFQVENFLTDVEADHIVGLVQ-----KKNDMQRSSTNGHISETRTSSTTWLA 56
Query: 140 AAEDESGTLDLIEEKIAKV-----TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
D +D I ++A ML R E I+ Y +GQ+Y +H+D F + P
Sbjct: 57 RHSDP--VIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHD-FGYPKGDP 113
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
R +F +YL D+ GG+T FP +G+ L V P++G ++FY +
Sbjct: 114 GSPSRSINFCMYLNDVPAGGQTSFPRWRNAETNGA--------LNVVPKKGTAMIFYMVN 165
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWI 282
P+G +D + H + PV++GEK+ + WI
Sbjct: 166 PDGNLDDLTHHAALPVIEGEKFFSNLWI 193
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + + + + + +AK LR +T++ V T R S +++A ED
Sbjct: 346 PRIIRYHDVLSNSEIEKVKELAKPRLRRATIS-NPITGVLETAHYRISKSAWLTAYEDP- 403
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+D I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 404 -VVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 462
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L+Y++D+ GG T+F +G V P++G + +Y+L P+G D
Sbjct: 463 TWLIYMSDVPSGGATVFT---------------DVGAAVWPKKGSAVFWYNLFPSGEGDY 507
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 508 STRHAACPVLVGNKWVSNKWIHERGQ 533
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 82/143 (57%), Gaps = 16/143 (11%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+++S T++ +I+++T L E + Y IG +Y HYD + +E+ ++R+A
Sbjct: 376 DEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYD-YSRREWDIYNNRRIA 434
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L YLT +E+GG T+F +GL ++ +G + +Y+LLPNG+ D
Sbjct: 435 TWLSYLTTVEQGGGTVFT---------------ELGLHIRSIKGSAVFWYNLLPNGSGDE 479
Query: 262 TSIHGSCPVVKGEKWVATKWIRD 284
+ H +CPV++G KWV+ KWI +
Sbjct: 480 RTRHAACPVLRGNKWVSNKWIHE 502
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 97/211 (45%), Gaps = 29/211 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + ++ A N++ + + + E NT RT+ G ++ +E
Sbjct: 326 PYVVLYHEVLSAREISMLMGKAAQNMKNTRV---QSEKAVNTNRERTAKGYWLKKESNE- 381
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--AFDPQEYGPQKSQ----- 198
I +I +T + E F ++ Y IG Y+ H+D F Y ++S
Sbjct: 382 -MTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIVL 440
Query: 199 --RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ L YLTD+E+GG T+F +G V P+ G + +Y+L +
Sbjct: 441 GDRIATVLFYLTDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDTD 485
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G DP + H SCPVV G KWV T+WI + Q
Sbjct: 486 GNGDPLTRHASCPVVVGSKWVMTEWIHEARQ 516
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 28/211 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + +I A N++ + + +G N RT+ G + +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKATQNMKNTRVHKEQGVPKKNRG--RTAKGFWFKKESNE- 382
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQK 196
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDL 441
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ L YLTD+E+GG T+F AD +G V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------AD--------VGYSVYPQAGTAIFWYNLDTN 486
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTKHAACPVIVGSKWVMTEWIREKRQ 517
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 98/208 (47%), Gaps = 26/208 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ--GIRTSSGVFISAAED 143
P+ + F + + + + +A LR +T+ + N + R S ++S ED
Sbjct: 333 PKLIIFHDAILTNEIRKVKALASPRLRRATI---QNSVTGNLEFAEYRISKSAWLS--ED 387
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QR 199
+ + + +I + T L E + Y +G Y H+D +E KS R
Sbjct: 388 DGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEINAFKSLNTGNR 447
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G ++ P +G +Y+LL NG
Sbjct: 448 IATFLFYMSDVEAGGATVFP---------------QVGARLIPEKGSAAFWYNLLKNGEG 492
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 493 DYSTRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 110/257 (42%), Gaps = 61/257 (23%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN-TQGIRTSSGVFISAAEDE 144
PR Y NF + + + + P +A G T QG G ++ E
Sbjct: 205 PRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQG---GDGAVLTTRTSE 261
Query: 145 SGTLDLIEEKIAKVT----MLPRING------EAFNILRYKIGQKYNSHYDAF------- 187
+ D+ ++ V L R+NG + ILRYK+GQ Y +H+D F
Sbjct: 262 N-AFDITTKQSFDVKKRAFRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKD 320
Query: 188 ---DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNAD----------------- 227
DP G S R A+ +YL+D+ GG+T+FP ++A+
Sbjct: 321 FNWDPLSGG---SNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESPSASE 377
Query: 228 ------------GSYD---YQKCI-GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVV 271
GS++ KC V PR+GD +LFYS P+G +D S+HG+CP++
Sbjct: 378 LKEFVSNAGLMEGSWEDNLIHKCYEKFAVPPRRGDAILFYSQRPDGLLDTNSLHGACPIL 437
Query: 272 KGEKWVATKWIRDQEQY 288
G KW A W+ + +Y
Sbjct: 438 NGTKWGANLWVWNACRY 454
>gi|87199403|ref|YP_496660.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
gi|87135084|gb|ABD25826.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
Length = 211
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 97/198 (48%), Gaps = 28/198 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
+F + +C +I + + RPST+A G+ RTS + + E LD
Sbjct: 34 DFLSQAECNGLIARIERDRRPSTIADANGDHY-----FRTSETCDLPMDDPEIVALD--- 85
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
EK+ ++ + R GE RY+ GQ++ +H D FDP Q + QR +F+VYL
Sbjct: 86 EKLCALSGIGRPFGEPIQGQRYESGQEFKAHTDYFDPHGADFQRFCSVAGQRTWTFMVYL 145
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
D+E GG T F K I ++P +G + + + P+GT++P ++H +
Sbjct: 146 NDVEAGGATRF---------------KVIDKTIQPERGKLVCWNNRRPDGTVNPCTLHHA 190
Query: 268 CPVVKGEKWVATKWIRDQ 285
V KG K+V TKW R++
Sbjct: 191 MKVRKGLKYVITKWYREK 208
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 79/134 (58%), Gaps = 15/134 (11%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTS 133
+V+SW PRA + NF + E+C+ +I +AK ++ ST+ VD+T G +RTS
Sbjct: 98 EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
SG+F+ D+ + +IE++IA T +P +GE +L Y++GQKY H+D F +
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208
Query: 194 PQKSQRVASFLVYL 207
QR+A+ L+YL
Sbjct: 209 KNGGQRMATLLMYL 222
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 84/148 (56%), Gaps = 3/148 (2%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFI 138
+VL+W PR + NF + E+C + +A L ST+ + G+ + + +RTSSG+F+
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
++ E + + IE++I+ + +P NGE +LRY+ Q Y H+D F + Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNA 226
R+A+ L+YL+D E GET FP +NA
Sbjct: 200 RIATMLMYLSDNIERGETYFPLAGSVNA 227
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 102/221 (46%), Gaps = 27/221 (12%)
Query: 76 NIPFQVLSWM--------PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
N PF++L+ + P + + + + ++ + + MA +R ST+ G +
Sbjct: 313 NHPFRLLAPLKLEEHNLDPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQ-NKK 371
Query: 128 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 187
R S ++ A + T++ + + T L E + Y +G Y H+D F
Sbjct: 372 SAFRVSKNAWL--AYESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFF 429
Query: 188 -DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
DP Y ++ R+A+ + YL+D+E+GG T FPF + VKP+ G+
Sbjct: 430 RDPNHYPAEEGNRIATAIFYLSDVEQGGATAFPF---------------LDFAVKPQLGN 474
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L +Y+L + +D + H CPV+KG KW+ WI D Q
Sbjct: 475 VLFWYNLHRSLDMDYRTKHAGCPVLKGSKWIGNVWIHDMTQ 515
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 343 PRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 399
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T L E ++ Y IG Y H+D +E KS R+A
Sbjct: 400 KHVAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 459
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + PR+G +++L PNG D
Sbjct: 460 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWFNLKPNGEGDL 504
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 505 RTRHAACPVLTGSKWVANKWLHERGQ 530
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK LR +T++ ++ T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 28/211 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + +I A N++ + + +G N RT+ G + +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKEQGVPKKNRG--RTAKGFWFKKESNE- 382
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQK 196
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDL 441
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ L YLTD+E+GG T+F AD +G V P+ G + +Y+L N
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------AD--------VGYSVYPQAGTAIFWYNLDTN 486
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 73/137 (53%), Gaps = 20/137 (14%)
Query: 156 AKVTMLPRIN---GEAFNILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLTDL 210
A++T + N E ++ Y +G Y+ HYD F+ R+A+ L YLTD+
Sbjct: 396 ARITDMTGFNLHGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRIATVLFYLTDV 455
Query: 211 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV 270
E+GG T+FP I V P++G +++Y+L +G ID ++H +CPV
Sbjct: 456 EQGGATVFP---------------NIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPV 500
Query: 271 VKGEKWVATKWIRDQEQ 287
+ G KWV KWIR++EQ
Sbjct: 501 IVGSKWVCNKWIREREQ 517
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 103/215 (47%), Gaps = 23/215 (10%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ +++ P + + + +P + + MAK L+ +T+ T + Q ++T +
Sbjct: 316 LKMELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVY---NSTKNTNQFVKTRTAK 372
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
+ + + + ++I +T E ++ Y +G Y H+D F+ P
Sbjct: 373 VAWFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTT-NPHI 431
Query: 197 SQ----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
SQ R+A+ L YL D+E+GG T+FP I V P++G +++Y+
Sbjct: 432 SQINGDRIATVLFYLNDVEQGGATVFP---------------EIKKAVFPKRGSAIMWYN 476
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L +G + ++H +CPV+ G KWV KWIR++EQ
Sbjct: 477 LKDDGEGNRDTLHAACPVIVGSKWVCNKWIREREQ 511
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + + +++ MAK LR +T++ V T R S ++S E
Sbjct: 337 PRIVRYHEIISDSEIETVKEMAKPRLRRATIS-NPITGVLETAPYRISKSAWLSGYE--H 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
T++ I ++I VT L E + Y +G +Y H+D E K R+A
Sbjct: 394 STIERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+F +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFT---------------DVGAAVWPKKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWIHERGQ 524
>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'Black Sea 11']
gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Black Sea 11']
Length = 354
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/213 (32%), Positives = 102/213 (47%), Gaps = 26/213 (12%)
Query: 78 PFQVLSW-MPRALYFPNFATPEQCKSIINMAKLNLRPSTLA--LRKGETVDNTQGIRTSS 134
P +VL +P LY + + +C +I L+PS + L VDN +RTS
Sbjct: 146 PTEVLDQTLPVELYV-DVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDN---VRTSY 201
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
I+ + + T L ++ I++VT PR NGEA N+LRY GQ+Y HYDA + G
Sbjct: 202 VAIIAPSYCDWITRKL-DKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGS 260
Query: 195 QK---SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
QR+ + LVYL + +GGET FP + + V P G+ ++F
Sbjct: 261 MYKDGKQRIKTALVYLNTVRQGGETRFPK---------------LDISVSPTLGNMVVFS 305
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
+ +G + S H P KW+ TKWIR+
Sbjct: 306 NSDESGKLLLNSYHLGAPTFSENKWLVTKWIRE 338
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 102/194 (52%), Gaps = 25/194 (12%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
+F P++C+++I + + +PST+ E D Q RTSS + +D + I+
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTIT---SENPD--QQFRTSSTCHLGNMQDP--VIRKID 155
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQKSQRVASFLVYLTD 209
+I + + E Y++GQ++ H D F+P E YG + QR +F++YL +
Sbjct: 156 LQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNE 215
Query: 210 LEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 269
+E+GG+T+FP + IG K K +G +++ ++ P+G+++ ++H P
Sbjct: 216 VEQGGDTVFP-------------ELAIGFKAK--KGMAVIWNNINPDGSVNYQTLHQGMP 260
Query: 270 VVKGEKWVATKWIR 283
V KGEK + TKW R
Sbjct: 261 VQKGEKLIITKWFR 274
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 101/214 (47%), Gaps = 20/214 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I +V P+ F + T + + + +A L +T+ GE + T R S
Sbjct: 317 IKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATVHGENGELLHAT--YRISKSG 374
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YG 193
++S ++D G +D I+++I VT L E ++ Y IG +Y HYD E
Sbjct: 375 WLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDTFTS 434
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
R+++ L+Y++D+E+GG T+FP +G ++ P + +++L
Sbjct: 435 LGSGNRISTLLIYMSDVEKGGATVFP---------------GVGARLVPIKRAAAYWWNL 479
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G D ++ H CPV+ G KWV KWI ++ Q
Sbjct: 480 KRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQ 513
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 100/220 (45%), Gaps = 28/220 (12%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
P +V LS P + F + + +++ +AK + +T+ V N RTS
Sbjct: 324 PLKVEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVTGHNSSVVSNA---RTSQF 380
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP------ 189
F+ + L I++++A +T L E + Y IG Y H D F P
Sbjct: 381 TFLPKTRHK--VLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETK 438
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
Q P+ R+ + L YL+D+E+GG T FP + ++P++
Sbjct: 439 QVSNPEMGNRIGTVLFYLSDVEQGGATAFP---------------ALKQLLRPKKHAAAF 483
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+Y+L +G D ++HG+CP++ G KWV +WIR+ Q D
Sbjct: 484 WYNLHASGVGDARTMHGACPIIVGSKWVLNRWIREFVQSD 523
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 87/163 (53%), Gaps = 22/163 (13%)
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP--RINGEAFNILRYKIGQKYNSHYDA 186
RTS+ VF+ + S +D++ +++A +T L + + + ++ Y +G Y H+D
Sbjct: 329 NFRTSNSVFL--LNNASYLVDILRQRVADMTHLNVFKNSSDDLQVMNYGLGGYYRYHFDF 386
Query: 187 FDPQEYGPQK--SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQ 244
F E P K R+ + L+Y+TD+++GG T+FP + + P++
Sbjct: 387 FGKDE-SPNKLLGDRIITVLIYMTDVQQGGATVFP---------------ALRITNFPKK 430
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G L+F +L N + DP+++H CPV+ G KW ATKWI EQ
Sbjct: 431 GSALIFRNLDNNISPDPSTLHAGCPVLFGSKWAATKWIYSAEQ 473
>gi|359400227|ref|ZP_09193216.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
gi|357598467|gb|EHJ60196.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
Length = 193
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 100/198 (50%), Gaps = 28/198 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
+F QC ++I + + RPST+A G+ V RTSS +S + G + +
Sbjct: 16 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSP---DVGAVAALA 67
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
K+ ++ + + E RY++GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 68 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 127
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
D++ GG T F K I ++P +G + + + P+G+++P ++H +
Sbjct: 128 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 172
Query: 268 CPVVKGEKWVATKWIRDQ 285
V +G K+V T+W R++
Sbjct: 173 MKVRQGRKYVVTQWFRER 190
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 101/218 (46%), Gaps = 24/218 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ LS P + + + I MA+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQG 375
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
+ + + ++ + ++ L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSF-PENHIYD 432
Query: 193 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
G + R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDDRGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWY 477
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+L +G D + H +CPV++G KW+A WIR++ Q++
Sbjct: 478 NLHESGDQDYRTKHAACPVLQGSKWIANVWIRERNQHN 515
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 94/211 (44%), Gaps = 28/211 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + +I A N++ + + K V RT+ G ++ +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNT--KIHKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQK 196
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ L YLTD+E+GG T+F D +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------GD--------VGYYVSPQAGTAIFWYNLDTD 486
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 98/205 (47%), Gaps = 28/205 (13%)
Query: 89 LYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGVFISAAEDES 145
LY + PE + +MA L+ +T+ + R+ E V RTS + +E
Sbjct: 334 LYHDVISAPE-ISQLQDMATPGLKRATVYKASGRRSEVVKT----RTSKVAWFPDTFNE- 387
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ---EYGPQKSQRVAS 202
+ + +IA +T + E + Y +G Y+ HYD F+ R+A+
Sbjct: 388 -LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLTQMNGDRIAT 446
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 262
L YLTD+E+GG T+FP I V P++G +++Y+L +G +P
Sbjct: 447 VLFYLTDVEQGGATVFP---------------NIRKAVFPQRGSAIIWYNLKDDGDPNPQ 491
Query: 263 SIHGSCPVVKGEKWVATKWIRDQEQ 287
++H +CPV+ G KWV KWIR++ Q
Sbjct: 492 TLHAACPVLVGSKWVCNKWIRERAQ 516
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 99/220 (45%), Gaps = 28/220 (12%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
P +V LS P + + + + ++ + K + +T+ V N RTS
Sbjct: 320 PLKVEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVTGNNASVVSNA---RTSQF 376
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
FI + L I++++A +T L + E + Y IG Y H D F P +
Sbjct: 377 TFIPKTRHK--VLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHYAQHMDWFSPNAFETK 434
Query: 193 ---GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ R+A+ L YLTD+E+GG T FP + +KP++
Sbjct: 435 QVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQL---------------LKPKKYAAAF 479
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+Y+L +G D ++HG+CP++ G KWV +WIR+ Q D
Sbjct: 480 WYNLHASGAGDVRTMHGACPIIVGSKWVLNRWIREFVQSD 519
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK LR +T++ ++ T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + N E+ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 208 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 264
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 265 KHVAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + PR+G +++L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWHNLKPNGEGDF 369
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 101/206 (49%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK LR +T++ ++ T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 94/211 (44%), Gaps = 28/211 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + +I A N++ + + K V RT+ G ++ +E
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNT--KIHKERAVPKKNRGRTAKGFWLKKESNE- 382
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQK 196
I +I +T + E F ++ Y IG Y H D FD Y
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ L YLTD+E+GG T+F D +G V P+ G + +Y+L +
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF-------GD--------VGYYVSPQAGTAIFWYNLDTD 486
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G DP + H +CPV+ G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 88/194 (45%), Gaps = 28/194 (14%)
Query: 103 IINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP 162
+I A N++ + + K V RT+ G ++ +E I +I +T
Sbjct: 2 LIGKAAQNMKNT--KIHKERAVPKKNRGRTAKGFWLKKESNE--LTKRITRRIMDMTGFD 57
Query: 163 RINGEAFNILRYKIGQKYNSHYDAFD---------PQEYGPQKSQRVASFLVYLTDLEEG 213
+ E F ++ Y IG Y H D FD Y R+A+ L YLTD+E+G
Sbjct: 58 LADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQG 117
Query: 214 GETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKG 273
G T+F +G V P+ G + +Y+L +G DP + H +CPV+ G
Sbjct: 118 GATVFG---------------DVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVG 162
Query: 274 EKWVATKWIRDQEQ 287
KWV T+WIR++ Q
Sbjct: 163 SKWVMTEWIREKRQ 176
>gi|89092696|ref|ZP_01165649.1| hypothetical protein MED92_15358 [Neptuniibacter caesariensis]
gi|89083208|gb|EAR62427.1| hypothetical protein MED92_15358 [Oceanospirillum sp. MED92]
Length = 441
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 99/207 (47%), Gaps = 26/207 (12%)
Query: 88 ALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGT 147
Y NF +PE+C +I + + + RPST G + + RTS +S E S
Sbjct: 72 VFYVNNFLSPEECAQMIELIQHHQRPSTTTNETG----HYKHYRTSKTCDLSLLE--STF 125
Query: 148 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-----EYGPQKSQRVAS 202
+ I+++I K+ + E Y IG+++ H D F+P+ E+ + QR +
Sbjct: 126 VAEIDQRICKMLGIEPSYSEGIQGQWYDIGEEFKPHTDYFEPKSDEFLEHAEARGQRTWT 185
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 262
F++YL + +EGG T FP +G + P QG +++ +L +G+ +P
Sbjct: 186 FMIYLNNTQEGGGTFFPE---------------LGQRFLPSQGKAVIWNNLTTDGSPNPA 230
Query: 263 SIHGSCPVVKGEKWVATKWIRDQEQYD 289
++H PV +G K + TKW R + D
Sbjct: 231 TLHHGEPVKRGYKAIITKWFRSKGTAD 257
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 103/217 (47%), Gaps = 19/217 (8%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P F ++ I+ ++ +L PS + L+ G RTS+ ++ ++
Sbjct: 3 PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHP- 61
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ------------EYG 193
+ I+++ A + +P + E+ +LRY+ Q Y+ H D F + EYG
Sbjct: 62 -VVQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYG 120
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-GLKVKPRQGDGLLFYS 252
R+ + Y++D+ +GG T F G+ S + C G+ V P++ ++FYS
Sbjct: 121 --YKNRMITVFWYMSDVAKGGHTNFARSGGLPRPSSN--KDCSQGISVAPKKRKVVVFYS 176
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+LPNG DP S+H CPV +G K KWI ++ + D
Sbjct: 177 MLPNGEGDPMSLHAGCPVEEGIKLSGNKWIWNKPRSD 213
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + N + + + + +AK LR +T++ V T R S ++ A E
Sbjct: 395 PHIVRYHNIVSEKDMEKVKELAKPRLRRATIS-NPVTGVLETAHYRISKSAWLGAYE--H 451
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 197
+D I + I VT L E + Y +G +Y H+D AF+ G
Sbjct: 452 PVVDKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTG---- 507
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
R+A++L+Y+TD++ GG T+F IG VKP++G + +Y+L P+G
Sbjct: 508 NRIATWLLYMTDVQAGGATVFT---------------DIGAAVKPKKGTAVFWYNLYPSG 552
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 553 EGDYRTRHAACPVLLGNKWVSNKWIHERGQ 582
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 110/241 (45%), Gaps = 43/241 (17%)
Query: 75 TNIPF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-----GETV 124
T IP+ +V+++ PR F + +P + + ++A ST+ L T
Sbjct: 351 TVIPYYKAKEEVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENTGPDGHVTY 410
Query: 125 DNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP------RINGEAFNILRYKIGQ 178
+R S ++ DE L +E +I T L R + E F +L Y +G
Sbjct: 411 GKLDNVRVSQTSWLGT--DEYPELSRLENRIKLTTGLSAEYKSVRSHSEKFQVLNYGVGG 468
Query: 179 KYNSHYDAF--------DPQEYGPQKS--QRVASFLVYLTDLEEGGETMFPFENGMNADG 228
Y HYD +P + ++ +R+A+++ YL D++ GG T+FP
Sbjct: 469 MYTVHYDYTGYMLGIPSNPLDSDDIRTSGERMATWMFYLNDVKAGGATVFP--------- 519
Query: 229 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
+ ++ +G +Y++ P+G DP ++HG CPV+ G KWV+ KWIR++ Q
Sbjct: 520 ------EVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKWVSNKWIREEGQM 573
Query: 289 D 289
D
Sbjct: 574 D 574
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 106/219 (48%), Gaps = 25/219 (11%)
Query: 74 VTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTS 133
++ + Q ++ P + + + + ++ +II+++K + S + + V T RTS
Sbjct: 304 ISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDDHEKAVSKT---RTS 360
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQ 190
S ++ + + ++ +T L E + Y IG Y HYD A + +
Sbjct: 361 SNAWLDDVMHP--VVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYDYAVAEEGK 418
Query: 191 EYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 248
E P K R+A+ + YL+D+ GG T+FP +GL V P++G +
Sbjct: 419 EVYPSIGKGNRIATVMYYLSDVAIGGATVFP---------------QLGLGVFPQKGSAI 463
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L NGT+D ++HG+CPV G KWV KWI ++ Q
Sbjct: 464 FWYNLHANGTVDHRTLHGACPVFVGSKWVGNKWIHERGQ 502
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 116/245 (47%), Gaps = 36/245 (14%)
Query: 55 ESVKDEYKWMPHG-----QAGDDSVTNI-PFQV--LSWMPRALYFPNFATPEQCKSIINM 106
E + EYK P G + G D I PF+V L+ P + + + + + M
Sbjct: 286 ECCRGEYK-PPKGLSCYYEYGADPFLRIAPFKVELLNRSPYVAAYYDVLNDSEIEELKLM 344
Query: 107 AKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGT--LDLIEEKIAKVTML--P 162
+ +R S L +D RTS+ VF+ +E+G L+ I ++ A +T L
Sbjct: 345 SSPQIRRSLL-YNHTLDIDQADVDRTSNSVFM----EETGITLLETISQRAADMTDLYVT 399
Query: 163 RINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFEN 222
I+ E ++ Y +G +Y H D FD + R+A+ L YLTD+++GG T+FPF
Sbjct: 400 AISSEDLQVINYGLGGQYTPHCDYFDE---NAENGDRLATVLFYLTDVQQGGATVFPF-- 454
Query: 223 GMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
+ L P++G L+F +L + D S H +CPV+ G KWVATKWI
Sbjct: 455 -------------LRLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKWVATKWI 501
Query: 283 RDQEQ 287
+Q
Sbjct: 502 YHFDQ 506
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ ++LS P + F + + + SI N+ K L + + G ++ RT+ G
Sbjct: 187 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 244
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
++ + + + + + +T + + F +L Y IG Y H+D + E
Sbjct: 245 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 300
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
S R+A+ + YL+D+ +GG T+FP +GL V P++G LL+Y+L
Sbjct: 301 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 345
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G D + H +CP V G +WV TKWI ++EQ
Sbjct: 346 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 376
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 90/196 (45%), Gaps = 30/196 (15%)
Query: 89 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 148
+ + NF + +C+ II++A ++ ST+ K V IRTS G F+ D +
Sbjct: 1 MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGV--VDDIRTSYGTFLRRVPDP--VI 56
Query: 149 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLT 208
IE ++A + LP + E +LRY KY H D +RVA+ L+YL
Sbjct: 57 AAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHIDGL----------ERVATVLIYLG 106
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN-GTIDPTSIHGS 267
E + A G Y KP++GD L+F+ +P+ D S+H
Sbjct: 107 QAERANLSQC-------ARGRVAY--------KPKRGDALMFFDTMPDYKQTDVHSMHTG 151
Query: 268 CPVVKGEKWVATKWIR 283
CPVV+G KW A KW+
Sbjct: 152 CPVVEGVKWNAVKWLH 167
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 104/213 (48%), Gaps = 35/213 (16%)
Query: 78 PF--QVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKGETVDNTQGIRTSS 134
PF ++LS P + + + TP + ++ N++K L R + + + + RTS+
Sbjct: 227 PFKTELLSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 286
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
V++++ E+ ++ +E ++ +T N E + ++ Y IG Y H D F+ P
Sbjct: 287 SVWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TP 340
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
Q L+D+ +GG T+FP N + V+PRQGD LL+Y+L
Sbjct: 341 Q-----------LSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 374
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G + ++H SCP++KG KW KWI + Q
Sbjct: 375 DRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 407
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 93/196 (47%), Gaps = 34/196 (17%)
Query: 99 QCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLIE 152
+C+ +I L+PS + VD G +RTS I A + T L +
Sbjct: 189 ECRYLITKFNALLKPSMV-------VDPVTGRGKIDSVRTSYVAVIEPAHCDWITRKL-D 240
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF----DPQEYGPQKSQRVASFLVYLT 208
+ I+++T R NGEA N+LRY GQ+Y HYD D + K QR+ + LVYL
Sbjct: 241 KTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDALMFKDGK-QRIKTALVYLN 299
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
+ EGGET+FP + +++ P+ G ++F + NG + S H
Sbjct: 300 TISEGGETLFPK---------------LDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGA 344
Query: 269 PVVKGEKWVATKWIRD 284
P V KW+ TKWIR+
Sbjct: 345 PTVSENKWLVTKWIRE 360
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 80/149 (53%), Gaps = 18/149 (12%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP---QKSQ 198
+++ + + ++I+ VT L E I Y +G +Y H+D ++G +
Sbjct: 45 DEDHPVIKRVCQRISDVTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEVGN 104
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+A+FL Y++++E+GG T+F G+ V+P +G + +Y+LLP+G
Sbjct: 105 RIATFLTYMSNVEQGGSTVFLHP---------------GIAVRPIKGSAVFWYNLLPSGA 149
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KWI +++Q
Sbjct: 150 GDERTRHAACPVLTGVKWVSNKWIHERDQ 178
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 88/178 (49%), Gaps = 24/178 (13%)
Query: 119 RKGETVDNTQGI---RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYK 175
R G +++T + RTS +FI+A + L I++++A +T L E + Y
Sbjct: 362 RAGVVINSTSTVSKKRTSQHIFIAATRHK--VLRTIDQRVADMTNLNMQYAEDHQLADYG 419
Query: 176 IGQKYNSHYDAFDPQEYGPQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYD 231
IG Y+ H+D F + K R+A+ L YL+D+ +GG T FP +
Sbjct: 420 IGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPILKQL------- 472
Query: 232 YQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+KP++ +Y+L +G D ++HG CP++ G KWV +WIR+ +Q D
Sbjct: 473 --------LKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWIREYDQSD 522
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 100/207 (48%), Gaps = 24/207 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 144
P + + + + E+ + I +AK L +T+ R +T V T R S ++ ED
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRLARATV--RDPKTGVLTTANYRVSKSAWLEGEEDP 394
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRV 200
+D + ++I +T L E + Y +G +Y H+D E K RV
Sbjct: 395 --VIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRV 452
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+FL Y++D+E GG T+FP D+ G + PR+G + +Y+L +G D
Sbjct: 453 ATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTSVFWYNLFRSGEGD 497
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWIHERGQ 524
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 102/225 (45%), Gaps = 38/225 (16%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA---------------LRKGETVDNTQGI 130
PR + + T ++ + I ++K LR +T++ R+ D G
Sbjct: 301 PRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGK 360
Query: 131 RTSSGVFISA----AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T++ +S A E +D I ++I +T L E + Y +G +Y H+D
Sbjct: 361 LTTAQYRVSKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDF 420
Query: 187 FDPQEYGPQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242
E K R+A++L Y++D+ GG T+FP +G VKP
Sbjct: 421 GRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPE---------------VGAAVKP 465
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G + +Y+L P+G D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 466 LKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 510
>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
Length = 561
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 96/213 (45%), Gaps = 30/213 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + T Q + + + + L +T R GE IRTS ++ E E+
Sbjct: 349 PMIVVLHDLITERQTEILRQLGEPKL--ATSLHRGGEGKFVRSMIRTSKNAWLQ--EHEN 404
Query: 146 GTLDLIEEKIAKVTML---PRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ---- 198
+L I ++ T L P E F I Y IG Y +H D + P+
Sbjct: 405 ASLPAIRHRMELATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHPDVRPEDQDPWNL 464
Query: 199 ----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
R+A+ +VYL+D+E GG T+FP G+ PR+G +++L
Sbjct: 465 YVGDRIATLMVYLSDVEAGGATVFPR---------------AGVTCWPRKGSAAFWWNLY 509
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G D T+ HG+CPV+ G KWV+ KWIR +Q
Sbjct: 510 KSGEPDLTTRHGACPVLHGSKWVSNKWIRQYDQ 542
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ + + ++K LR +T++ V T R S ++S E+
Sbjct: 344 PRIVRFLDIISNEEIEKVKELSKPRLRRATIS-NPITGVLETAHYRISKSAWLSGYENP- 401
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 402 -VVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 460
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 461 TWLFYMSDVAAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 505
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 506 STRHAACPVLVGNKWVSNKWIHERGQ 531
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ ++LS P + F + + + SI N+ K L + + G ++ RT+ G
Sbjct: 310 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 367
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
++ + + + + + +T + + F +L Y IG Y H+D + E
Sbjct: 368 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 423
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
S R+A+ + YL+D+ +GG T+FP +GL V P++G LL+Y+L
Sbjct: 424 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 468
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G D + H +CP V G +WV TKWI ++EQ
Sbjct: 469 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 499
>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
CCMP1335]
Length = 248
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 99/231 (42%), Gaps = 34/231 (14%)
Query: 72 DSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 131
++ +I + +S PR +F + + I+ + + + R
Sbjct: 30 NATLDITLRTVSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTR 89
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTML-------------PRIN-----GEAFNILR 173
TS +I +E+ +D I ++A V + PR+ E ++
Sbjct: 90 TSMNTWI--YREETAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVH 147
Query: 174 YKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQ 233
Y G++Y +H+D P + R + L+YL D+EEGGET FP
Sbjct: 148 YDPGEEYTAHHDFGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP-------------- 193
Query: 234 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
+ GL VKP +G +LFY L +G D S H + PV+KGEKW++ WI D
Sbjct: 194 RWGGLDVKPVKGKAVLFYMLTADGNSDDLSQHAALPVIKGEKWMSNLWIWD 244
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 18/140 (12%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVASFLVYL 207
+ +IA +T + E + Y +G Y+ HYD F+ R+A+ L YL
Sbjct: 357 LNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTAANLTQMNGDRIATVLFYL 416
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
TD+E+GG T+FP I V P++G +++Y+L +G +P ++H +
Sbjct: 417 TDVEQGGATVFP---------------NIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAA 461
Query: 268 CPVVKGEKWVATKWIRDQEQ 287
CPV+ G KWV KWIR++ Q
Sbjct: 462 CPVLVGSKWVCNKWIRERAQ 481
>gi|313768105|ref|YP_004061536.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
gi|312599712|gb|ADQ91733.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
Length = 197
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 98/214 (45%), Gaps = 35/214 (16%)
Query: 77 IPFQVLSWM-------PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG 129
+ F ++ W PR L N + ++CK I ++A L+ ST+++ + D +
Sbjct: 8 VSFLLIIWFFIPIYEKPRVL--KNVLSEDECKHIQDIASKKLQTSTVSMSR----DIDEK 61
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
IR S ++ A+ED +D + K +T P N E +L+YK G Y H D F
Sbjct: 62 IRKSETAWLKASED--PVVDKLIRKCVSMTDRPLHNCEDLQVLKYKPGGFYKPHQDCF-- 117
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
K++R+ +F++ L D EGGET FP N Y +K GD L
Sbjct: 118 ---KNDKNKRMYTFIIALNDEYEGGETEFP-----NIKRRYRLEK----------GDALF 159
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
F +L ++HG PV GEKWV WIR
Sbjct: 160 FNTLNNYECTTKQALHGGAPVKSGEKWVCNLWIR 193
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 67/225 (29%), Positives = 104/225 (46%), Gaps = 34/225 (15%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL--ALRKGETVDNTQGIRTSS 134
I + S+ P F + + E+ ++I +AK L S + L G V N +RTS
Sbjct: 310 IKIEQHSFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVSN---VRTSK 366
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLP----RINGEAFNILRYKIGQKYNSHYDA---- 186
++ E L+ + +I +T L R E + Y IG Y+ H+D
Sbjct: 367 TAWLP--EGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMKD 424
Query: 187 ---FDPQEYGP-QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242
F+ + Q R+A+F+ YL D+E GG T FP G+ VKP
Sbjct: 425 KADFEYMHHRELQAGDRIATFMFYLNDVERGGSTAFP---------------RAGVAVKP 469
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G +++L +G DP ++HG+CPV+ G KWV+ KWIR+ Q
Sbjct: 470 VKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWIRETAQ 514
>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 750
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 110/233 (47%), Gaps = 56/233 (24%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLA---LRKGETVDNTQGIRTSSGVFISAAEDESGT 147
F +F + +C ++ +A +LR S + L +G RTSS F++ + E
Sbjct: 534 FDHFLSAVECDDLVAIAAPDLRRSRVTDGKLSEG---------RTSSSTFLTGCKQEEPL 584
Query: 148 LDLIEEKIAK-------VTMLPRI--------------------------NGEAFNILRY 174
+ IE+++ + + P + E ++RY
Sbjct: 585 VRAIEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRY 644
Query: 175 KIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNA-DGSYDYQ 233
GQ Y +HYD + G + R A+F++YLTD+ GG T FP ++ DG D
Sbjct: 645 TEGQMYTAHYD----NKQGCLR--RTATFMMYLTDVHSGGATHFPRAVPVSMRDGCGDA- 697
Query: 234 KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
G+++ P++G L+F+S + G D S+H + PV++GEKW+ATKW+R+ E
Sbjct: 698 --AGIRIWPKRGRALVFWS-VSGGIEDVRSLHEAEPVIEGEKWIATKWLREDE 747
>gi|156405954|ref|XP_001640996.1| predicted protein [Nematostella vectensis]
gi|156228133|gb|EDO48933.1| predicted protein [Nematostella vectensis]
Length = 182
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 85/166 (51%), Gaps = 15/166 (9%)
Query: 133 SSGVFISAAEDESGT-LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE 191
SS +++ ED T L I + K++ E + +YK+GQKY+ HYD+
Sbjct: 9 SSSLYLKNKEDSKITILRDIAQLAGKLSNTQWRFAEPVALTKYKVGQKYSLHYDS--GFL 66
Query: 192 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGM--------NADGSYDYQKCIG----LK 239
++ +R A+FLVYL D++ GGET+FP + N D C +K
Sbjct: 67 MNQRRVKRTATFLVYLNDVKSGGETIFPLATNISSIQLKKENVDKPSLDSICGKENNMVK 126
Query: 240 VKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
V P LLF++ + +D S+HGSCPVV GEKW+A W+ ++
Sbjct: 127 VSPEAQSCLLFWNHVDGDDVDAFSLHGSCPVVSGEKWIAQIWLHNE 172
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 96/215 (44%), Gaps = 22/215 (10%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ L P + + +S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQG 375
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEY-- 192
+ + + L+ + + L E + Y IG Y H+D+F D Y
Sbjct: 376 ASFNYSRN--AATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQE 433
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y+
Sbjct: 434 GDLHGNRIATAIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 478
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 490
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 108/214 (50%), Gaps = 32/214 (14%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINM---AKLNLRPSTLALRKGETVDN--TQGIRTSSGV 136
+S P + F NF T E+C +I + AK ++ + D+ ++G RTS
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKG-RTSENA 342
Query: 137 FISAAED--ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE--- 191
+ S + + T LI ++I+ VT +P + E F IL+Y+ GQ Y SH+D + QE
Sbjct: 343 WCSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRR 402
Query: 192 YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
GP RV +F +YL+D+EEGG+T FP + + VKP++G +L+
Sbjct: 403 CGP----RVLTFFLYLSDVEEGGDTNFPK---------------LSIAVKPKKGSAVLWP 443
Query: 252 SLLPNG--TIDPTSIHGSCPVVKGEKWVATKWIR 283
S+L + DP + H + VV G K+ A W+
Sbjct: 444 SVLDSNPSMKDPRTDHEAQEVVNGTKFGANAWLH 477
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V PR+G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
Length = 422
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/201 (30%), Positives = 88/201 (43%), Gaps = 38/201 (18%)
Query: 98 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAK 157
++C+ ++ +A+ +LR S + + T IRTS G TLD I E A
Sbjct: 244 DECRLLMLLARPHLRASQVVDPNDASTHRTP-IRTSRG----------ATLDPILEDFAA 292
Query: 158 VTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVASFLV 205
LP + EA ++L Y G+ Y +H D P P R+ + V
Sbjct: 293 RAAQARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACV 352
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D++ GGET FP G++V+PR G + F +L +G DP S+H
Sbjct: 353 YLNDVDAGGETEFPVA---------------GIRVQPRAGSVVCFDNLQADGCPDPDSLH 397
Query: 266 GSCPVVKGEKWVATKWIRDQE 286
PV G KW+ T W R Q
Sbjct: 398 AGLPVTTGSKWLGTLWFRQQR 418
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 97/216 (44%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ L P + + +S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLLVQLHQVIGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQG 375
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
+ + S L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSR--SAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSF-PENHVYQ 432
Query: 193 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWY 477
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 109/247 (44%), Gaps = 40/247 (16%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI---RTS 133
+ + LS PR NF E+ SII A L + L++ T + I RTS
Sbjct: 205 VVLETLSHSPRVFSLYNFMDMEEADSIIEDA-LGMTQEAYRLKRSSTGTKGKAISKTRTS 263
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRIN---GEAFNILRYKIGQKYNSHYDAFDPQ 190
F++ + T ++ +I ++ + + + +LRY Q Y +H+D +
Sbjct: 264 DNAFVT----HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLESA 319
Query: 191 EYGPQKSQ-----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--------- 236
E KS+ R A+ ++Y D+ EGGET+F G++ D + +
Sbjct: 320 EGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLENLDL 379
Query: 237 ---------------GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 281
+ V P++G +LFY+ P+G D +S HG+CPV+ G+KW A W
Sbjct: 380 PRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAANLW 439
Query: 282 IRDQEQY 288
+ + +Y
Sbjct: 440 VWNGPRY 446
>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
Length = 446
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 83/158 (52%), Gaps = 21/158 (13%)
Query: 131 RTSSGVFISAAEDESGTL-DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
R+ VFI E E G L IE ++ ++ L + +++ Y IG Y H+D+F
Sbjct: 296 RSGKNVFI---ELEKGELVKTIEMRVTDMSGLSMEGSDDLSLINYGIGGHYIPHHDSFSE 352
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+E + R+A+ L YL+D+E GG T FP N L + P +G +L
Sbjct: 353 EE--NKTEDRIATALFYLSDVELGGATTFPLLN---------------LTISPEKGTAVL 395
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+++L +GT P ++H +CPV+ G K+V TKWI + +Q
Sbjct: 396 WHNLKDSGTPHPKTVHAACPVIVGSKYVMTKWIYNMDQ 433
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 102/213 (47%), Gaps = 21/213 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I ++LS P + + +P++ I + +K + PS + V RTS V
Sbjct: 322 IKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFPSETVNAANDFV--VSKFRTSKSV 379
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYGP 194
++ +E+ T+ L + ++A T L + E F ++ Y IG + SH+D D +
Sbjct: 380 WLDRDANEA-TVKLTQ-RLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVG 437
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
R+A+ L YL D+ +GG T FP G+N + V PR G L +Y+L
Sbjct: 438 GFIDRIATTLFYLNDVPQGGATHFP---GLN------------ITVFPRLGAALFWYNLD 482
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 483 TQGMLQVRTMHTGCPVIVGSKWVVSKWIDDKGQ 515
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 100/222 (45%), Gaps = 19/222 (8%)
Query: 67 GQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDN 126
GQ + + + S P + + + +P+Q + MA +++ ST+ G
Sbjct: 313 GQHAYRVLAPLKLEEHSLDPLVVSYHDMLSPQQIIELRQMAVPHMKRSTVNPLPGRQSKK 372
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
+ R S ++ D + + ++ T L E + Y +G Y H+D
Sbjct: 373 S-AFRVSKNAWLEY--DTHPMMGRMLRDLSDATGLDMTYCEQLQVANYGVGGHYEPHWDF 429
Query: 187 F-DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 245
F D Q Y ++ R+A+ + YL+D+E+GG T FPF N V+P+ G
Sbjct: 430 FVDSQHYPAEEGNRIATAIFYLSDVEQGGATAFPFLN---------------FAVRPQLG 474
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ L +Y+L + +D + H CPV+KG KW+A WI + Q
Sbjct: 475 NILFWYNLHRSLDMDYRTKHAGCPVLKGSKWIANIWIHEATQ 516
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 101/213 (47%), Gaps = 25/213 (11%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+VLS P + + +F + + + I A+L LR S +A + T R S ++
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVATGDKQA---TAEYRISKSAWLK 392
Query: 140 AAEDESGTLDLIEEKIAKVTML--PRINGEAFNILRYKIGQKYNSHYD-AFDPQE--YGP 194
+ T+ +++KI+ +T L +GE ++ Y IG Y H+D A P +
Sbjct: 393 GSAH--STVSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 450
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
+ RVA+F++YL+ +E GG T F + N V + + +++L
Sbjct: 451 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 495
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NG D ++H CPV+ G+KWVA KWI + Q
Sbjct: 496 RNGEGDADTLHAGCPVLIGDKWVANKWIHEYGQ 528
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + N ++ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 208 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 264
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 265 KHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 369
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 76/164 (46%), Gaps = 24/164 (14%)
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
RTS + + +D IE +IA++ P +GE +LRY G +Y HYD
Sbjct: 2 VHAARTSDSMCLRVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDY 59
Query: 187 FDPQEYGP-----QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK 241
FDP G QRVAS ++YL E GG T FP + L V
Sbjct: 60 FDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVA 104
Query: 242 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+G+ + F P+ S+H PV+ GEKWVATKW+R++
Sbjct: 105 AVKGNAVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLRER 146
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 103/216 (47%), Gaps = 25/216 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ +++ P + + + +P + + MAK L+ + R + NT + +
Sbjct: 279 LKMELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRA----RVYNSTKNTDQLSKTRTA 334
Query: 137 FISAAEDESGTL-DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
++ D L + + ++I +T E ++ Y +G Y H+D F+ + GP
Sbjct: 335 KLAWFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTK-GPH 393
Query: 196 KSQ----RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+Q R+A+ L YL D+E+GG T+FP I V P++G +++Y
Sbjct: 394 ITQINGDRIATVLFYLNDVEQGGATVFP---------------EIKKAVFPKRGSAIMWY 438
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L +G + ++H CPV+ G KWV KWIR++EQ
Sbjct: 439 NLKDDGEGNRDTLHAGCPVIVGSKWVCNKWIREREQ 474
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + +F + E+ + I +AK L +T+ K V T R S ++ ED
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKLARATVRDPKS-GVLTTASYRVSKSAWLEGEEDP- 396
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + ++I +T L E + Y +G +Y H+D E K RVA
Sbjct: 397 -IIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVA 455
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+FL Y++D+E GG T+FP D+ G + PR+G + +Y+L +G D
Sbjct: 456 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFKSGEGDY 500
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 501 RTRHAACPVLVGNKWVSNKWIHERGQ 526
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + N ++ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 330 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQ--EHEH 386
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + ++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 387 KHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 446
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 447 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 491
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 492 KTRHAACPVLTGSKWVANKWLHERGQ 517
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 106/217 (48%), Gaps = 27/217 (12%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTS 133
+ +++ P + + + + + + MA +L+ +T+ +L K E V RTS
Sbjct: 318 LKMEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKT----RTS 373
Query: 134 SGVFISAAEDESGTLDL-IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY 192
++ D +L L + +I +T E ++ Y +G Y+ HYD F+ E
Sbjct: 374 K---VAWFPDSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEK 430
Query: 193 GPQKS-QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+ R+A+ L Y++D+E+GG T+FP I V P++G +++Y
Sbjct: 431 SSSLTGDRIATVLFYMSDVEQGGATVFP---------------NIYKTVYPQRGTAVMWY 475
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
+L +G D ++H +CPV+ G KWV KWIR++ Q+
Sbjct: 476 NLKDDGQPDEQTLHAACPVLVGSKWVCNKWIRERAQF 512
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 95/214 (44%), Gaps = 26/214 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LS P + + + + +I + L+ +T+ V N +RTS F+
Sbjct: 298 LSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATITSTNESVVSN---VRTSQFTFLPVT 354
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQ 195
ED+ L I+ ++A +T E Y IG Y H D F + P+
Sbjct: 355 EDK--VLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGLVSSPE 412
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R+A+ L YL+D+ +GG T FP + + +KP++ +Y+L
Sbjct: 413 MGNRIATVLFYLSDVTQGGGTAFPH---------------LRVLLKPKKYAAAFWYNLHA 457
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+G DP + HG+CP++ G KWV +WIR+ Q D
Sbjct: 458 SGVGDPRTQHGACPIISGSKWVQNRWIREFIQSD 491
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 102/213 (47%), Gaps = 25/213 (11%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+ LS P + + +F + + + I + A+L LR S +A R + T R S ++
Sbjct: 81 ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQV---TAEYRISKSAWLK 137
Query: 140 AAEDESGTLDLIEEKIAKVTML--PRINGEAFNILRYKIGQKYNSHYD-AFDPQE--YGP 194
+ + + ++++I+ +T L +GE ++ Y IG Y H+D A P +
Sbjct: 138 GSAQSA--VSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 195
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
+ RVA+ ++YL+ +E GG T F + N V + + +++L
Sbjct: 196 KTGNRVATVMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 240
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NG DP ++H CPV+ G+KWVA KWI + Q
Sbjct: 241 RNGRGDPDTLHAGCPVLIGDKWVANKWIHEYGQ 273
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 96/216 (44%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ L P + +S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 193 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWY 477
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
Length = 496
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 94/197 (47%), Gaps = 28/197 (14%)
Query: 88 ALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGT 147
ALY + EQ ++ L S L ++G D IRT + S A + + T
Sbjct: 312 ALYHEVVSAAEQRHLML------LSESQLQRQRGHQYDK---IRTFASA--SVAANATPT 360
Query: 148 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ--KSQRVASFLV 205
++ + ++ +T L E IL Y IG +Y H D PQ + K R+A+ L+
Sbjct: 361 VEQLHRRLEDITGLDLAESEPLRILNYGIGGQYYIHVDCEQPQTHVEPYPKEYRLATVLL 420
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL+D+ GG T FP +GL ++P +G L++++ G D ++H
Sbjct: 421 YLSDVRLGGFTSFP---------------ALGLGIRPNRGSALVWHNANNAGNCDYRALH 465
Query: 266 GSCPVVKGEKWVATKWI 282
+CPV+ G +WVA+KWI
Sbjct: 466 AACPVLLGTRWVASKWI 482
>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
Length = 190
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 98/209 (46%), Gaps = 27/209 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + +C II + +R S + G+ T RTS ++ + S
Sbjct: 1 PRVFLVREMLSEFECDHIIELGTKVVRKSMV----GQGGGFTSKTRTSENGWLRRSA--S 54
Query: 146 GTLDLIEEKIAKVTMLPR------INGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQR 199
L+ I ++ V + N E ++RY Q+Y H+D D + PQ QR
Sbjct: 55 PILENIYKRFGDVLGIDHDLLRSGKNAEELQVVRYDRSQEYAPHHDFGD--DGTPQ--QR 110
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+ L+Y+ EEGG T FP N DG +G++V P +GD +LFYS+LP+G
Sbjct: 111 FLTLLLYIQLPEEGGATSFPKAN----DG-------MGVQVVPARGDAVLFYSMLPDGNA 159
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
D ++H PV KG+KWV W+ D ++
Sbjct: 160 DDLALHAGMPVRKGQKWVCNLWVWDPHRH 188
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + E+ +++ +AK L +T+ + + T R S ++S E+
Sbjct: 343 PRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKL-TTAHYRVSKSAWLSGYENP- 400
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 401 -IVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 459
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V PR+G + +Y+L P+G D
Sbjct: 460 TWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPSGEGDY 504
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 505 STRHAACPVLVGNKWVSNKWIHERGQ 530
>gi|85708137|ref|ZP_01039203.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
gi|85689671|gb|EAQ29674.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
Length = 211
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 104/230 (45%), Gaps = 38/230 (16%)
Query: 72 DSVTNIPFQVLSWMPRALYFPN----------FATPEQCKSIINMAKLNLRPSTLALRKG 121
D++T+ Q LS A P+ F +PE C +I + L RPST+A G
Sbjct: 2 DAMTDSVSQRLSRFDTAQRVPHPRIEQFQVKGFISPELCAELIRLIDLGRRPSTIADANG 61
Query: 122 ETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYN 181
+ RTS + A E + +E + + + GE RY +GQ++
Sbjct: 62 DDY-----FRTSETCDLDANET---AVKELEARFFSLNGIDPKYGEPVQGQRYDVGQEFK 113
Query: 182 SHYDAFDP-----QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI 236
+H D F P +++ + QR +F++YL ++E GG T F K I
Sbjct: 114 AHTDYFTPGGADFEKFCAESGQRTWTFMIYLNEVEAGGATRF---------------KVI 158
Query: 237 GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+P G + + + P+G+++P ++H V KG K+V TKW R++E
Sbjct: 159 KKSFQPETGKLVCWNNARPDGSVNPATLHHGMKVRKGVKYVITKWYRERE 208
>gi|156406532|ref|XP_001641099.1| predicted protein [Nematostella vectensis]
gi|156228236|gb|EDO49036.1| predicted protein [Nematostella vectensis]
Length = 410
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 92/208 (44%), Gaps = 30/208 (14%)
Query: 87 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAED--- 143
R L + TP C ++ + NL+ LA + E D T G + F D
Sbjct: 160 RRLEILDKMTPIMCFDGVDTLRKNLKELKLAYKVSER-DFTMGTTCLNETFSGKLRDHFK 218
Query: 144 ----------ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
E+ + ++I T L NG F I Y IG Y SH D E
Sbjct: 219 WSHSMAFYTGENKFSTMYAKRIQAATGLREENGGKFQITGYPIGVGYKSHTDCV-VYEGE 277
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
P+K R A+ LVYL D+EEGGET FP +G+KVKP++G L++ S+
Sbjct: 278 PEKRDRYATILVYLQDVEEGGETDFPL---------------LGIKVKPKKGLALVWNSM 322
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKW 281
G DP S+H + V KG K++ +W
Sbjct: 323 DARGNCDPLSLHDAKQVTKGHKYIIQRW 350
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 99/211 (46%), Gaps = 35/211 (16%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LSW PR + F + E+ +I+ LR T + G+ TQ +G
Sbjct: 61 LSWQPRVFLYRGFLSEEESDHLIS-----LRKDTSEVTSGDADGKTQLDPVVAG------ 109
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
IEEKI+ T LPR NG + + Y +K D F + + +A
Sbjct: 110 ---------IEEKISAWTFLPRENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLRESLLA 159
Query: 202 SFLVYLTDLEEGGETMFPF-----ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
+ ++YL++ +GGE +FP + + DG+ ++P +G+ +LF+S L N
Sbjct: 160 TVVLYLSNTTQGGELLFPNSEVKPKKSCSEDGNI---------LRPVKGNAVLFFSRLLN 210
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
++D TS H CPVVKGE VATK I ++Q
Sbjct: 211 ASLDETSTHLICPVVKGELLVATKLIYAKKQ 241
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVSNKWLHERGQ 524
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 337 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 393
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 394 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V PR+G + +Y+L P+G D
Sbjct: 454 TWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPSGEGDY 498
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV KW+ ++ Q
Sbjct: 499 STRHAACPVLVGNKWVFNKWLHERGQ 524
>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 188
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 97/208 (46%), Gaps = 29/208 (13%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 140
VL+ P NF TP +C+ +I+MA+ + P+ + G+ RTSS ++S
Sbjct: 1 VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVV---GKGAGEVSPSRTSSTCYLSR 57
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEYGPQ 195
+ L + K++ +T P + E + RY Q+Y HYDAFD +
Sbjct: 58 ED-----LPDLMRKVSSLTGKPIEHCELPQVGRYFPSQQYLQHYDAFDLGTEDGLRFAAN 112
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
QR + L+YL D+ GG T FP + L V+PRQG L+F+
Sbjct: 113 GGQRTITVLLYLNDVARGGATRFP---------------ALNLDVQPRQGMALVFFPATI 157
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G +D ++H + P V K+V+ WIR
Sbjct: 158 DGMLDRMALHAAMPAVD-TKYVSQVWIR 184
>gi|334140935|ref|YP_004534141.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
gi|333938965|emb|CCA92323.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
Length = 209
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 99/198 (50%), Gaps = 28/198 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
+F QC ++I + + RPST+A G+ V RTSS +S + + +
Sbjct: 32 DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSP---DVPAVAALA 83
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
K+ ++ + + E RY++GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 84 RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 143
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
D++ GG T F K I ++P +G + + + P+G+++P ++H +
Sbjct: 144 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 188
Query: 268 CPVVKGEKWVATKWIRDQ 285
V +G K+V T+W R++
Sbjct: 189 MKVRQGRKYVVTQWFRER 206
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 17/146 (11%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+ E L ++ +++A +T L E F ++ Y IG Y H+D P R+
Sbjct: 380 DQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF--QSTVDPAIGSRIE 437
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L YL+D+E+GG T+FP I + V P++G +++++L P+G D
Sbjct: 438 TVLFYLSDVEQGGATVFP---------------EIQVSVWPQKGSAVVWFNLHPSGDGDQ 482
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H CPV+ G KW+ATKWI ++ Q
Sbjct: 483 RTKHAGCPVLIGSKWIATKWIHERGQ 508
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 91/207 (43%), Gaps = 20/207 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + ++C+ ++ +A+ +LR S + + + +RTS G + ++
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVAS 202
+ ++A LP + E ++L Y G++Y +H D P P R +
Sbjct: 287 AA-RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRT 345
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 262
VYL D+ GGET FP G++V+PR G + F +L +G D
Sbjct: 346 VCVYLNDVGAGGETEFPV---------------AGVRVRPRPGTLVCFDNLHADGRPDAD 390
Query: 263 SIHGSCPVVKGEKWVATKWIRDQEQYD 289
S+H PV G KW+ T W R Q D
Sbjct: 391 SLHAGLPVTAGSKWLGTLWFRQQRYRD 417
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 91/207 (43%), Gaps = 20/207 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + ++C+ ++ +A+ +LR S + + + +RTS G + ++
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVAS 202
+ ++A LP + E ++L Y G++Y +H D P P R +
Sbjct: 287 AA-RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRT 345
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 262
VYL D+ GGET FP G++V+PR G + F +L +G D
Sbjct: 346 VCVYLNDVGAGGETEFPV---------------AGVRVRPRPGTLVCFDNLHADGRPDAD 390
Query: 263 SIHGSCPVVKGEKWVATKWIRDQEQYD 289
S+H PV G KW+ T W R Q D
Sbjct: 391 SLHAGLPVTAGSKWLGTLWFRQQRYRD 417
>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
ATCC 15444]
Length = 298
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 99/209 (47%), Gaps = 33/209 (15%)
Query: 86 PRA-LY-FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQG--IRTSSGVFISAA 141
P+A LY +PNF PE C ++I + LR ST T D IRTS I
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRAST-------TTDAFADPKIRTSRSSDIGTM 152
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQK 196
+ ++E IA+ + +A RY + Q+Y +HYD F P Q +
Sbjct: 153 G--HNLVMQLDELIAEALGIHWSYSDATQTQRYDVNQEYKAHYDYFTPGTRDYQVHCQFT 210
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
QR +F++YL D+EEGG T F + + + P +G +++ +L P+
Sbjct: 211 GQRTWTFMIYLNDVEEGGGTRF---------------RRLEKTIMPEKGKAVIWNNLNPD 255
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
G+++P +IH V G K+V TKW R++
Sbjct: 256 GSVNPYTIHHGMKVRSGAKYVITKWFRER 284
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 17/146 (11%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+ E L ++ +++A +T L E F ++ Y IG Y H+D P R+
Sbjct: 374 DQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF--QSTVDPAIGSRIE 431
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L YL+D+E+GG T+FP I + V P++G +++++L P+G D
Sbjct: 432 TVLFYLSDVEQGGATVFP---------------EIQVSVWPQKGSAVVWFNLHPSGDGDQ 476
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H CPV+ G KW+ATKWI ++ Q
Sbjct: 477 RTKHAGCPVLIGSKWIATKWIHERGQ 502
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 290 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 346
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 347 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 406
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 407 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 451
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 452 STRHAACPVLVGNKWVSNKWLHERGQ 477
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 98/207 (47%), Gaps = 25/207 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA--AED 143
P+ N T + + I +A+ LR + + E+ +G S + SA +
Sbjct: 322 PKLWVLHNILTDPEMEVIKKLAQPRLRRARV-----ESPTTGEGELASYRISKSAWLYDW 376
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYG--PQKSQRV 200
E + + +++ VT L E ++ Y IG Y H+D A +E+ P + R+
Sbjct: 377 EHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALDPNEGDRI 436
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ L Y++D+E GG T+FP +G +V P +G G +Y+LL +G D
Sbjct: 437 ATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNLLKSGEGD 481
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H CPV+ G KWV+ KWI ++ Q
Sbjct: 482 MLTEHAGCPVLVGSKWVSNKWIHERGQ 508
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 101/206 (49%), Gaps = 24/206 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 144
P + + N + ++ + I +AK L +T+ R +T V T R S ++ +D
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRLARATV--RDPKTGVLTTAPYRVSKSAWLEGEDDP 392
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ---KSQRVA 201
+D + ++I +T L E + Y +G +Y H+D F + + R+A
Sbjct: 393 --VIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRPFDSNLKVDGNRLA 449
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+FL Y++D+E GG T+FP D+ G + PR+G + +Y+L +G D
Sbjct: 450 TFLNYMSDVEAGGATVFP-----------DF----GASIWPRKGTAVFWYNLFRSGEGDY 494
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 RTRHAACPVLVGSKWVSNKWIHERGQ 520
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + E+ +++ +AK L +T+ + + T R S ++S E S
Sbjct: 317 PRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKL-TTAHYRVSKSAWLSGYE--S 373
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 374 PVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 433
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L P+G D
Sbjct: 434 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDY 478
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 479 STRHAACPVLVGNKWVSNKWLHERGQ 504
>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
Length = 195
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 95/215 (44%), Gaps = 33/215 (15%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 140
VL P + TP +C+ +I ++K +++P+ GE + F
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGERKSD----------FTWL 56
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQ 195
G + + E IA LP + E RY++G K+++H D + D + Q
Sbjct: 57 PHYSHGLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
QR+ + ++YL + GGET FP + L V P +G LL +
Sbjct: 117 GGQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGK-LLVFENCK 160
Query: 256 NGTIDPT--SIHGSCPVVKGEKWVATKWIRDQEQY 288
GT +P S+H C V +GEKW+AT W R++ QY
Sbjct: 161 RGTNEPHPLSLHEGCAVHEGEKWIATLWFREKPQY 195
>gi|395492951|ref|ZP_10424530.1| putative oxygenase [Sphingomonas sp. PAMC 26617]
Length = 226
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 92 PNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 151
P+F C + M N R ST+ D Q RTS + + S +
Sbjct: 46 PDFLDAATCAKFVEMIDANRRRSTVLAD-----DAVQAFRTSESCDM---DRWSPDVRPT 97
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASFLVY 206
+E IA + + ++GE RY +GQ + +H D F + Q Y P+ QR + ++Y
Sbjct: 98 DEAIAALLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L D+EEGG T FP +G+++ P++G + + ++ +G+ +P ++H
Sbjct: 158 LNDVEEGGATWFPL---------------VGVRIAPKRGLLIAWNNMRADGSPNPDTLHE 202
Query: 267 SCPVVKGEKWVATKWIRD 284
PV G K++ TKW R+
Sbjct: 203 GMPVTAGTKYIITKWFRE 220
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 76/145 (52%), Gaps = 20/145 (13%)
Query: 148 LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-----AFDPQEYGPQKSQRVAS 202
+D ++ +I VT L + +A + Y IG Y+ HYD D E + R+A+
Sbjct: 394 VDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIAT 453
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 262
FL+Y+TD++ GG T+FP I ++V P++G + +Y+L +G
Sbjct: 454 FLLYMTDVDAGGATVFPI---------------IDVRVLPKKGTAVFWYNLRRSGKGIME 498
Query: 263 SIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWIR + Q
Sbjct: 499 TRHAACPVLVGTKWVSNKWIRTRGQ 523
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 79/161 (49%), Gaps = 21/161 (13%)
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDP 189
R + F+ + E + + ++ +T L E + Y IG Y H+D A
Sbjct: 373 RIAKAAFLK--DSEHNLIVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDYARQG 430
Query: 190 QEYGPQK---SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
+ +GP+ R+A++L Y++D+E GG T+FP +G + P++G
Sbjct: 431 EIHGPRDLDWGNRIATWLFYMSDVEAGGATVFP---------------AVGAALWPQKGS 475
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L PNG D ++H CPV+ G KWV+ KWI ++ Q
Sbjct: 476 AAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIHERSQ 516
>gi|404253277|ref|ZP_10957245.1| putative oxygenase [Sphingomonas sp. PAMC 26621]
Length = 226
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 92 PNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLI 151
P+F C + M N R ST+ D Q RTS + + S +
Sbjct: 46 PDFLDAATCAKFVEMIDANRRRSTVLAD-----DAVQAFRTSESCDM---DRWSPDVRPT 97
Query: 152 EEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASFLVY 206
+E IA + + ++GE RY +GQ + +H D F + Q Y P+ QR + ++Y
Sbjct: 98 DEAIATLLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHG 266
L D+EEGG T FP +G+++ P++G + + ++ +G+ +P ++H
Sbjct: 158 LNDVEEGGATWFPL---------------VGVRIAPKRGLLIAWNNMRADGSPNPDTLHE 202
Query: 267 SCPVVKGEKWVATKWIRD 284
PV G K++ TKW R+
Sbjct: 203 GMPVTAGTKYIITKWFRE 220
>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
Length = 173
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 54/144 (37%), Positives = 75/144 (52%), Gaps = 28/144 (19%)
Query: 148 LDLIEEKIAKVTMLPRINGE--AFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLV 205
L IE++I+ + +P NGE FN+ R QRVA+ L+
Sbjct: 55 LQAIEKRISVYSQVPVENGELIQFNLKR----------------------GGQRVATMLI 92
Query: 206 YLTDLEEGGETMFPFE-NGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+D EGGET FP +G G + GL V P +G+ +LF+S+ +G DP SI
Sbjct: 93 YLSDNVEGGETYFPMAGSGFCRCGGKSVR---GLSVAPVKGNAVLFWSMGLDGQSDPNSI 149
Query: 265 HGSCPVVKGEKWVATKWIRDQEQY 288
HG C V+ GEKW ATKW+R + +
Sbjct: 150 HGGCEVLAGEKWSATKWMRQRSTH 173
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 83/165 (50%), Gaps = 25/165 (15%)
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RT+S +I D S + + ++ + + +P N E F ++ Y G +Y H+DAFD
Sbjct: 40 RTNSYAWIQ--HDASEIIHEVSKRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDKS 97
Query: 191 -EYGPQK----SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 245
E G QR+ + L YL D+E+GG T FP I + VKP +G
Sbjct: 98 TEEGRNNWFPGGQRMVTALAYLNDVEDGGATDFP---------------DIHVSVKPNKG 142
Query: 246 DGLLFYSLLPNGT--IDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
D ++F++ +GT I+P S+HG PV+ GEKW W R + Y
Sbjct: 143 DVVVFHNC-KDGTSDINPNSLHGGSPVISGEKWAVNLWFRQEAIY 186
>gi|393725345|ref|ZP_10345272.1| putative oxygenase [Sphingomonas sp. PAMC 26605]
Length = 226
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 95/200 (47%), Gaps = 28/200 (14%)
Query: 90 YFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLD 149
+ P+F C +++ M N R ST+ +N Q RTS + + S +
Sbjct: 44 HHPDFLDAATCDTLVAMIDANKRRSTVLAE-----ENVQEFRTSESCDM---DRWSPDVR 95
Query: 150 LIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQK----SQRVASFL 204
+E IA + + + GE RY +GQ + +H+D F + Q Y PQ QR + +
Sbjct: 96 PTDEAIAHLLGIDPVYGETMQGQRYAVGQHFRAHFDYFNEKQAYWPQMIETGGQRTWTAM 155
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL + EGG T FP IG++V P++G L + ++ +G+ + ++
Sbjct: 156 IYLNHVAEGGATWFP---------------QIGIRVAPKKGLLLAWNNMNADGSRNTETL 200
Query: 265 HGSCPVVKGEKWVATKWIRD 284
H PVV G K++ TKW R+
Sbjct: 201 HEGMPVVSGTKYIVTKWFRE 220
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 100/207 (48%), Gaps = 24/207 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 144
P + + N + + + I +AK L +T+ R +T V T R S ++ ED
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRLARATV--RDPKTGVLTTANYRVSKSAWLEGEEDP 396
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRV 200
++ + ++I +T L E I Y +G +Y H+D E K+ RV
Sbjct: 397 --VIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRV 454
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G D
Sbjct: 455 ATFLNYMSDVEAGGATVFP-----------DF----GAAIYPKKGTAVFWYNLFRSGEGD 499
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 YRTRHAACPVLVGCKWVSNKWIHERGQ 526
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 34/196 (17%)
Query: 99 QCKSIINMAKLNLRPSTLALRKGETVDNTQG------IRTSSGVFISAAEDESGTLDLIE 152
+C+ +I L+PS + VD G +RTS I + T L +
Sbjct: 189 ECRYLIAKFSALLKPSMV-------VDPVTGRGKIDSVRTSYVAVIEPTHCDWITRKL-D 240
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF----DPQEYGPQKSQRVASFLVYLT 208
+ I+++T R NGEA N+LRY GQ+Y HYD D + K QR+ + LVYL
Sbjct: 241 KIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDALMFKDGK-QRIKTALVYLN 299
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
+ EGGET+FP + +++ P+ G ++F + NG + S H
Sbjct: 300 TINEGGETLFPK---------------LDIRIAPKSGTMVVFSNSDENGKLLLNSYHAGA 344
Query: 269 PVVKGEKWVATKWIRD 284
P V KW+ TKWIR+
Sbjct: 345 PTVSENKWLVTKWIRE 360
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 95/211 (45%), Gaps = 20/211 (9%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+V+ P + N T ++ I ++K L S + G Q RTS +I
Sbjct: 333 EVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRTSKSAWIE 392
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQK 196
++E + + E+ + +T L E F ++ Y IG Y H+D P E + P+
Sbjct: 393 --DEEHPMIRRVSERTSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIATFDPEV 450
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+ + + Y+ E GG T+FP +G+K+ P +G ++++L+ N
Sbjct: 451 GNRIITVIFYVAAPEAGGATVFP---------------DLGVKLWPEKGSCAVWWNLMRN 495
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G D + H CP + G KW+A KW ++ Q
Sbjct: 496 GEGDYRTKHAGCPTITGSKWIANKWYHERGQ 526
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 134
PF+V L + P A++F + T E+ I +A LR +T+ GE T RTS
Sbjct: 323 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGEL--ETASYRTSK 380
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
++ E E + I +I +T L + E + Y IG Y+ H+D +E
Sbjct: 381 SAWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 438
Query: 195 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 439 FQSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFW 483
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 YNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 520
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 134
PF+V L + P A++F + T E+ I +A LR +T+ GE T RTS
Sbjct: 322 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGEL--ETASYRTSK 379
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
++ E E + I +I +T L + E + Y IG Y+ H+D +E
Sbjct: 380 SAWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 437
Query: 195 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 438 FQSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFW 482
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 YNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 519
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 75/148 (50%), Gaps = 17/148 (11%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK--SQR 199
++ES + + ++A +T L E ++ Y IG Y+ H+D +E +K R
Sbjct: 75 DEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAFEKFNGNR 134
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+ L Y++D+ +GG T+F +GL V PR+G + + +L P+G
Sbjct: 135 IATVLFYMSDVAQGGATVF---------------TELGLSVFPRRGSAVFWLNLHPSGEG 179
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV++G KWV KWI Q
Sbjct: 180 DLATRHAACPVLRGSKWVCNKWIHQGGQ 207
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 99/213 (46%), Gaps = 26/213 (12%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ ++L P + + + + + I+ MA+ + ++ + T T RT+ G
Sbjct: 310 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTSSPT---RTAMGA 366
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYGP 194
++ + + I ++ ++ L E ++ Y IG Y H D F P+ G
Sbjct: 367 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQHPEVMG- 423
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
R+A+ L YLTD+E+GG TMF KV PR+G L +Y+L
Sbjct: 424 ---NRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLH 465
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G D ++ H +CP++ G KWV T+WIR++ Q
Sbjct: 466 TDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 498
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 97/216 (44%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ L P + +S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
+ +++ L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSKN--AATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 193 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWY 477
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 76/152 (50%), Gaps = 23/152 (15%)
Query: 143 DESGTLDLIEEKIAKVTML-PRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQ 195
+ S + D + +++ T L E F + Y IG Y H+D F +Y Q
Sbjct: 6 ENSASADKLSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFD-FSKVKYFTNPVLNEQ 64
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R+A+F++YL D+E GG T+FP N L ++P + + +++LL
Sbjct: 65 MGDRIATFMIYLNDVEAGGRTVFPRLN---------------LVIEPIKNSAVFWHNLLD 109
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G D +IHG+CPVV G KWVA KWI + Q
Sbjct: 110 DGQQDDRTIHGACPVVLGRKWVANKWIHEYGQ 141
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 96/216 (44%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ L P + + S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 193 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
G R+A+ + YL+D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRMATGIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWY 477
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
Length = 212
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 91/200 (45%), Gaps = 23/200 (11%)
Query: 89 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 148
++F +PE+C +I + +PS + + T G R++ S + D+ +
Sbjct: 15 VHFSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRSTVA---SPSADKYPII 71
Query: 149 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ---KSQRVASFLV 205
+ +I+ + N E +L Y G +Y+ HYD+F E PQ R+ + L+
Sbjct: 72 KAVRRRISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSF--LEGSPQLENGGNRMLTVLL 129
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D+E+GG T FP I + P G G+LF + S+H
Sbjct: 130 YLNDVEQGGWTQFPH---------------IMANIVPNVGTGILFRNTDAQNLQLRESLH 174
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
PV+ GEKW+A+ WIR++
Sbjct: 175 AGLPVIDGEKWIASIWIREK 194
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 96/215 (44%), Gaps = 21/215 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
+ +VL P + T ++ K II AK LR + + + G+ + R S
Sbjct: 324 LKMEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLI--YADYRVSKN 381
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQEY 192
+I AED I ++ VT L E + Y I +Y H+D P+ +
Sbjct: 382 TWI--AEDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRPKHF 439
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
R+A+ L+YL+D++ GG T+F G+ P +G G+ +Y+
Sbjct: 440 DRWGGNRIATMLLYLSDVDWGGRTVFT-------------NTAPGVGTDPIKGAGVFWYN 486
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
LL NG +P + H CPVV G+KWVA WI + Q
Sbjct: 487 LLRNGKSNPKTQHAGCPVVLGQKWVANLWIHEHGQ 521
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + ++ +++ +AK LR +T++ V T R S +++ E
Sbjct: 349 PYIVRYIDIISDKEIETVKKLAKPRLRRATIS-NPITGVLETASYRISKSAWLTGYE--H 405
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+++I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 406 PVIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 465
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L NG D
Sbjct: 466 TWLFYMSDVAAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFANGEGDY 510
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 511 STRHAACPVLVGNKWVSNKWIHERGQ 536
>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
Length = 311
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 106/209 (50%), Gaps = 23/209 (11%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 140
+SW PR + F + E+C +I++A + PS + G TV + + SSGV ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITV--STELLNSSGVILNT 116
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D + IE ++A T+LP+ + F I++Y+ G++ Y + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 201 ASFLVYLTDLEEGGETMFP-------FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
A+ ++YL+D GGE +FP F +G ++ ++P +G+ +LF+S+
Sbjct: 173 ATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNF---------LRPVKGNAILFFSV 223
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
N + D +S H P+ GE WVATK++
Sbjct: 224 HLNASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + ++ ++I MA+ + +T+ K ++ R S ++ E E
Sbjct: 349 PRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGELE-IANYRISKSAWLQ--EHEH 405
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T + E ++ Y IG Y H+D +E KS R+A
Sbjct: 406 KHVRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIA 465
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E+GG T+F I + + P++G +Y+L PNG D
Sbjct: 466 TVLYYMSDVEQGGGTVFT---------------KINISLWPKKGSAAFWYNLKPNGEGDY 510
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWVA KW+ ++ Q
Sbjct: 511 KTRHAACPVLTGSKWVANKWLHERGQ 536
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 107/229 (46%), Gaps = 37/229 (16%)
Query: 78 PF--QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF +VLS PR + FP+F P +C+ ++++ L + + L G + +R ++
Sbjct: 48 PFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYL--GGPPEGGFSLRRTNK 105
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH--YDAFDPQEYG 193
V ++D L + +IA T L + E + + Y +G Y H Y F +
Sbjct: 106 V-AWMSDDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQGD 164
Query: 194 PQKS--QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
KS R+A+ L+YL D+ GG T F +N + L VKP G L +Y
Sbjct: 165 IYKSSGNRLATMLIYLADVAGGGATAF-----IN----------MRLAVKPTLGTALFWY 209
Query: 252 SLLP-NGTI------------DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P +G I DP + H CPV+ G KW+ TKWI ++EQ
Sbjct: 210 NLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKWIVTKWIHEREQ 258
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 100/216 (46%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF+V P F + + +I MA+ + +T+ ++ Q R S
Sbjct: 322 PFKVEEAHHRPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQ-YRISKS 380
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
++ E+E + + ++++ +T L E ++ Y IG Y H+D E
Sbjct: 381 AWLK--EEEHKHIADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAF 438
Query: 196 KS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
KS R+A+ L Y++D+E+GG T+FP I + + P++G +Y
Sbjct: 439 KSLGTGNRIATVLFYMSDVEQGGATVFP---------------SIQVSLWPQKGSAAFWY 483
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQ 519
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + + +++ +AK LR +T+ + + T R S ++ A E
Sbjct: 332 PHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQL-TTAPYRVSKSAWLGAFE--H 388
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+D I ++I +T L E + Y +G +Y HYD E K R+A
Sbjct: 389 PVVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTGNRIA 448
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L+Y+++++ GG T+F IG V P++G + +Y+L P+G D
Sbjct: 449 TWLLYMSEVQAGGATVFT---------------DIGASVSPKKGSAVFWYNLHPSGDGDY 493
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 RTRHAACPVLLGNKWVSNKWIHERGQ 519
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 99/213 (46%), Gaps = 26/213 (12%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ ++L P + + + + + I+ MA+ + ++ + T T RT+ G
Sbjct: 335 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTSSPT---RTALGA 391
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYGP 194
++ + + I ++ ++ L E ++ Y IG Y H D F P+ G
Sbjct: 392 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQHPEVMG- 448
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
R+A+ L YLTD+E+GG TMF KV PR+G L +Y+L
Sbjct: 449 ---NRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLH 490
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G D ++ H +CP++ G KWV T+WIR++ Q
Sbjct: 491 TDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 523
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 95/216 (43%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ L P + + S+ A+ ++ ST+ G RTS G
Sbjct: 190 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 249
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 250 ASFNYSRN--AATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 306
Query: 193 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
G R+A+ + YL D+E GG T FPF + L V P +G L +Y
Sbjct: 307 EGDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWY 351
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 352 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 387
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 95/216 (43%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF++ L P + + S+ A+ ++ ST+ G RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY--- 192
+ + + L+ + + L E + Y IG Y H+D+F P+ +
Sbjct: 376 ASFNYSRN--AATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432
Query: 193 -GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
G R+A+ + YL D+E GG T FPF + L V P +G L +Y
Sbjct: 433 EGDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWY 477
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 478 NLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 75/147 (51%), Gaps = 19/147 (12%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE----YGPQKS 197
+D+ + I E+I+ +T L E + Y +G +Y H+D E Q
Sbjct: 372 DDDGPEVAKITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQDG 431
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
+R+A+FL+YL+D+E GG T F +NA G+ KP +G + +Y++ P+G
Sbjct: 432 ERIATFLIYLSDVEVGGRTAF-----VNA----------GVSAKPIKGSAVFWYNVFPSG 476
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRD 284
D + HG+CPV G KW KWIR+
Sbjct: 477 EPDLRTYHGACPVAFGNKWAGNKWIRE 503
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 134
PF+V L + P A+ F + T E+ I +A LR +T+ GE T RTS
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGEL--ETASYRTSK 379
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
++ E E + I ++I +T L + E + Y IG Y+ H+D +E
Sbjct: 380 SAWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 437
Query: 195 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 438 FQSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFW 482
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 YNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|363539943|ref|YP_004894760.1| mg709 gene product [Megavirus chiliensis]
gi|448825700|ref|YP_007418631.1| putative prolyl 4-hydroxylase [Megavirus lba]
gi|350611108|gb|AEQ32552.1| putative prolyl 4-hydroxylase [Megavirus chiliensis]
gi|371944083|gb|AEX61911.1| putative prolyl4-hydroxylase [Megavirus courdo7]
gi|425701637|gb|AFX92799.1| putative prolyl 4-hydroxylase [Megavirus courdo11]
gi|444236885|gb|AGD92655.1| putative prolyl 4-hydroxylase [Megavirus lba]
Length = 240
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 89/200 (44%), Gaps = 30/200 (15%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
NF P +C+ I+ + L S + K + N+Q +++ L++ E
Sbjct: 62 NFIEPSKCQEIMKNCRNKLFDSEVISGKNSKIRNSQQCWI--------PKNDPMVLNMFE 113
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYL 207
I+K +P N E ++RY GQ YN H+DA +E+ + QR + L+YL
Sbjct: 114 N-ISKQFGIPFENAEDLQVVRYLPGQYYNEHHDACCDDTDKCREFISRGGQRKLTVLIYL 172
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN-GTIDPTSIHG 266
+ EGG T F K + L+ KP GD L+FY L N P S+H
Sbjct: 173 NNEFEGGCTYF---------------KNLELRAKPSTGDALVFYPLAKNVNKCHPLSLHA 217
Query: 267 SCPVVKGEKWVATKWIRDQE 286
PV GEKW+A W R+
Sbjct: 218 GMPVTSGEKWIANIWFRENR 237
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + ++ +++ MAK + +T+ K ++ R S ++ + E +
Sbjct: 336 PMIVVYHDVMSDDEIETVKKMAKPRFKRATIRNSKTGELE-PANYRISKSAWLKSEEHDH 394
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 197
+ + ++ +T L E ++ Y IG Y H+D AF +G
Sbjct: 395 --ILKVTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTEAFKELGWG---- 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
R+A++L Y++D+E GG T+FP G V PR+G +Y+L PNG
Sbjct: 449 NRIATWLFYMSDVEAGGATVFP---------------PTGAAVWPRKGSAAFWYNLYPNG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ + H +CPV+ G KWV+ +WI + Q
Sbjct: 494 KGNELTRHAACPVLSGSKWVSNRWIHEHRQ 523
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 102/210 (48%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLE-TVHYRISKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 197
+ I +I +T L E + Y +G +Y H+D AF QE G
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF--QELGT--G 447
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
R+A++L Y++D+ GG T+FP +G V P++G + +Y+L +G
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 492
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + ++AK LR +T++ V T R S ++S ED
Sbjct: 338 PRIVRFHEIISDAEIEIVKDLAKPRLRRATIS-NPITGVLETAHYRISKSAWLSGYEDP- 395
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 396 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 454
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 455 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 499
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 STRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
Length = 210
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 95/211 (45%), Gaps = 27/211 (12%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+LS P Y N ++C II + L+P AL G + RT + ++S
Sbjct: 4 HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKP---ALVSGNSRGFLSTGRTGTNCWLS 60
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYG---- 193
DE T + I KI + P N E F +L Y QKY HYDAF D E
Sbjct: 61 HKNDEI-TFN-IALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAFPIDNSEKAKRCL 118
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
+ QR+ + L+YL ++ +GGET F K + +K+ P+ G L+F +
Sbjct: 119 KKGGQRLLTALIYLNNVTKGGETEF---------------KNLNIKITPKIGRILVFENT 163
Query: 254 LPNG-TIDPTSIHGSCPVVKGEKWVATKWIR 283
L N P S+H V++GEK+V W R
Sbjct: 164 LQNSLNKHPDSLHSGKQVIEGEKYVINLWFR 194
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALR-KGETVDNTQGIRTSS 134
PF+V L + P A+ F + T E+ I +A LR +T+ GE T RTS
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGEL--ETASYRTSK 379
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP 194
++ E E + I ++I +T L + E + Y IG Y+ H+D +E
Sbjct: 380 SAWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNA 437
Query: 195 QKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+S R+A+ L Y+T E GG T+F + V P + D L +
Sbjct: 438 FQSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFW 482
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y+LL +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 YNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 108/205 (52%), Gaps = 13/205 (6%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSG-VFI 138
LSW PRA + F + E+C +I++A + LA G++ V + +++S G ++I
Sbjct: 60 LSWQPRAFLYRGFLSDEECDHLISLALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYI 117
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+DE IE++I+ T LP+ N E +++Y+ + Y+ F +
Sbjct: 118 ---DDEVAAR--IEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEP 171
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
+A+ L++L+++ GGE FP + S + GL+ P +G+ +LF+++ PN +
Sbjct: 172 LMATVLLHLSNVTRGGELFFPESESKSGILSDCTESSSGLR--PVKGNAILFFNVHPNAS 229
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIR 283
D +S + CPV++GE W ATK+
Sbjct: 230 PDKSSSYARCPVLEGEMWCATKFFH 254
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 102/210 (48%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 308 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLE-TVHYRISKSAWLSGYEDP- 365
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 197
+ I +I +T L E + Y +G +Y H+D AF QE G
Sbjct: 366 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF--QELGT--G 420
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
R+A++L Y++D+ GG T+FP +G V P++G + +Y+L +G
Sbjct: 421 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 465
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 466 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 495
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 76/148 (51%), Gaps = 28/148 (18%)
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY-------GPQ 195
+++ T I ++I +T E N+ Y +G + HYD + P+ Y GP
Sbjct: 385 EQTTTRARIYQRITDITGFQLFVQEELNVANYGLGTIFGPHYD-YTPENYDIGWFMGGP- 442
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
+ + L Y++DL++GG T+FP I + V PR+G LL+++L
Sbjct: 443 ----LGTILFYVSDLQQGGATIFP---------------SINITVSPRKGSALLWFNLYD 483
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+G DP ++H SCPV++G++W TKW+
Sbjct: 484 DGEPDPRTLHSSCPVIEGDRWTLTKWVH 511
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 99/206 (48%), Gaps = 23/206 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSGVFISAAEDE 144
P+ N + + + I +A+ LRP+ T G V ++ I ++ ++ E
Sbjct: 321 PKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLYYW----E 376
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYG--PQKSQRVA 201
++ +++++ T L E ++ Y IG Y H+D A +E+ P + R+A
Sbjct: 377 HRLINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDRIA 436
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+E GG T+FP +G +V P +G G +Y+LL +G D
Sbjct: 437 TMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNLLKSGEGDM 481
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H CPV+ G KWV+ WI ++ Q
Sbjct: 482 LTEHAGCPVLVGSKWVSNMWIHERGQ 507
>gi|228999322|ref|ZP_04158902.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
gi|229006877|ref|ZP_04164509.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228754370|gb|EEM03783.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
gi|228760519|gb|EEM09485.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
Length = 195
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 94/215 (43%), Gaps = 33/215 (15%)
Query: 81 VLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA 140
VL P + TP +C+ +I ++K +++P+ GE + F
Sbjct: 7 VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGERKSD----------FTWL 56
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQ 195
G + + E IA LP + E RY++G K+++H D + D + Q
Sbjct: 57 PHYSHGLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
QR+ + ++YL + GGET FP + L V P +G LL +
Sbjct: 117 GGQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGK-LLVFENCK 160
Query: 256 NGTIDPT--SIHGSCPVVKGEKWVATKWIRDQEQY 288
GT +P S+H C V +GEKW+ T W R++ QY
Sbjct: 161 RGTNEPHPLSLHEGCAVHEGEKWIVTLWFREKPQY 195
>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 535
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 109/259 (42%), Gaps = 54/259 (20%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMA------KLNLRPSTLALRKGETVDNTQGI 130
+ + +S PR NF + E+ +I L+ ST+ +
Sbjct: 179 VIIESISESPRTFRLHNFFSGEEADKLIKRTLEIDDPSNKLQQSTVGANDNKNKKKKSKH 238
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGE---AFNILRYKIGQKYNSHYD-- 185
RTS F + +E +D I +++ V L + +LRY+ Q Y +H D
Sbjct: 239 RTSENAFDTVSE---AAVD-IRKRVFDVLSLGEFQADMADGLQLLRYQQKQAYIAHEDYF 294
Query: 186 --------AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPF---------ENGMNADG 228
FDP + G S R A+ +YL+D+ GG+T+FP E +
Sbjct: 295 PVGAAKDFNFDPHKGG---SNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNS 351
Query: 229 SYDYQ------------------KC-IGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCP 269
+ DY+ KC L P +G +LFYS PNG +DP S+HG CP
Sbjct: 352 AQDYEAIGAELFEPGSWEMDMVRKCSTKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCP 411
Query: 270 VVKGEKWVATKWIRDQEQY 288
V++G KW A W+ ++ ++
Sbjct: 412 VLEGTKWGANLWVWNRRRH 430
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 108/208 (51%), Gaps = 14/208 (6%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSG-VFI 138
LSW PRA + F + E+C +I++A + LA G++ V + +++S G ++I
Sbjct: 60 LSWQPRAFLYRGFLSDEECDHLISLALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYI 117
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQ 198
+DE IE++I+ T LP+ N E +++Y+ + Y+ F +
Sbjct: 118 ---DDEVAAR--IEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEP 171
Query: 199 RVASFLVYLTDLEEGGETMFP---FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
+A+ L++L+++ GGE FP +N + G ++P +G+ +LF+++ P
Sbjct: 172 LMATVLLHLSNVTRGGELFFPESELKNSQSKSGILSDCTESSSGLRPVKGNAILFFNVHP 231
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIR 283
N + D +S + CPV++GE W ATK+
Sbjct: 232 NASPDKSSSYARCPVLEGEMWCATKFFH 259
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + + + I MA+ L+ +T+ K ++ R S ++ ED
Sbjct: 44 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHEDV- 101
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T L E ++ Y +G Y+ HYD +E KS R+A
Sbjct: 102 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 160
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+ +GG T+FP+ +G+ ++P +G ++++L P+G D
Sbjct: 161 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 205
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV++G KWV KW+ + Q
Sbjct: 206 RTRHAACPVLQGSKWVCNKWLHEAGQ 231
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + +P + + MA + ST+ G + R S ++ A D
Sbjct: 328 PFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQ-NKKSSFRVSKNAWL--AYDSH 384
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
T+ + ++ T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 385 PTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAI 444
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+D+E+GG T FPF N VKP+ G+ L +Y++ + +D +
Sbjct: 445 FYLSDVEQGGATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTK 489
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 490 HAGCPVLKGSKWIGNVWIHEATQ 512
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + +P + + MA + ST+ G + R S ++ A D
Sbjct: 328 PFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQ-NKKSSFRVSKNAWL--AYDSH 384
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
T+ + ++ T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 385 PTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAI 444
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+D+E+GG T FPF N VKP+ G+ L +Y++ + +D +
Sbjct: 445 FYLSDVEQGGATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTK 489
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 490 HAGCPVLKGSKWIGNVWIHEATQ 512
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + + + I + A+ R +T+ K ++ R S ++ AEDE
Sbjct: 349 PYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFA-NYRISKSAWLKDAEDE- 406
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +++ +T L E ++ Y IG Y H+D +E KS R+A
Sbjct: 407 -MIRTISQRVEDMTGLTMETAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIA 465
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+ +GG T+FP + L + PR+G +++L +G D
Sbjct: 466 TVLFYMSDVTQGGATVFP---------------SLNLALWPRKGTAAFWFNLHASGRGDY 510
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 511 ATRHAACPVLTGTKWVSNKWIHERGQ 536
>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
Length = 535
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 84/179 (46%), Gaps = 25/179 (13%)
Query: 117 ALRKGETVDNTQG-----IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNI 171
L++ E T G RTS G DE ++ + + + ++ L E I
Sbjct: 354 VLQRSEVYSPTNGSTAATFRTSQGTVFEY--DEHPIIEKLSQHMTLISGLDMGFAEPLQI 411
Query: 172 LRYKIGQKYNSHYDAF-DPQEYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGMNADG 228
Y IG Y H D+F + +Y Q K+ R+A+ + YL+++E GG T FPF
Sbjct: 412 ANYGIGGHYEPHMDSFPESFDYSLQRFKTNRIATGIFYLSNVEAGGATAFPF-------- 463
Query: 229 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ L VKP QG L +Y+L +G D + H CPV++G KW+A WIR Q
Sbjct: 464 -------LPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKWIANVWIRLSHQ 515
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALE-TVHYRISKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 79/159 (49%), Gaps = 21/159 (13%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRI--NGEAFNILRYKIGQKYNSHYDAF 187
IR S ++ +DE + + ++I +T L + E +L Y +G +Y H+D
Sbjct: 125 IRISQQAWLHDKDDE--IVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYM 182
Query: 188 DPQE--YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQG 245
+E +G R+A+FL+YL+D+ GG T+FP N + V +
Sbjct: 183 TAEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVAN---------------VTVPVVKN 227
Query: 246 DGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
GLLF LL +G D S+H CPVV G KW+A KWI +
Sbjct: 228 AGLLFMDLLRSGRGDVNSLHAGCPVVIGSKWIANKWIHE 266
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 94/203 (46%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + +P + + MA ++R ST+ G + R S ++ A +
Sbjct: 330 PFVVTYHDMLSPNKIAQLREMAVPHMRRSTVNPLPGGQ-NKKSSFRVSKNAWL--AYETH 386
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
T+ + ++ T L E + Y +G Y H+D F +P Y ++ R+A+ +
Sbjct: 387 PTMGKMLRDLSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAI 446
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF N V+P+ G+ L +Y+L + +D +
Sbjct: 447 YYLSEVEQGGATAFPFLN---------------FAVRPQLGNVLFWYNLHRSSDMDYRTK 491
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 492 HAGCPVLKGSKWIGNVWIHEVTQ 514
>gi|307108823|gb|EFN57062.1| hypothetical protein CHLNCDRAFT_143806 [Chlorella variabilis]
Length = 514
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 102/237 (43%), Gaps = 47/237 (19%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P L F +PE+C + + +++ S + + + +RTS G F++ +
Sbjct: 287 PCLLLADAFLSPEECGEVRALGAPHMKRSKV------SAGDETPLRTSWGTFLTGPLAQQ 340
Query: 146 GTLDLIEEKIAKVTMLPRIN--------GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
+E ++ ++ L GEA I+RY GQ Y H D S
Sbjct: 341 PVAARLEGRVRQLAALACEAEGRRALQLGEATQIVRYDPGQFYALHLD-----NRAGDSS 395
Query: 198 QRVASFLVYLTDLEEGGETMFPFENG--------MNADGSYDY----------------- 232
+R A+ ++Y++D+E GG T FP G A G+ +
Sbjct: 396 RRAATVMIYISDVEAGGATHFPRSCGYPLERALEACAAGARNEPAPPPGACPPAGHSPRA 455
Query: 233 ---QKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
Q GL V+PR+G ++F+S LP G D SIH + V G KW+ T+W R+Q+
Sbjct: 456 GQPQHPPGLWVQPREGRAVIFWSRLPGGGEDKASIHEAERVEAGTKWICTRWCREQD 512
>gi|103487007|ref|YP_616568.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
gi|98977084|gb|ABF53235.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
Length = 218
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 90/200 (45%), Gaps = 28/200 (14%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 150
P F E C ++I++ + RPST+A G+ RTSS + + +
Sbjct: 32 LPRFLDAETCAALIDLIDSDARPSTIADANGDAA-----FRTSSTCDL---DHRLPIVIA 83
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ-----EYGPQKSQRVASFLV 205
+ K+ +T +P GE RY IG+++ +H D FDP Y QR + ++
Sbjct: 84 VNNKLHDLTGIPLAYGEPLQGQRYDIGEEFKAHTDYFDPHGADWDTYCAVPGQRSWTLMI 143
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL GG T F M+ +P G L + ++ P+GTI+P ++H
Sbjct: 144 YLNQPAAGGATRFLATGKMH---------------QPEVGKLLAWNNVRPDGTINPDTLH 188
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
V KG K++ TKW R++
Sbjct: 189 HGMKVRKGRKYIITKWFRER 208
>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
Length = 539
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 96/208 (46%), Gaps = 25/208 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALR---KGETVDNTQGIRTSSGVFISAAE 142
P + N + + + +A+ N++ S + + ETV RTS G E
Sbjct: 329 PFVVQVHNIVSQKDMNLLQKIARPNIQRSQVYAQDHNANETV--AAAYRTSKGATFEYFE 386
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGP--QKSQR 199
S ++L+ +A ++ L + E I Y IG Y H+D F D Y P + R
Sbjct: 387 HRS--MELLSRHVADLSGLDMNSAELLQIANYGIGGHYEPHWDCFPDHHVYLPDDRDGNR 444
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+ + YL+++E GG T FPF + L V P +G + +Y+L +G
Sbjct: 445 IATGIYYLSEVEAGGGTAFPF---------------LPLLVTPERGSLVFWYNLHRSGDQ 489
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV++G KW+A WIR Q
Sbjct: 490 DYRTKHAACPVLQGSKWIANVWIRQSNQ 517
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 79/149 (53%), Gaps = 18/149 (12%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AFDPQEYGPQKSQ 198
++E + I E+ + +T L E ++ Y IG +Y H+D +P + +
Sbjct: 389 DEEDPLIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFDFSRRSEPTAFEKWRGN 448
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+ + + Y+TD+E GG T+F ++A G+KV P +G ++++LLP+G
Sbjct: 449 RILTVIYYMTDVEAGGATVF-----LDA----------GVKVYPEKGSAAVWHNLLPSGE 493
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWVA KW +++Q
Sbjct: 494 GDMRTRHAACPVLTGSKWVANKWFHERDQ 522
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 78/161 (48%), Gaps = 20/161 (12%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS G + ++ T + + +A ++ L E I Y IG Y H+D+F
Sbjct: 371 FRTSQGASFNYSQ--YATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPE 428
Query: 190 QEYGPQKS---QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
P+ R+A+ + YL+D+ GG T FPF + L V P +G
Sbjct: 429 HHEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGS 473
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 474 LLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 67/123 (54%), Gaps = 17/123 (13%)
Query: 167 EAFNILRYKIGQKYNSHYDAFDPQEYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGM 224
E+ + Y IG Y HYD + P+ R+A+ + YL+D+EEGG T+FP
Sbjct: 397 ESLQVNNYGIGGHYLPHYDWSREENPYPELNTGNRIATLMFYLSDVEEGGATVFPH---- 452
Query: 225 NADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
+G+ V P++G + +Y+L +G D ++HG+CPV+ G KWVA KWI +
Sbjct: 453 -----------LGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSKWVANKWIHE 501
Query: 285 QEQ 287
+ Q
Sbjct: 502 RHQ 504
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 77/152 (50%), Gaps = 6/152 (3%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I + +SW PRA + NF + +C + ++ + S L + IRTS G
Sbjct: 6 IRIETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRS-LVVDSKTGQSKLDDIRTSYGA 64
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGP- 194
ED + +EE+IA+ T LP GE ILRY GQKY++H+D F DP +
Sbjct: 65 AFGRGEDP--VIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAY 122
Query: 195 -QKSQRVASFLVYLTDLEEGGETMFPFENGMN 225
+ R A+ L+YL+ +E GGET P + ++
Sbjct: 123 LHEGNRYATVLLYLSGVEGGGETNLPLADPID 154
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 78/161 (48%), Gaps = 20/161 (12%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
RTS G + ++ T + + +A ++ L E I Y IG Y H+D+F
Sbjct: 371 FRTSQGASFNYSQ--YATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPE 428
Query: 190 QEYGPQKS---QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGD 246
P+ R+A+ + YL+D+ GG T FPF + L V P +G
Sbjct: 429 HHEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGS 473
Query: 247 GLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L +Y+L P+G D + H +CPV++G KW+A WIR++ Q
Sbjct: 474 LLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514
>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
gigas]
Length = 885
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 109/244 (44%), Gaps = 47/244 (19%)
Query: 75 TNIPF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK----GE--- 122
T IP+ +V+++ PR F + + + + ++A L ST+ L G+
Sbjct: 643 TVIPYYKAKEEVVNYEPRIAIFHDVISSTSIEHLKSIASKGLTRSTVFLENTGPNGQVTI 702
Query: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLP------RINGEAFNILRYKI 176
T IR S +I DE L +E +I +T L R + E F ++ Y +
Sbjct: 703 TYGKQDNIRVSQTCWIRT--DEYPELLRLENRIQLITGLSAEYKPVRSHSEKFQVVNYGV 760
Query: 177 GQKYNSHYD-----------AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN 225
G Y +H+D D ++ R+A+++ Y+ D + GG T+FP
Sbjct: 761 GGMYTAHHDYTGYKLGIISNPMDSEDISTS-GDRMATWMFYMNDAKAGGATVFPE----- 814
Query: 226 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+ ++ +G +++L P+G DP ++HG CPV+ G KWV KWIR++
Sbjct: 815 ----------VRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKWVTNKWIREE 864
Query: 286 EQYD 289
Q D
Sbjct: 865 GQMD 868
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + + + I MA+ L+ +T+ K ++ R S ++ ED
Sbjct: 329 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHED-- 385
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T L E ++ Y +G Y+ HYD +E KS R+A
Sbjct: 386 VVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 445
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+ +GG T+FP+ +G+ ++P +G ++++L P+G D
Sbjct: 446 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 490
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV++G KWV KW+ + Q
Sbjct: 491 RTRHAACPVLQGSKWVCNKWLHEAGQ 516
>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
Length = 518
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 131
I ++LS P + + +P + I + +K + PS ETV+ R
Sbjct: 301 IKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPS-------ETVNAANEFEVAKFR 353
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 189
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 354 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 411
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 412 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 456
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 457 WYNLHTEGLLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 494
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 80/162 (49%), Gaps = 20/162 (12%)
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA 186
T R S ++S E +D +E +IA +T L E F + Y + +Y+ H+D
Sbjct: 335 TAHYRISKNCWLSGRE-HGEVIDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDF 393
Query: 187 FDPQEYGPQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242
S R+A+ LV+++ +E GG T+FP+ +G ++ P
Sbjct: 394 SRDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPY---------------VGARILP 438
Query: 243 RQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
++GD + +++LL +G D + H CPV+ G KWVA KWI +
Sbjct: 439 QKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWIHE 480
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 100/218 (45%), Gaps = 23/218 (10%)
Query: 75 TNIPFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRT 132
T P+++ L P + + + + ++A+ L+ S + R G + RT
Sbjct: 318 TYAPYKLEELHLDPYIIQVHDVISARDTAELQHLARPELQRSQVYSRTGHE-HISANFRT 376
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQE 191
S G + + + +A+++ L + E I Y IG Y H D+F D +
Sbjct: 377 SQGTTFEYTDHP--IMQKMSHHVAEISGLDMRSAEPLQIANYGIGGHYEPHMDSFPDSYD 434
Query: 192 YGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
Y K+ R+A+ + YL+++E GG T FPF + L V P +G L
Sbjct: 435 YSLNMYKTNRLATGIYYLSNVEAGGGTAFPF---------------LPLLVTPERGSLLF 479
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L P+G D + H +CPV++G KW+A WIR Q
Sbjct: 480 WYNLHPSGDADYRTKHAACPVLQGSKWIANVWIRLSNQ 517
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/212 (26%), Positives = 98/212 (46%), Gaps = 23/212 (10%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I +V+S P L + N + +++ +A L+ +T+ + ++ R S
Sbjct: 325 INVEVISLQPYILIYHNLLNDLEVEALKTLAAPMLQRATVHNKDTGKLEYAT-YRISKSA 383
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGP-- 194
+++ +D+ + I I VT L + EA I Y IG Y H+D D +
Sbjct: 384 WLN--DDDHPLVRRISTLIEDVTGLTMESAEALQIANYGIGGHYEPHFDHADVRSGTDVF 441
Query: 195 ---QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+ R+A+ L+YL+ +E GG T+F G++++PRQG +Y
Sbjct: 442 KTWKGGNRIATMLIYLSSVELGGATVFS---------------SAGVRIEPRQGSAAFWY 486
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIR 283
+L NG + + H +CPV+ G KW+A KWI
Sbjct: 487 NLHRNGNGNNLTRHAACPVLIGSKWIANKWIH 518
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 100/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + + + I MA+ L+ +T+ K ++ R S ++ ED
Sbjct: 347 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHED-- 403
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +++ +T L E ++ Y +G Y+ HYD +E KS R+A
Sbjct: 404 VVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 463
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y++D+ +GG T+FP+ +G+ ++P +G ++++L P+G D
Sbjct: 464 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 508
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV++G KWV KW+ + Q
Sbjct: 509 RTRHAACPVLQGSKWVCNKWLHEAGQ 534
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + + I +AK LR +T++ V T R S +++A ED
Sbjct: 351 PYIVRYIDIISEAEMDKIKQLAKPRLRRATIS-NPVTGVLETAPYRISKSAWLTAYEDP- 408
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
++ I ++I +T L E + Y +G +Y H+D E K R+A
Sbjct: 409 -VVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 467
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 468 TWLFYMSDVSAGGATVFP---------------DVGASVGPQKGTAVFWYNLFASGEGDY 512
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 513 STRHAACPVLVGNKWVSNKWIHERGQ 538
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
Length = 547
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 100/214 (46%), Gaps = 21/214 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSG 135
I ++LS P L F + + ++ I + +K ++ PS T + + D+ RTS
Sbjct: 328 IKTEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKS 387
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYG 193
V+ S+ ++ T I E++ T L E F ++ Y +G + +H D D +
Sbjct: 388 VWYSSTSND--TTKRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFN 445
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
+ R+A+ L YL ++ +GG T FP N L V P+ G L +Y+L
Sbjct: 446 GTRD-RLATTLFYLNEVRQGGGTHFPRLN---------------LTVFPQPGSALFWYNL 489
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G +++H CPV+ G KWV +KW+ D Q
Sbjct: 490 DTRGNDHTSTLHTGCPVIVGSKWVMSKWVEDAGQ 523
>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
Length = 498
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 76/146 (52%), Gaps = 16/146 (10%)
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVA 201
+++ + + ++ +T L I +A ++ Y +G Y+ HYD+ + E R+A
Sbjct: 351 NDTAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHNYSEANRLILGDRIA 410
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y+ +++ GG T FP+ I + V P++G +L+Y+L G ++P
Sbjct: 411 TVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSAVLWYNLDNAGQMNP 455
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+IH CPV+ G K+V TKWI + Q
Sbjct: 456 KAIHAGCPVIVGSKYVLTKWINEIPQ 481
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 94/214 (43%), Gaps = 23/214 (10%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+VLS P + + N T + + +A L+ + + + + R S ++
Sbjct: 346 EVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTAWLD 405
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ------EYG 193
+++ + I I + L E I Y IG Y H D + + EY
Sbjct: 406 --KEDHPAVKRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALSEYT 463
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
+ R+A+ L+YL+++E GG T+FP G++V+PRQG +Y++
Sbjct: 464 SRIGNRIATVLIYLSNVEAGGATVFP---------------KAGVRVEPRQGSAAFWYNM 508
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NG + S+H +CPV+ G KW A W R+ Q
Sbjct: 509 HRNGEGNKLSVHAACPVLIGSKWAANLWFREVGQ 542
>gi|149186836|ref|ZP_01865146.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
gi|148829503|gb|EDL47944.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
Length = 211
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 30/208 (14%)
Query: 86 PRALYFP--NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAED 143
P+A F +FA+P C+ ++ + + + RPSTLA N RTS + E
Sbjct: 24 PKAELFQLRDFASPAMCEQLVALIEKDRRPSTLA-----DAGNDHYFRTSETCDLDPDEP 78
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQ 198
+ IE +A + + GE RY +GQ++ H D F D ++Y Q
Sbjct: 79 ---VVCEIEALLAALNGIDPKFGEPLQGQRYDVGQEFKPHCDYFNRGGQDWEKYCSVAGQ 135
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R +F+VYL +E GG T F K +G +P G + + ++ P G
Sbjct: 136 RTWTFMVYLNAVEAGGATRF---------------KAVGKTFQPEPGKLVCWNNMRPEGR 180
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+P +IH V KG K+V TKW R++E
Sbjct: 181 ENPNTIHHGMKVRKGVKYVITKWYREKE 208
>gi|402824057|ref|ZP_10873445.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
gi|402262407|gb|EJU12382.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
Length = 210
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 99/197 (50%), Gaps = 28/197 (14%)
Query: 94 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 153
F + E+C ++++ + N RPST+A G+ RTSS +S + +D + +
Sbjct: 34 FLSAERCAQLVDLIETNNRPSTIADYNGD-----DAFRTSSTCDLSR---DYPVVDELAQ 85
Query: 154 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYLT 208
+++++ + + E RY++GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 86 ALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPDSADFEKYCKVPGQRTWTFMIYLN 145
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
D+E GG T F K I ++P G + + + +G+ + ++H +
Sbjct: 146 DVEAGGATRF---------------KVIDKMIQPETGKLIGWNNRRADGSCNAATLHHAM 190
Query: 269 PVVKGEKWVATKWIRDQ 285
V KG K+V T+W R++
Sbjct: 191 KVRKGRKYVITQWYRER 207
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 97/208 (46%), Gaps = 26/208 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQ--GIRTSSGVFISAAED 143
P + + N ++ +++ MA+ + +T+ + N + R S ++ + E
Sbjct: 344 PLIVIYHNVINDDEIETVKKMAQPRFKRATV---QNSVTGNLEPANYRISKSAWLKSEEH 400
Query: 144 ESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QR 199
+ + + ++ VT L E ++ Y IG Y H+D +E K R
Sbjct: 401 DH--VFKVTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNR 458
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
VA++L Y++++E GG T+FP + L + P++G +Y+L PNG
Sbjct: 459 VATWLFYMSEVEAGGATVFP---------------KLNLALWPQKGSAAFWYNLHPNGEG 503
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 504 NELTRHAACPVLTGSKWVSNKWIHERNQ 531
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 101/210 (48%), Gaps = 27/210 (12%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ + LS P + + + + + I+ +AK +LR + + G ++Q ++G
Sbjct: 296 LKLEQLSLDPYMVLYHDVVQANEREHIMQLAKPHLRRALV----GAARAHSQRFAMNAGF 351
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ- 195
+ ++S + +++ ++ N +L Y IG +Y HYD + Q+ Q
Sbjct: 352 ----SYNDSRQGQRLRQRLEDMSGFDLTNSGQLAVLNYGIGGQYYMHYDCWFSQDDAAQV 407
Query: 196 ---KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
K R+A+ L+YLTD++ GG T FP +GL V+P G L++++
Sbjct: 408 ASIKDNRIATILLYLTDVQLGGLTSFP---------------ALGLAVQPSPGSALIWHN 452
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
+ D ++H +CP++ G +WVAT+WI
Sbjct: 453 MNNAAECDRRTLHAACPLLLGTRWVATQWI 482
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 197
+ I +I +T L E + Y +G +Y H+D AF QE G
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF--QELGT--G 447
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
R+A++L Y++D+ GG T+FP +G V P++G + +Y+L +G
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 492
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 99/217 (45%), Gaps = 33/217 (15%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+VLS P + + NF T + + I A+ LR S +A GE T R S ++
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA--SGEN-QATVEYRISKSAWLK 369
Query: 140 AAED-ESGTLDLIEEKIAKVTMLPRIN-----GEAFNILRYKIGQKYNSHYD-AFDPQE- 191
+E G LD +++ML +N E ++ Y IG Y H+D A P
Sbjct: 370 GSESCIVGKLD------QRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSP 423
Query: 192 -YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ + RVA+F++YL+ +E GG T F + N V + + +
Sbjct: 424 VFKLKTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVLKKAAIFW 468
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
++L NG D ++H CPV+ G+KWVA KW+ + Q
Sbjct: 469 WNLHRNGRGDAETLHAGCPVLIGDKWVANKWVHEYGQ 505
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 393
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 394 EDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 453
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
VA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 454 VATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 498
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 DYRTRHAACPVLVGCKWVSNKWFHERGQ 526
>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
Length = 530
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 97/210 (46%), Gaps = 25/210 (11%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
++L++ P + + T Q I +++ L S + K + RTS +
Sbjct: 326 EMLNFDPAIYVYHDVLTDSQNAIIKEVSRPKLHRSGV-FSKTDADTGLSNFRTSQ----T 380
Query: 140 AAEDESGT--LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF---DPQEYGP 194
A D+S + + +K + ++ L E +L Y IG Y H+D + E+
Sbjct: 381 AWHDDSTHPLIARLSQKASAISNLTLETVEHLQVLNYGIGGLYEPHWDFVQGEERNEFSE 440
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
RVA+F+ YL++LE GG T++P +G V PR+ L+Y+L+
Sbjct: 441 SDRNRVATFICYLSELEAGGYTVYP---------------TVGAAVVPRKNSCALWYNLM 485
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
NGT D + H +CP++ G KWVA KW +
Sbjct: 486 RNGTGDYRTYHAACPILYGYKWVANKWFHE 515
>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
Length = 347
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 98/213 (46%), Gaps = 25/213 (11%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
++L++ P + + T Q I +++ L S + K + RTS +
Sbjct: 143 EMLNFDPAIYVYHDVLTDSQNAIIKEVSRPKLHRSGV-FSKTDADTGLSNFRTSQ----T 197
Query: 140 AAEDESGT--LDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF---DPQEYGP 194
A D+S + + +K + ++ L E +L Y IG Y H+D + E+
Sbjct: 198 AWHDDSTHPLIARLSQKASAISNLTLETVEHLQVLNYGIGGLYEPHWDFVQGEERNEFSE 257
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
RVA+F+ YL++LE GG T++P +G V PR+ L+Y+L+
Sbjct: 258 SDRNRVATFICYLSELEAGGYTVYP---------------TVGAAVVPRKNSCALWYNLM 302
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NGT D + H +CP++ G KWVA KW + Q
Sbjct: 303 RNGTGDYRTYHAACPILYGYKWVANKWFHEGGQ 335
>gi|301607015|ref|XP_002933103.1| PREDICTED: transmembrane prolyl 4-hydroxylase-like [Xenopus
(Silurana) tropicalis]
Length = 469
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 113/271 (41%), Gaps = 45/271 (16%)
Query: 56 SVKDEYKWMPHGQAGD----DSVTNIPFQVLSWMP----RALYFPNFATPEQ--CKSIIN 105
S D + + H Q G + +T+ WM R +Y A P+ S+
Sbjct: 157 SKADIFNLLDHNQDGQLQIKEVLTHSRLGNGRWMTPENIREMYSALRADPDGNGVLSLDE 216
Query: 106 MAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRI- 164
KLNLR L K E V + +R S ++ E L I +++ K+T LP
Sbjct: 217 FKKLNLRDFHKYLGKQE-VTMSDLVRNSHHTWLYQGEGAHHVLRSIRQRVIKLTHLPLDI 275
Query: 165 --NGEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFLVYLT 208
N E ++RY G Y++H D+ E P + S R + L YL
Sbjct: 276 VENSEPLQVVRYDTGGHYHAHMDSGPVFPETACTHTKLTTNETAPFETSCRYVTVLFYLN 335
Query: 209 DLEEGGETMFP------------FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
++ GGET FP +N ++ + + L++KPRQG + +Y+ L +
Sbjct: 336 NVTGGGETTFPVADNRTYEELSLIQNDVDLRDTRKHCDKGNLRIKPRQGTAVFWYNYLSD 395
Query: 257 GT-----IDPTSIHGSCPVVKGEKWVATKWI 282
G +D ++HG C V G KW+A WI
Sbjct: 396 GKGWVGDVDEFALHGGCLVTAGTKWIANNWI 426
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 99/214 (46%), Gaps = 23/214 (10%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ +V++ P + + + + +I +A+ ++ S + + E + IR S
Sbjct: 315 LKLEVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAVGDTRSEQISK---IRISQNA 371
Query: 137 FISAAEDE-SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
+ D TL+ +A P + E + Y +G Y+ HYD P
Sbjct: 372 WFENEHDPIVETLNQRARDMAGGLNEP--SYELLQVNNYGLGGFYSIHYDWSTSANPFPN 429
Query: 196 KS--QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
K R+A+ + YL+D++EGG T+FP N L V+PR+G + +Y+L
Sbjct: 430 KGMGNRIATLMFYLSDVQEGGSTVFPRLN---------------LAVRPRKGTAIFWYNL 474
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NG + ++H +CPV+ G KWVA KWI ++ Q
Sbjct: 475 HRNGKGNKKTLHAACPVLIGSKWVANKWIHERHQ 508
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 393
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 394 EDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTG 453
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 454 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 498
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 499 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 528
>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 291
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 98/208 (47%), Gaps = 29/208 (13%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LSW+PR + F + E+C +I+ LR T + + TQ +G
Sbjct: 62 LSWLPRVFLYRGFLSEEECDHLIS-----LRKETTEVYSVDADGKTQLDPVVAG------ 110
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
IEEK++ T LP NG + + Y +K D F + +A
Sbjct: 111 ---------IEEKVSAWTFLPGENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLHESLLA 160
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI--GLKVKPRQGDGLLFYSLLPNGTI 259
+ ++YL++ +GGE +FP + M S C+ G ++P +G+ +LF++ L N ++
Sbjct: 161 TVVLYLSNTTQGGELLFP-NSEMKPKNS-----CLEGGNILRPVKGNAILFFTRLLNASL 214
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D S H CPVVKGE VATK I ++Q
Sbjct: 215 DGKSTHLRCPVVKGELLVATKLIYAKKQ 242
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 96/206 (46%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 186 PRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 243
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 244 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 302
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 303 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 347
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 348 STRHAACPVLVGNKWVSNKWIHERGQ 373
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 102/212 (48%), Gaps = 29/212 (13%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSI--INMAKLNLRPSTLALRKGETVDNTQGIRTSS 134
IP + +S P + + + Q ++I I+ +K N P+ L G + TQ
Sbjct: 290 IPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQ-----V 344
Query: 135 GVFISAAEDESGTLDLIEEKIAKVTM-LPRIN---GEAFNILRYKIGQKYNSHYDAFDPQ 190
+F + E D + EKI+++T L ++ E + Y IG Y HYD+
Sbjct: 345 SIFCCSTWLEDA-YDPVVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIA 403
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
P QR+A+ + YL+++E GG T+FP +G+ V+P++G L +
Sbjct: 404 PEDPL--QRLATMMFYLSNVEIGGATIFPR---------------LGVAVRPQKGSALFW 446
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
+L NG + ++H +CPVV G KW+A KWI
Sbjct: 447 INLKRNGLTNRQTLHAACPVVIGSKWIANKWI 478
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|323453493|gb|EGB09364.1| hypothetical protein AURANDRAFT_15704, partial [Aureococcus
anophagefferens]
Length = 148
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 82/157 (52%), Gaps = 18/157 (11%)
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD---AF 187
RTS + + A + + + +I +VT +P+ N E+F +LRY GQ+Y +H+D
Sbjct: 4 RTSENAWCTGACESNRATRAVMARIEEVTGVPKENYESFQVLRYTHGQQYRAHHDMSRGD 63
Query: 188 DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDG 247
+ GP R+ +F +Y +D+E+GGET FP + +K+ P++G
Sbjct: 64 NALACGP----RIYTFFMYFSDVEKGGETEFPMVKRPSGK---------TVKIAPKRGSA 110
Query: 248 LLFYSLLPNGTI--DPTSIHGSCPVVKGEKWVATKWI 282
LL+ S+ + DP + H + PVV+G K+ A WI
Sbjct: 111 LLWPSVTSDDPTAQDPRTRHAALPVVEGTKFAANAWI 147
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
Length = 229
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 76/146 (52%), Gaps = 16/146 (10%)
Query: 143 DESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-KSQRVA 201
+++ + + ++ +T L I + ++ Y +G Y+ HYD+ + E R+A
Sbjct: 82 NDTAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYDVHYDSHNYSEANRLILGDRIA 141
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+ L Y+ +++ GG T FP+ I + V P++G +L+Y+L +G ++P
Sbjct: 142 TVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSAVLWYNLDNSGQMNP 186
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
+IH CPV+ G K+V TKWI + Q
Sbjct: 187 KAIHAGCPVIVGSKYVLTKWINEIPQ 212
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 90/203 (44%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P F + +P + + MA ++ ST+ R G + R S ++ A +
Sbjct: 331 PYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKS-AFRVSKNAWL--AYEAH 387
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
T+ + + T L E + Y +G Y H+D F DP Y + R+A+ +
Sbjct: 388 PTMAGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGNRIATAI 447
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDFAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
[Cucumis sativus]
Length = 311
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNL-RPSTLALRKGETVDNTQGIRTSSGVFISA 140
+SW PR + F + E+C +I++A + PS + G TV + + SSGV ++
Sbjct: 59 VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITV--STELLNSSGVILNT 116
Query: 141 AEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRV 200
+D + IE ++A T+LP+ + F I++Y+ G++ Y + P +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172
Query: 201 ASFLVYLTDLEEGGETMFP-------FENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
A+ ++YL+D GGE +FP F +G ++ ++P +G+ +L +S+
Sbjct: 173 ATVVLYLSDSASGGEILFPESKVKSKFWSGRRKKNNF---------LRPVKGNAILXFSV 223
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWI 282
N + D +S H P+ GE WVATK++
Sbjct: 224 HLNASPDKSSYHIRSPIRDGELWVATKFL 252
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 99/218 (45%), Gaps = 22/218 (10%)
Query: 74 VTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTS 133
+ I ++L + P A+ F N + + + I +A L+ +T+ K +++ R S
Sbjct: 313 IAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRATVQNSKTGELEHAT-YRIS 371
Query: 134 SGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG 193
++ D +D + +I T L + E + Y +G Y+ H+D +E
Sbjct: 372 KSAWLKG--DLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARKEEKN 429
Query: 194 PQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
K+ R+A+ L Y++ E GG T+F +G V P + D L
Sbjct: 430 AFKTLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDALF 474
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 475 WYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQ 512
>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
Length = 543
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 96/214 (44%), Gaps = 21/214 (9%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
I ++LS P L + ++ I +K +L S + + DN RTS
Sbjct: 324 IKTEILSLDPFVLLLHDMVRQKESTLIRASSKEHLLQSEITNTDASSSEDNVAIFRTSKS 383
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--DPQEYG 193
V+ S+ D + T I E++A T L E F ++ Y +G + +H D D +
Sbjct: 384 VWYSS--DFNDTTKKITERLADATGLDMHFTEYFQVINYGLGGFFATHLDMLLSDKTRFN 441
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
S R+A+ + YL + +GG T FP N L V P+ G L +Y+L
Sbjct: 442 -GTSDRIATTVFYLNGVRQGGATHFPLLN---------------LTVFPQPGSALFWYNL 485
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G +++H CPV+ G KWV TKW+ DQ Q
Sbjct: 486 DTKGNDQRSTMHTGCPVIVGSKWVMTKWVGDQGQ 519
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 93/203 (45%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + +P + + MA +R ST+ G + R S ++ A E
Sbjct: 331 PYVATYHDMLSPRKISQLREMAVPRMRRSTVNPLPGGQHKKS-AFRVSKNAWL-AYESHP 388
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
+ ++ + + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAI 447
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAFKHLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 197
ED+ + I ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|311977988|ref|YP_003987108.1| putative prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|81999799|sp|Q5UP57.1|P4H_MIMIV RecName: Full=Putative prolyl 4-hydroxylase; Short=4-PH; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
gi|55417206|gb|AAV50856.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|308204490|gb|ADO18291.1| putative prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|339061535|gb|AEJ34839.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|351737756|gb|AEQ60791.1| Prolyl 4-hydroxylase [Acanthamoeba castellanii mamavirus]
gi|398257408|gb|EJN41016.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga lentillevirus]
Length = 242
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 109/259 (42%), Gaps = 46/259 (17%)
Query: 34 LLGSSLLSQDVTAARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPN 93
+L S +S ++R V ++KD++ + A D N PF VL+ N
Sbjct: 22 VLSKSCVSHFRNVGSLNSRDV-NLKDDFSY-----ANIDDPYNKPF-VLN---------N 65
Query: 94 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 153
P +C+ I+ A L S + + + N+Q + S + + I E
Sbjct: 66 LINPTKCQEIMQFANGKLFDSQVLSGTDKNIRNSQQMWISKN---------NPMVKPIFE 116
Query: 154 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-----DPQEYGPQKSQRVASFLVYLT 208
I + +P N E ++RY Q YN H+D+ E+ + QR+ + L+YL
Sbjct: 117 NICRQFNVPFDNAEDLQVVRYLPNQYYNEHHDSCCDSSKQCSEFIERGGQRILTVLIYLN 176
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-IDPTSIHGS 267
+ G T FP N K KP+ GD L+FY L N P S+H
Sbjct: 177 NEFSDGHTYFPNLNQ---------------KFKPKTGDALVFYPLANNSNKCHPYSLHAG 221
Query: 268 CPVVKGEKWVATKWIRDQE 286
PV GEKW+A W R+++
Sbjct: 222 MPVTSGEKWIANLWFRERK 240
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 94/204 (46%), Gaps = 21/204 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL-ALRKGETVDNTQGIRTSSGVFISAAEDE 144
P + + + +P+Q + MA +++ ST+ L G+ + + R S ++ +
Sbjct: 327 PLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKS--AFRVSKNAWLPYSTHP 384
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASF 203
+ + + T L E + Y +G Y H+D F D + Y + R+A+
Sbjct: 385 --MMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATA 442
Query: 204 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 263
+ YL+D+E+GG T FPF N V+P+ G+ L +Y+L + D +
Sbjct: 443 IFYLSDVEQGGATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDEDYRT 487
Query: 264 IHGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+A WI + Q
Sbjct: 488 KHAGCPVLKGSKWIANIWIHEATQ 511
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|219116348|ref|XP_002178969.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409736|gb|EEC49667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 302
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 99/218 (45%), Gaps = 30/218 (13%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET--VDNTQGIRTSSGVF 137
V+S P + +F + CK++I+ A ST + + T T IRTS+ V+
Sbjct: 90 HVVSSEPPLVLIHDFLSTSMCKNLIDTAT-----STDKMIRSTTGSEQETSTIRTSTTVW 144
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS 197
++ E T +I EKI+ ++ P + E ++RY+ GQ + H D D K
Sbjct: 145 LND-EQVPETSRIIAEKISSISGFPANHMENLQVVRYETGQSFKLHTDTIDAYNEM-DKR 202
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+ L+YL GGET+FP ++AD K I +++ PRQG + F++
Sbjct: 203 GRVATCLIYLAAPTIGGETLFP---DVHAD------KAIQIRIAPRQGSAIFFWNTHEKP 253
Query: 258 ------------TIDPTSIHGSCPVVKGEKWVATKWIR 283
D H PV GEKWV +W+
Sbjct: 254 GSPAYDGADMFLNTDLRMRHAGMPVDGGEKWVCNRWVH 291
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 96/210 (45%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + N + E+ I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYNVMSDEEIDRIKELAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQYITGLTVQTAELLQVANYGMGGQYEPHFDFSRNHERDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 96/206 (46%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 338 PRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 395
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 396 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 454
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 455 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 499
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 STRHAACPVLVGNKWVSNKWIHERGQ 525
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 450
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 495
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|357010489|ref|ZP_09075488.1| response regulator receiver domain-containing protein
[Paenibacillus elgii B69]
Length = 397
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 100/219 (45%), Gaps = 28/219 (12%)
Query: 78 PFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVF 137
P V P + + +C+ +I +A+ L P+ + GE R S F
Sbjct: 4 PSVVFHEEPLVASYEQVVSGPECRQLIELARHQLEPAKVI---GEKEVVASEFRKSE--F 58
Query: 138 ISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEY 192
D + + E++A + P E+ I RY +G ++ +H+D +D + +
Sbjct: 59 AWFHHDSHPLVREVSERLAALAGRPLHYAESLQIARYVVGGRFGAHFDTYDLNTVDGKRF 118
Query: 193 GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
Q QR+ + L+YL ++ GGET FP N L + P +G+ L+ +
Sbjct: 119 YDQGGQRLYTALLYLNTVDAGGETYFPELN---------------LDIAPSEGN-LIVFE 162
Query: 253 LLPNGTID--PTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
GT + P S+HGS + +GEKW+AT W R++ QY+
Sbjct: 163 TCKWGTNERHPLSLHGSRELREGEKWIATLWFRERPQYE 201
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 389
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 390 EDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTG 449
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 450 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 494
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 495 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 524
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 97/226 (42%), Gaps = 41/226 (18%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPST---------------LALRKGETVDNTQG- 129
P L F NF T + K I +A L+ +T ++ R+ G
Sbjct: 310 PEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHPVTGK 369
Query: 130 -----IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHY 184
R S ++ EDE + I ++ + L E ++ Y IG Y HY
Sbjct: 370 LEFANYRISKSGWLRDEEDE--LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHY 427
Query: 185 D-AFDPQEYGPQ--KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK 241
D A D ++ R+A+FL YL+D+E GG T+F +G V
Sbjct: 428 DFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFT---------------RVGATVW 472
Query: 242 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
P++GD +Y+L +G D ++ H +CPV+ G KWVA KWI + Q
Sbjct: 473 PQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIHEVGQ 518
>gi|357542083|gb|AET84843.1| hypothetical protein MPXG_00045 [Micromonas pusilla virus SP1]
Length = 196
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 92/196 (46%), Gaps = 29/196 (14%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 150
F NF T + +I AK NL PST++ +D + +R S ++S + +
Sbjct: 24 FRNFITSGERAHVIEEAKKNLTPSTVSTE--HKLD--ESVRKSETAWLSF---DDPIIRG 76
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 210
I EK + T P IN E +LRY+ G Y H D K+QR+ +F++ L D
Sbjct: 77 IAEKCIRYTDRPLINCEKLQVLRYEEGGHYIPHQDIL-----RNAKNQRMYTFILALNDD 131
Query: 211 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT-SIHGSCP 269
EGGET+FP N SY K R GD L F+ L N D + ++HG P
Sbjct: 132 YEGGETVFP-----NLRKSY----------KLRAGDAL-FFDTLDNYEYDTSRALHGGKP 175
Query: 270 VVKGEKWVATKWIRDQ 285
V GEKW+ W+R
Sbjct: 176 VKSGEKWICNLWVRKH 191
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 359
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 360 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTG 419
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 420 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 464
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 465 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTFKHLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + I ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTG 450
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 495
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/207 (25%), Positives = 96/207 (46%), Gaps = 24/207 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 144
P + + + + + I +AK R +T+ + GE R S ++ ++E
Sbjct: 342 PYIVIYHEVMSDAEIEVIKRLAKPRFRRATVQNYKTGEL--EVANYRISKSAWLK--DEE 397
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRV 200
+ + +++ +T L E ++ Y IG Y H+D +E KS R+
Sbjct: 398 HSVVRTVGQRVEDMTGLTMTTAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRI 457
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ L Y++D+ +GG T+FP I + ++P++G +Y+L +G D
Sbjct: 458 ATVLFYMSDVSQGGATVFP---------------SIRVALRPKKGTAAFWYNLHASGHGD 502
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 503 YATRHAACPVLTGTKWVSNKWIHERGQ 529
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 92/214 (42%), Gaps = 26/214 (12%)
Query: 82 LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA 141
LS P + + + + I + + + + L TV N +RTS FI+
Sbjct: 324 LSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVTLTNQSTVSN---VRTSQITFIAKT 380
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQ 195
E E L I+ ++A +T L E Y IG Y H D F + +
Sbjct: 381 EHE--VLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTE 438
Query: 196 KSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLP 255
R+A+ L YL+D+ +GG T FP+ + ++P++ +++L
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPY---------------LKQHLRPKKYAAAFWHNLHA 483
Query: 256 NGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
G D + HG+CP++ G KWV +WIR+ Q D
Sbjct: 484 AGRGDARTQHGACPIIAGSKWVLNRWIREFVQSD 517
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + + + +++ ++AK L +T+ + + Q R S ++S ED
Sbjct: 294 PRIVRYHEIISDAEIETVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 351
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 352 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 410
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 411 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 455
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 456 STRHAACPVLVGNKWVSNKWIHERGQ 481
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 411
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 412 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 471
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 472 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 516
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 517 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 546
>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
Length = 490
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 80/159 (50%), Gaps = 23/159 (14%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD- 188
+RTS +I A+ + E++ +T + F+++ Y +G Y HYD +
Sbjct: 336 VRTSKDSYIVDAKT-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEY 388
Query: 189 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 248
P++ R+A+ L YL +++ GG T+FP I + V P++G +
Sbjct: 389 TNTTRPKQGDRIATVLFYLGEVDSGGATIFPM---------------INITVTPKKGSAV 433
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L +G ++ S+H +CPV+ G K+V TKWI + Q
Sbjct: 434 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 472
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 331 PYVATFHDMLSPRKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 388
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 447
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|383640592|ref|ZP_09952998.1| 2OG-Fe(II) oxygenase [Sphingomonas elodea ATCC 31461]
Length = 221
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 95/198 (47%), Gaps = 28/198 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
NF QC +++ + RPSTLA + RTS + AAED +DL +
Sbjct: 43 NFLDSGQCAALMARIDEHRRPSTLA-----NAGDDYAFRTSETCDL-AAEDPLA-IDL-K 94
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
+I ++ L + E RY +GQ++ +H D FDP +Y QR + ++YL
Sbjct: 95 ARILELIGLDPDHAEPMQGQRYAVGQEFKAHTDYFDPGSVDFDKYCSVAGQRTWTVMLYL 154
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
++E GG T F K I V+P G L + +L P+GT++P +IH +
Sbjct: 155 NEVEAGGATRF---------------KAIDKIVQPEAGKLLAWNNLRPDGTVNPATIHHA 199
Query: 268 CPVVKGEKWVATKWIRDQ 285
V G K+V T+W R++
Sbjct: 200 MKVRAGCKYVITQWFRER 217
>gi|224006261|ref|XP_002292091.1| hypothetical protein THAPSDRAFT_263436 [Thalassiosira pseudonana
CCMP1335]
gi|220972610|gb|EED90942.1| hypothetical protein THAPSDRAFT_263436 [Thalassiosira pseudonana
CCMP1335]
Length = 232
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 114/236 (48%), Gaps = 39/236 (16%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAK-----LNLRPSTLALR--KGETVDNTQGIRT 132
+VLS PR L F +P + + +I++A + ++ ST+ +G T +T R+
Sbjct: 1 KVLSCAPRVLEVKKFLSPVEVQHLIDLASGAKGDVAMQRSTVLASNIRGATKTDT---RS 57
Query: 133 SSGVFISAAEDESGTLDLIEEKIAKVTML----------PRING----EAFNILRYKIGQ 178
SSG +I +D +D I +IA + + P + G EA +LRY+ G+
Sbjct: 58 SSGGWIHREQDV--IVDTIFRRIADLLKIDKNLMRDQRPPHLIGAHVVEAMQLLRYEPGE 115
Query: 179 KYNSHYDAFDPQEYGPQKSQRVASFLVYLT---DLEEGGETMFPFENGMNADG----SYD 231
+YN H+D P + +R + L+YLT D+ + G + P + DG
Sbjct: 116 EYNPHHDFTYPSIDNRYQPKRYVTILLYLTGEGDVIQDGIRLSPKNTNTDVDGLQGGETT 175
Query: 232 YQKCI------GLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKW 281
+ + I G+KV P+ G ++FY++LP+G +D S H V KG K++A W
Sbjct: 176 FPRAITTEYHDGIKVAPQSGKAVVFYNILPDGNMDDLSQHSGGKVEKGVKYLANVW 231
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 23/166 (13%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP 189
+RTS FI A+ + L I++++A +T L E Y IG Y H D F
Sbjct: 371 VRTSQFTFIPASAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQ 428
Query: 190 QEY------GPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPR 243
+ P+ R+A+ L YL+D+ +GG T FP + +KP+
Sbjct: 429 TTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTL---------------LKPK 473
Query: 244 QGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+ +++L +G D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 474 KYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIREFDQSD 519
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 420
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 421 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 480
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 481 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 525
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 526 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 555
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 343
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 344 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 403
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 404 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 448
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 449 DYRTRHAACPVLVGCKWVSNKWFHERGQ 476
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 430
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 431 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTG 490
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 491 NRVATFLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSG 535
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 536 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 565
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 450
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 495
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 DYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 101/227 (44%), Gaps = 23/227 (10%)
Query: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122
W+ H ++ + LS P YF + + ++ + II K + S +
Sbjct: 323 WLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEIGQTGNS 382
Query: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 182
TV + IRTS ++ + + L I++++ +T L E ++ Y IG +Y
Sbjct: 383 TVSD---IRTSQNTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEP 437
Query: 183 HYDAFDPQE--YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
H+D D E +G K R+ + L YL D+ GG T FPF + L V
Sbjct: 438 HFDFMDDAEKNFG-WKGNRLLTALFYLNDVPLGGATAFPF---------------LHLAV 481
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
P +G L++Y+L + D + H CPV+KG KW+ +W + Q
Sbjct: 482 PPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKWICNQWFHEAAQ 528
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 77/166 (46%), Gaps = 22/166 (13%)
Query: 127 TQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD- 185
T R S F+ E + + ++ +T L E + Y IG Y H+D
Sbjct: 387 TAKYRISKAAFLKNKEHHH--VLKMSRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDY 444
Query: 186 AFDPQEYGPQKS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK 241
A + G K R+A++L Y++D+E GG T+FP + + +
Sbjct: 445 ARKNETIGFNKDSGWRNRIATWLFYMSDVEAGGATVFP---------------ALNVALW 489
Query: 242 PRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
P++G +Y+L PNG + + H +CPV+ G KWVA KWI ++ Q
Sbjct: 490 PQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIHEKNQ 535
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 96/211 (45%), Gaps = 32/211 (15%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
+ +VL P + F + +P + + +A L+ +T+ + +G RTS G+
Sbjct: 295 LKMEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGI 354
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
++S + + I +I+ +T + ++ Y + Y H D F+ E
Sbjct: 355 WLSRSHN--NLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYALHTDYFNTAE----- 407
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
L+D+E+GG+T+FP I KP +G LL+Y+L N
Sbjct: 408 ----------LSDVEQGGDTVFPR---------------IEQAFKPERGKALLWYNLHRN 442
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
GT D + HG+CPV+ G KW+ T+WI ++ Q
Sbjct: 443 GTGDKRTEHGACPVLVGSKWIMTQWINERPQ 473
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 411
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 412 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 471
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 472 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 516
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 517 DYRTRHAACPVLVGCKWVSNKWFHERGQ 544
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 16/142 (11%)
Query: 147 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFLV 205
T+ + ++ T L E + Y +G Y H+D F D + Y + R+A+ +
Sbjct: 386 TMGRMLRDVSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIF 445
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL+D+E+GG T FPF N V+P+ G+ L +Y+L + +D + H
Sbjct: 446 YLSDVEQGGATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDMDFRTKH 490
Query: 266 GSCPVVKGEKWVATKWIRDQEQ 287
CPV+KG KW+A WI + Q
Sbjct: 491 AGCPVLKGSKWIANIWIHEATQ 512
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 102/217 (47%), Gaps = 33/217 (15%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFIS 139
+++S P + + +F T + + I ++A LR S +A + + T R S ++
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVAAGEKQA---TADYRISKSAWLK 369
Query: 140 -AAEDESGTLDLIEEKIAKVTMLPRIN-----GEAFNILRYKIGQKYNSHYD-AFDPQE- 191
+A+ G LD ++++L +N GE ++ Y IG Y H+D A P
Sbjct: 370 GSAQSIVGKLD------QRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSP 423
Query: 192 -YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ + RVA+F++YL+ +E GG T F + N V + + +
Sbjct: 424 VFKLKTGNRVATFMIYLSPVEAGGSTAFIYAN---------------FSVPVVEKAAIFW 468
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
++L NG D ++H CPV+ G+KWVA KWI + Q
Sbjct: 469 WNLHRNGEGDDDTLHAGCPVLIGDKWVANKWIHEYGQ 505
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
Length = 542
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 98/213 (46%), Gaps = 19/213 (8%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPS-TLALRKGETVDNTQGIRTSSG 135
I ++LS P L + + ++ I N +K ++ PS T +T RTS
Sbjct: 323 IKTEILSVDPFVLLLHDMISQKESTLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKS 382
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
V+ S+ D + T I E++ T L E + ++ Y +G + +H D ++
Sbjct: 383 VWYSS--DFNDTTKKITERLGDATGLDTNFTEFYQVINYGLGGFFETHLDMLLSEKNRFN 440
Query: 196 KSQ-RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
++ R+A+ L YL ++ +GG T FP I L V P+ G L +Y+L
Sbjct: 441 GTRDRIATTLFYLNEVRQGGGTYFPR---------------INLTVFPQPGSALFWYNLD 485
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
NG S+H CPV+ G KWV +KWI D Q
Sbjct: 486 TNGNDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 518
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 97/215 (45%), Gaps = 22/215 (10%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGV 136
I ++L + P A+ F N + + I +A L+ +T+ K +++ R S
Sbjct: 316 IKVEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHAT-YRISKSA 374
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK 196
++ D +D + +I T L + E + Y +G Y+ H+D +E K
Sbjct: 375 WLKG--DLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFK 432
Query: 197 S----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYS 252
+ R+A+ L Y++ E GG T+F +G V P + D L +Y+
Sbjct: 433 TLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDALFWYN 477
Query: 253 LLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
L +G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 478 LRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQ 512
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 45 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 102
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 103 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 161
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 162 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 206
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 207 STRHAACPVLVGNKWVSNKWLHERGQ 232
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 359
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 360 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTG 419
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 420 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 464
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 465 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 494
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 430
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 431 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 490
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 491 LATFLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEG 535
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 536 DYRTRHAACPVLVGCKWVSNKWFHERGQ 563
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S ED
Sbjct: 255 PRIIRFHDIISDAENEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 312
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 197
+ I +I +T L E + Y +G +Y H+D AF +E G
Sbjct: 313 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF--RELG--TG 367
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
R+A++L Y++D+ GG T+FP +G V P++G + +Y+L +G
Sbjct: 368 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 412
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 413 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 442
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + + + + +AK LR +T++ V T R S +++ +D
Sbjct: 416 PYIVRYLDIISDAEIERVKQLAKPRLRRATIS-NPITGVLETASYRISKSAWLTEYDDP- 473
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
++ I ++I VT L E + Y +G +Y H+D E K R+A
Sbjct: 474 -MIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIA 532
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 533 TWLFYMSDVSAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFASGEGDY 577
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 578 STRHAACPVLVGNKWVSNKWIHERGQ 603
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 420
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 421 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 480
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 481 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 525
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 526 DYRTRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 420
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 421 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 480
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 481 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 525
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 526 DYRTRHAACPVLVGCKWVSNKWFHERGQ 553
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 102/210 (48%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + A+ ++ +++ +AK LR +T+ + + Q R S ++ + E
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKLTTAQ-YRVSKSAWLGS--HEH 380
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQKS 197
+D I ++I +T L E + Y +G +Y H+D AF+ G
Sbjct: 381 PIVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTG---- 436
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
R+A++L+Y++D++ GG T+F IG V P++G + +Y+L +G
Sbjct: 437 NRIATWLLYMSDVQAGGNTVFTD---------------IGAVVWPKKGTAVFWYNLHRSG 481
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 482 EGDYRTRHAACPVLVGNKWVSNKWIHERGQ 511
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 99/206 (48%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 450
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 495
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 DYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 359
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 360 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNR 419
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 420 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 464
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 465 DYRTRHAACPVLVGCKWVSNKWFHERGQ 492
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 91/203 (44%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P F + P + + MA + ST+ G + + R S ++ A E
Sbjct: 331 PYVATFHDMLNPRKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 388
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 447
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|219113719|ref|XP_002186443.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583293|gb|ACI65913.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 230
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 107/239 (44%), Gaps = 44/239 (18%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAK-LNLRPSTLALRKG------ETVDNTQ 128
N+ +VLS PRA NF + ++ + I+ +A ++L+ S+ G E +++
Sbjct: 2 NLTLKVLSCAPRAFEIENFLSRQEVEHIVQLASGVDLKLSSTGDITGHKETPKELQTDSR 61
Query: 129 GIRTSSGVFISAAEDESGTLDLIEEKIAKVTML-------------------PRINGEAF 169
RTS ++ ++S +D I + A V + + E
Sbjct: 62 RTRTSYNSWV--PREKSPIIDAIYRRAADVMRIDEALLRHRSDHTEWTNLTSTKPLAEQL 119
Query: 170 NILRYKIGQKYNSHYD----AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN 225
++ Y GQ+Y +H+D D Q G R + L+YL + GGET FP N
Sbjct: 120 QLVHYGPGQEYTAHHDFGFSRIDDQFQGA----RFGTLLLYLNEGMTGGETSFP--RWSN 173
Query: 226 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
A+ ++ L +KP G +LFYS LP+G +D S H + PV GEKW+ W D
Sbjct: 174 AETFHE------LSIKPEVGKAVLFYSQLPDGNLDDLSHHAAKPVTDGEKWLINLWTWD 226
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 448
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 449 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 508
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 509 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 553
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 554 DYRTRHAACPVLVGCKWVSNKWFHERGQ 581
>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
Length = 538
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 131
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 373
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 189
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 374 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 431
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 432 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 476
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
Length = 538
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 131
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 373
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 189
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 374 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 431
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 432 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 476
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 80/159 (50%), Gaps = 23/159 (14%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD- 188
+RTS +I +E + E++ +T + F+++ Y +G Y HYD +
Sbjct: 318 VRTSKDSYIVDSES-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEY 370
Query: 189 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 248
P++ R+A+ L YL +++ GG T+FP I + V P++G +
Sbjct: 371 TNTTRPKQGDRIATVLFYLGEVDSGGATIFP---------------KINIAVTPKKGSAV 415
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L +G ++ S+H +CPV+ G K+V TKWI + Q
Sbjct: 416 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 454
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 261 PYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 318
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 319 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 377
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 378 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 422
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 423 HAGCPVLKGSKWIGNVWIHEVTQ 445
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
Length = 538
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 131
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 373
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 189
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 374 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 431
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 432 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 476
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 477 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 80 PYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 137
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 138 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 196
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 197 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 241
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 242 HAGCPVLKGSKWIGNVWIHEVTQ 264
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 98/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + ++AK LR +T++ ++ T R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLE-TVHYRISKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ + +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
Length = 536
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 131
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 319 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 371
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 189
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 372 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 429
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 430 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 474
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 475 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 512
>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
Length = 535
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVD-----NTQGIR 131
I ++LS P + + + ++ I + +K + PS ETV+ R
Sbjct: 318 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPS-------ETVNAANEFEIAKFR 370
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDA--FDP 189
TS V+ + +E+ TL L + ++ + T L + E F ++ Y IG + SH+D D
Sbjct: 371 TSKSVWFDSDANEA-TLKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADE 428
Query: 190 QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L+
Sbjct: 429 DRFVNGYIDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTVLM 473
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 474 WYNLHTEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 511
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 289 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 346
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 347 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 405
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 406 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 450
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 451 STRHAACPVLVGNKWVSNKWLHERGQ 476
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 303 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 360
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 361 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 419
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 420 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 464
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 465 STRHAACPVLVGNKWVSNKWLHERGQ 490
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 180 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 237
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 238 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 296
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 297 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 341
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 342 STRHAACPVLVGNKWVSNKWLHERGQ 367
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 100/227 (44%), Gaps = 23/227 (10%)
Query: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122
W+ H ++ + LS P YF + + ++ + II K + S +
Sbjct: 323 WLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEIGQTGNS 382
Query: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 182
TV IRTS ++ + + L I++++ +T L E ++ Y IG +Y
Sbjct: 383 TVSE---IRTSQNTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEP 437
Query: 183 HYDAFDPQE--YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKV 240
H+D D E +G K R+ + L YL D+ GG T FPF + L V
Sbjct: 438 HFDFMDDAEKNFG-WKGNRLLTALFYLNDVPLGGATAFPF---------------LHLAV 481
Query: 241 KPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
P +G L++Y+L + D + H CPV+KG KW+ +W + Q
Sbjct: 482 PPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKWICNEWFHEAAQ 528
>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
Length = 525
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 106/232 (45%), Gaps = 36/232 (15%)
Query: 77 IPF-----QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIR 131
+PF ++L+ P + F + + + K++ A L + +A + + + R
Sbjct: 302 LPFVRYKEEMLNRKPHIVLFHDVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKR 361
Query: 132 TSSGVFISAAEDESGTLDLIEEKIAKVTMLPR--ING----EAFNILRYKIGQKYNSHYD 185
S ++ + + T+ + ++A +T L ++G E F IL Y IG +Y H D
Sbjct: 362 ISQVSWL-WDDHANKTIHQLSRRVADITGLQTGVVSGLHSAEPFQILNYGIGGQYEPHVD 420
Query: 186 AFDPQ-------EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGL 238
F E+ R+A+F+ YL D+ GG T+FP + +
Sbjct: 421 YFAGNHSHSSLPEHVRASGNRLATFMFYLNDVHAGGATVFP---------------KLKV 465
Query: 239 KVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD--QEQY 288
+ P + +Y++ NG +DP + H CPV+ G+KWVA KWI + Q+QY
Sbjct: 466 GIPPTKNGAAFWYNIGLNGDVDPLTEHAGCPVLLGQKWVANKWIHEHGQDQY 517
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|359401514|ref|ZP_09194482.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
gi|357597189|gb|EHJ58939.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
Length = 232
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 108/243 (44%), Gaps = 41/243 (16%)
Query: 47 ARPSARVVESVKDEYKWMPHGQAGDDSVTNIPFQVLSWMPRALYFPN-FATPEQCKSIIN 105
R AR+ SV++ +G +V +P + + +Y N F C +I
Sbjct: 20 CRALARIGASVRERL-------SGQPTVRRLPVEAVE-----IYTSNSFLNEADCAHLIA 67
Query: 106 MAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRIN 165
+ PS L +G G RTS I + L+L +++ T +
Sbjct: 68 LIDTCACPSRLLEEEG-----WDGYRTSYSGDIDTHDRIVRDLEL---RLSDFTGIAPSC 119
Query: 166 GEAFNILRYKIGQKYNSHYDAFDPQE-YGPQK----SQRVASFLVYLTDLEEGGETMFPF 220
GE+ RY+ GQ +N H D FD + Y Q+ QR + ++YL +EEGG T F
Sbjct: 120 GESAQGQRYECGQYFNEHCDWFDTEAGYWRQERRCGGQRSWTAMIYLNAVEEGGRTDFTH 179
Query: 221 ENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATK 280
IGL + P G LL+ + LP+GT +P ++H + PVV+G K+V TK
Sbjct: 180 ---------------IGLSIPPEPGCLLLWNNALPDGTPNPLTMHAARPVVRGVKYVVTK 224
Query: 281 WIR 283
W R
Sbjct: 225 WFR 227
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 19/203 (9%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P F + +P + + MA + ST+ G + + R S ++ A E
Sbjct: 331 PYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKS-AFRVSKNAWL-AYESHP 388
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFL 204
+ ++ + + T L E + Y +G Y H+D F DP Y ++ R+A+ +
Sbjct: 389 TMVGMLRD-LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAI 447
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
YL+++E+GG T FPF + + VKP+ G+ L +Y+L + D +
Sbjct: 448 FYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTK 492
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H CPV+KG KW+ WI + Q
Sbjct: 493 HAGCPVLKGSKWIGNVWIHEVTQ 515
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|313768324|ref|YP_004062004.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
gi|312599020|gb|ADQ91044.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
Length = 219
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 95/198 (47%), Gaps = 29/198 (14%)
Query: 87 RALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESG 146
R F +F TP++ K I+ MA L+PST++ + + N + +R S ++ E
Sbjct: 45 RPRVFHDFITPQERKHIMEMASKELKPSTVS---TDRILN-ESVRKSETAWLGR---EDP 97
Query: 147 TLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVY 206
+D + + K P N E +LRYK G YN H D+ + + R+ +F++
Sbjct: 98 VVDAVIHRCLKYIDRPIKNCEKLQVLRYKPGGYYNPHQDSINDG-----SNPRLYTFILA 152
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT-SIH 265
L D EGGET FP +G + K GD L F+ +L N ++ + ++H
Sbjct: 153 LNDEYEGGETEFP---------------KLGNEYKLTAGDAL-FFDILDNYELETSKALH 196
Query: 266 GSCPVVKGEKWVATKWIR 283
G PV GEKW+ W+R
Sbjct: 197 GGKPVKSGEKWICNLWVR 214
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 389
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 390 EDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 449
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 450 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 494
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 495 DYRTRHAACPVLVGCKWVSNKWFHERGQ 522
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + I + + L +T+ V N +RTS FI +
Sbjct: 333 PLLVLYHDVIYQSEIDVIRKLTENRLMRATITSHNESVVSN---VRTSQFTFIPVTAHK- 388
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQKSQR 199
L I++++A +T L E Y IG Y H D F + P+ R
Sbjct: 389 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNR 447
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+ L YL+D+ +GG T FP + +KP++ +++L +G
Sbjct: 448 IAAVLFYLSDVAQGGGTAFP---------------QLRTLLKPKKYAAAFWHNLHASGVG 492
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 493 DVRTQHGACPIIAGSKWVQNRWIRENDQSD 522
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 103 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 160
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 161 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 219
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 220 TWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFASGEGDY 264
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 265 STRHAACPVLVGNKWVSNKWLHERGQ 290
>gi|341614920|ref|ZP_08701789.1| hypothetical protein CJLT1_08183 [Citromicrobium sp. JLT1363]
Length = 210
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 90/197 (45%), Gaps = 28/197 (14%)
Query: 94 FATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEE 153
F E C +I M + RPSTLA D + RTS + + S LD +
Sbjct: 33 FLDAEFCAELITMIEAKRRPSTLA-----DFDGDEFFRTSETCDLPMDDPRSQRLDAM-- 85
Query: 154 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYLT 208
+A ++ + GE RY +GQ++ +H D F+P ++Y QR +F++YL
Sbjct: 86 -LADLSGIDPAYGEPLQGQRYAVGQEFKAHCDYFNPDGQDWEKYCSVAGQRTWTFMIYLN 144
Query: 209 DLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSC 268
+ E GG T F K I +P G + + + P+ +++P ++H
Sbjct: 145 EPEAGGVTRF---------------KTIKKSFQPETGKLICWNNRRPDQSVNPNTMHHGM 189
Query: 269 PVVKGEKWVATKWIRDQ 285
V KG K+V TKW R++
Sbjct: 190 KVRKGMKYVITKWYREK 206
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 197
ED+ + + ++ +T L E + Y +G +Y H+D E K
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|344276265|ref|XP_003409929.1| PREDICTED: transmembrane prolyl 4-hydroxylase [Loxodonta africana]
Length = 475
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 102 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 161
S+ + ++LR +R + ++ +R S ++ E + I +++ ++T L
Sbjct: 218 SLQEFSNMDLRDFHKYMRS-HKAETSELVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 276
Query: 162 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 204
P I E ++RY G Y++H D+ E GP + S R + L
Sbjct: 277 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESGPFETSCRYMTVL 336
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 252
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 337 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 396
Query: 253 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 282
LP+G +D S+HG C V +G KW+A WI
Sbjct: 397 YLPDGQGWVGDVDDYSLHGGCLVTRGTKWIANNWI 431
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 70/146 (47%), Gaps = 15/146 (10%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
++E + I ++ + L E ++ Y IG Y HYD + R+A
Sbjct: 8 DEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIA 67
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+FL YL+D+E GG T+F +G V P++GD +Y+L +G D
Sbjct: 68 TFLSYLSDVEAGGGTVFTR---------------VGATVWPQKGDAAFWYNLKRSGDGDS 112
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWVA KWI + Q
Sbjct: 113 STRHAACPVLVGSKWVANKWIHEVGQ 138
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + I + + L +T+ V N +RTS FI +
Sbjct: 333 PLLVLYHDVIYQSEIDVIRKLTENRLMRATITSHNESVVSN---VRTSQFTFIPVTAHK- 388
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQKSQR 199
L I++++A +T L E Y IG Y H D F + P+ R
Sbjct: 389 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNR 447
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+ L YL+D+ +GG T FP + +KP++ +++L +G
Sbjct: 448 IATVLFYLSDVAQGGGTAFP---------------QLRTLLKPKKYAAAFWHNLHASGVG 492
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 493 DVRTQHGACPIIAGSKWVQNRWIRENDQSD 522
>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
Length = 538
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 95/217 (43%), Gaps = 33/217 (15%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFI- 138
+++S P + F F T + K+I A LR S +A + T R S ++
Sbjct: 334 EIISLQPYVVLFHGFVTQAEAKNIRKYAMPGLRRSVVASGMNQA---TAEYRISKSAWLK 390
Query: 139 SAAEDESGTLDLIEEKIAKVTMLPRIN-----GEAFNILRYKIGQKYNSHYDAFDPQE-- 191
+A + G LD ++T++ +N E ++ Y IG Y H+D
Sbjct: 391 ESAHEVVGKLD------QRITLVTGLNVQPPYAEYLQVVNYGIGGHYEPHFDHATSDSSP 444
Query: 192 -YGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
Y + RVA+ ++YL+ ++ GG T F + N V Q L +
Sbjct: 445 LYRLKTGNRVATIMIYLSPVQAGGSTAFIYAN---------------FSVPVVQNAALFW 489
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
++L NG + ++H CPV+ G KWVA KW+ + EQ
Sbjct: 490 WNLHKNGQGNVDTLHAGCPVIVGNKWVANKWVHEYEQ 526
>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
Length = 490
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 80/159 (50%), Gaps = 23/159 (14%)
Query: 130 IRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD- 188
+RTS +I A+ + E++ +T + F+++ Y +G Y HYD +
Sbjct: 336 VRTSKDSYIVDAKS-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFHEY 388
Query: 189 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 248
P++ R+A+ L YL +++ GG T+FP I + V P++G +
Sbjct: 389 TNTTRPKQGDRIATVLFYLGEVDSGGATIFP---------------KINIAVTPKKGSAV 433
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L +G ++ S+H +CPV+ G K+V TKWI + Q
Sbjct: 434 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 472
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 96/206 (46%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + +AK L +T+ + + Q R S ++S ED
Sbjct: 327 PRIIRFHDIISDAEIEIVKYLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYEDP- 384
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E + R+A
Sbjct: 385 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIA 443
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 444 TWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 488
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 489 STRHAACPVLVGNKWVSNKWLHERGQ 514
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 98/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
E++ + + ++ ++T L E + Y +G +Y H+D E K
Sbjct: 389 EEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHERDAFKRLGTG 448
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 NRVATFLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSG 493
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 97/210 (46%), Gaps = 30/210 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQK----S 197
ED+ + + ++ +T L E + Y +G +Y H+D + K
Sbjct: 391 EDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTG 450
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSG 495
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 EGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 73/140 (52%), Gaps = 20/140 (14%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF---DPQEYGPQKSQRVASFLVYL 207
IE++I +T L E F ++ Y IG Y HYD + +P + + +R+ + L YL
Sbjct: 324 IEKRIKDMTGLSMDLSEDFMLINYGIGGTYKMHYDFYVYSEPLRF--LRGERIVTVLFYL 381
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
D+E G T+FPF N + + P++G +++Y+L +G + + H +
Sbjct: 382 GDVELSGSTVFPFLN---------------ISITPKKGSAVMWYNLHNSGDVHQKTQHCA 426
Query: 268 CPVVKGEKWVATKWIRDQEQ 287
CPVV G K+V TKWI + Q
Sbjct: 427 CPVVVGSKYVLTKWINELHQ 446
>gi|397644356|gb|EJK76358.1| hypothetical protein THAOC_01879, partial [Thalassiosira oceanica]
Length = 539
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 96/207 (46%), Gaps = 29/207 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKL-NLRPST---LALRKGETVDNTQGIRTSSGVFISAA 141
P + F NF T E+ ++ +L ST A GE RTSS +
Sbjct: 336 PWVVVFDNFLTDEEVADLVKGGELEGYERSTDQGAANAYGEQEKVVSRTRTSSNAWCMHK 395
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVA 201
+ + +KI VT +P++N E+F +L+Y GQ Y SH+D+ + R+
Sbjct: 396 CERLPGVRSASKKIEAVTGIPQVNYESFQLLKYDGGQFYRSHHDSSSVDD--SPAGHRIL 453
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
+F +YL+D+EEGGET F +G+ VKP++G L++ S+L DP
Sbjct: 454 TFFLYLSDVEEGGETYF---------------SKLGIAVKPKKGRALVWPSVLDE---DP 495
Query: 262 T-----SIHGSCPVVKGEKWVATKWIR 283
T H + V+KGEK A WI
Sbjct: 496 TYWDKRMYHEAKDVIKGEKKAANHWIH 522
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 63/121 (52%), Gaps = 16/121 (13%)
Query: 167 EAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNA 226
E I Y +G Y H+D F + R+AS L YL+D+ +GG+T+F ++A
Sbjct: 5 EFLQIANYGLGGHYLPHFD-FTRDVATHKNGNRIASMLFYLSDVAKGGDTVF-----IDA 58
Query: 227 DGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
G K+KP +G + +Y+L NG +D + H SCPV+ G KWVA W+ +
Sbjct: 59 ----------GAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKWVANMWMHEHG 108
Query: 287 Q 287
Q
Sbjct: 109 Q 109
>gi|397615311|gb|EJK63351.1| hypothetical protein THAOC_15991 [Thalassiosira oceanica]
Length = 463
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 69/122 (56%), Gaps = 10/122 (8%)
Query: 166 GEAFNILRYKIGQKYNSHYD-AFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGM 224
EA ++ Y++GQ+Y +H+D + P + Q + R A+ L+YL + GGET FP
Sbjct: 349 AEALQLVHYEVGQEYTAHHDFGYAPFDRKDQPA-RFATLLLYLNEGMVGGETQFP--RWA 405
Query: 225 NADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
NA+ GL V+P+ G +LFYS LP+G +D S H + PV GEKW+ W+ D
Sbjct: 406 NAETR------AGLDVEPKIGKAVLFYSQLPDGNMDDLSQHAARPVKIGEKWLMNLWVWD 459
Query: 285 QE 286
E
Sbjct: 460 PE 461
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 93/203 (45%), Gaps = 24/203 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL-ALRKGETVDNTQGIRTSSGVFISAAEDE 144
P + + + E+ + +A+ L+ S + +L E + + R S G F E
Sbjct: 331 PYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHI--STNFRISQGTFFEY--HE 386
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQ---KSQRV 200
+ + + + ++ L + E + Y IG Y H D+F + YG + RV
Sbjct: 387 HPIMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRV 446
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ + YL+++E GG T FPF + L V+P +G L +Y+L +G +D
Sbjct: 447 ATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGSLLFWYNLHRSGDLD 491
Query: 261 PTSIHGSCPVVKGEKWVATKWIR 283
+ H CPV+ G KW+A WIR
Sbjct: 492 YRTKHAGCPVLMGSKWIANVWIR 514
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 342
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 343 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 402
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 403 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 447
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 448 DYRTRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|167524906|ref|XP_001746788.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774568|gb|EDQ88195.1| predicted protein [Monosiga brevicollis MX1]
Length = 321
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 68/133 (51%), Gaps = 16/133 (12%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFLVYLTDL 210
+EE+I K+ LP +N E F +LRY Q Y H D D ++Y RV + +YL D+
Sbjct: 191 LEERIGKLVGLPVVNQEHFQVLRYNNNQYYRVHNDLID-EQYDMPCGPRVLTLFIYLNDV 249
Query: 211 EEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPV 270
GGET F +GL VKP++G +L+YS+ + + + H + PV
Sbjct: 250 PAGGETSF---------------TRLGLAVKPKKGKAVLWYSVTNDLEPEERTDHEARPV 294
Query: 271 VKGEKWVATKWIR 283
+G K+ A KWI
Sbjct: 295 KQGTKYAANKWIH 307
>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
'Balearic Sea AD45']
Length = 263
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 94/200 (47%), Gaps = 27/200 (13%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 150
+ +F + ++C I+ + K L PS LA + D+ IRTSS ++ ++ +
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA--GAASADD---IRTSSTCELAFLGNK--LVKD 137
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLV 205
++ +I L GE Y +G+ Y HYD F P + + + QR + ++
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQYKAHCLSRGQRTWTCMI 197
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D +GG T F + + VKP++G L + +LLP+G + SIH
Sbjct: 198 YLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNSIH 242
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
+ PV +G K V TKW R +
Sbjct: 243 FAEPVTRGHKTVITKWFRTK 262
>gi|149185530|ref|ZP_01863846.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
gi|148830750|gb|EDL49185.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
Length = 241
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 84/198 (42%), Gaps = 28/198 (14%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 150
P F P +C +I + + NL PS L + G RTS + SA E E L
Sbjct: 53 MPGFLAPAECTRLIELIESNLLPSPLF-----SDPTGTGARTSQTHYFSAEEPEVAALGA 107
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ-----KSQRVASFLV 205
K+ + L R E RY +GQ+Y H D F + Q QR S +V
Sbjct: 108 ---KLDDLLGLERRQAETVQGQRYDVGQEYRHHRDFFRVEREHWQLERRRGGQRTWSAMV 164
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL +E GGET FP + L V P G + + ++ +G +P ++H
Sbjct: 165 YLNAIEAGGETDFP---------------VLDLSVAPEPGLLIAWNNVDRHGHPNPATLH 209
Query: 266 GSCPVVKGEKWVATKWIR 283
PV G K+V T+W R
Sbjct: 210 AGMPVEAGRKYVVTQWYR 227
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 100/216 (46%), Gaps = 24/216 (11%)
Query: 77 IPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSG 135
I ++L + P + F + + + I +A L+ +T+ R G+ R S
Sbjct: 316 IKVEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDL--EYANYRISKS 373
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
++ + + +D I ++I +T L + E Y IG Y+ H+D ++
Sbjct: 374 AWLKGTDHPA--IDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFDFARKEDINAF 431
Query: 196 KS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
K+ R+A+ L+Y++D+E GG T+F +G V P + D L +Y
Sbjct: 432 KTLNTGNRIATILIYMSDVESGGATVFNH---------------LGNAVFPSKYDALFWY 476
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L +G D + H +CPV+ G KWV+ KWI D+ Q
Sbjct: 477 NLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQ 512
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 342
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 343 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 402
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 403 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 447
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 448 DYRTRHAACPVLVGCKWVSNKWFHERGQ 475
>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
'English Channel 673']
Length = 263
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 94/200 (47%), Gaps = 27/200 (13%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 150
+ +F + ++C I+ + K L PS LA + D+ IRTSS ++ ++ +
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA--GAASADD---IRTSSTCELAFLGNK--LVKD 137
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLV 205
++ +I L GE Y +G+ Y HYD F P + + + QR + ++
Sbjct: 138 VDNRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQYKAHCLSRGQRTWTCMI 197
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D +GG T F + + VKP++G L + +LLP+G + SIH
Sbjct: 198 YLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNSIH 242
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
+ PV +G K V TKW R +
Sbjct: 243 FAEPVTRGHKTVITKWFRTK 262
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 101/216 (46%), Gaps = 24/216 (11%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF+V L + P A+ F + + E+ K I +A L+ +T+ K ++ T R S
Sbjct: 319 PFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELE-TAAYRISKS 377
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
++ + E +D I +I +T L + E I Y +G Y+ H+D +E
Sbjct: 378 AWLKGGDHE--LIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAF 435
Query: 196 KS----QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFY 251
+S R+A+ L YLT+ E GG T+F + V P + L +Y
Sbjct: 436 ESLGTGNRLATVLFYLTEPEIGGGTVF---------------TELRTAVMPSKNGALFWY 480
Query: 252 SLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+L +G D + H +CPV+ G KWVA KWI ++ Q
Sbjct: 481 NLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQ 516
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 64/119 (53%), Gaps = 18/119 (15%)
Query: 171 ILRYKIGQKYNSHYDAFDP--QEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADG 228
+L Y +Y +H D F P EY Q+ R+A+ L YL D+E+GG+T+FP
Sbjct: 336 VLNYATAAQYLTHSDYFGPAYSEY-IQRGDRIATVLFYLNDVEQGGKTVFP--------- 385
Query: 229 SYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+G+ P +G ++FY+L + DP + HG CPV+ G KW ATKWI EQ
Sbjct: 386 ------RLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWAATKWIYSAEQ 438
>gi|90023340|ref|YP_529167.1| response regulator receiver domain-containing protein
[Saccharophagus degradans 2-40]
gi|89952940|gb|ABD82955.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 269
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 102/209 (48%), Gaps = 26/209 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P +F T + II A ++ + ++ K E +++ RT S ++ A D +
Sbjct: 63 PSVTICEDFLTQAEVFQIIKAAGDKMQRARVSSGK-EGIESAG--RTGSNCWV--AHDHN 117
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYG----PQKSQRV 200
+ ++I+K+ + N E+F ++ Y + Q+Y+SH+DA++ E G + QR+
Sbjct: 118 KVTHALAKRISKLVGISLQNAESFQVIHYGVSQEYSSHFDAWEFNTERGERCMARGGQRL 177
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI- 259
+ L+YL D+ GG T FP + L+V+ ++G ++F++ P
Sbjct: 178 VTCLIYLNDVPAGGGTGFPE---------------LDLEVQAKKGRMVIFHNCYPGTNYR 222
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
P S+HG PV +GEKW W R+ + +
Sbjct: 223 HPHSLHGGLPVEEGEKWAVNLWFREADYW 251
>gi|381173085|ref|ZP_09882194.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380686458|emb|CCG38681.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 418
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRGA--------- 277
Query: 146 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 193
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 278 -TLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAGGDTEFPI---------------AGVRVRPRPGTLVCFDNL 381
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+G D S+H PV G KW+ T W R Q
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 414
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 390
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 391 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 450
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 451 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 495
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 496 DYRTRHAACPVLVGCKWVSNKWFHERGQ 523
>gi|94495931|ref|ZP_01302510.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
gi|94424623|gb|EAT09645.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
Length = 229
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 93/198 (46%), Gaps = 28/198 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
+F +P++C + + N +PSTL D RTS +S + ++ I
Sbjct: 52 DFLSPDECAELRRLIDANAQPSTL-FSGSANAD----YRTSHSGNLSPRDP---LVERIT 103
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
++I +T LP INGE RY GQ+Y H D F Q QR + ++YL
Sbjct: 104 QRICALTGLPAINGETLQGQRYTPGQEYKVHCDYFPATADYWQRMRGTGGQRTWTAMIYL 163
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
+ +E GGET FP +C V P +G L++ ++ +G + S+H +
Sbjct: 164 SAVEAGGETHFP--------------QC-EFMVPPVEGMILIWNNMDRDGAPNRFSLHAA 208
Query: 268 CPVVKGEKWVATKWIRDQ 285
PV +G K+V TKW R++
Sbjct: 209 LPVERGTKYVVTKWFRER 226
>gi|431913403|gb|ELK15078.1| Protein ariadne-2 like protein [Pteropus alecto]
Length = 843
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 96/212 (45%), Gaps = 38/212 (17%)
Query: 102 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 161
S+ + ++LR +R+ ++++ +R S ++ E + I +++ ++T L
Sbjct: 595 SLQEFSNMDLRDFHKYMRR-HKAESSELVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 653
Query: 162 -PRIN--GEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASF-----------LVYL 207
P I E ++RY G Y++H D+ GP + + S L YL
Sbjct: 654 SPEIVELSEPLQVVRYGEGGHYHAHVDS------GPVYPETICSHTKLVANDYMTVLFYL 707
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYSLLP 255
++ GGET+FP + D Q + L+VKPRQG + +Y+ LP
Sbjct: 708 NNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYNYLP 767
Query: 256 NGT-----IDPTSIHGSCPVVKGEKWVATKWI 282
+G +D S+HG C V +G KW+A WI
Sbjct: 768 DGQGWVGDVDDYSLHGGCLVTRGTKWIANNWI 799
>gi|418521653|ref|ZP_13087695.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702188|gb|EKQ60697.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 418
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRGA--------- 277
Query: 146 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 193
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 278 -TLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAGGDTEFPI---------------AGVRVRPRPGTLVCFDNL 381
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+G D S+H PV G KW+ T W R Q
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 414
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 101/208 (48%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
ED+ + + ++ +T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|77748579|ref|NP_641686.2| hypothetical protein XAC1351 [Xanthomonas axonopodis pv. citri str.
306]
Length = 418
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRGA--------- 277
Query: 146 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 193
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 278 -TLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAGGDTEFPI---------------AGVRVRPRPGTLVCFDNL 381
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+G D S+H PV G KW+ T W R Q
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 414
>gi|426249581|ref|XP_004018528.1| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
[Ovis aries]
Length = 484
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 98/219 (44%), Gaps = 35/219 (15%)
Query: 102 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 161
S+ + ++LR +R +++Q +R S ++ E + I +++ ++T L
Sbjct: 245 SLQEFSNMDLRDFHKYMRS-HRAESSQLVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 303
Query: 162 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 204
P I E ++RY G Y++H D+ E P + S R + L
Sbjct: 304 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESVPFETSCRYMTVL 363
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 252
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 364 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 423
Query: 253 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWIRDQE 286
LP+G +D S+HG C V +G KW+A WI +E
Sbjct: 424 YLPDGQGWVGDVDDYSLHGGCLVTRGTKWIANNWINARE 462
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 100/220 (45%), Gaps = 28/220 (12%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
P +V L+ P + + + + I N+ + + +T+ KG V +RTS
Sbjct: 320 PLKVEELNHNPLLVLYHDVIYQSEIDVIRNLTENEISRATVIGAKGSEVSK---VRTSQF 376
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSH-----YDAFDPQ 190
FI + L I++++A ++ L E Y IG Y H DAFD +
Sbjct: 377 TFIPKTRHK--VLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFGQDAFDNE 434
Query: 191 EYG-PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
P+ R+A+ L YL+D+ +GG T FP + ++P++
Sbjct: 435 LVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQL---------------LQPKKYAAAF 479
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+++L +G D ++HG+CP++ G KWV +WIR+ Q D
Sbjct: 480 WHNLHASGVGDLRTLHGACPIIAGSKWVQNRWIREFIQAD 519
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 99/213 (46%), Gaps = 23/213 (10%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF+V L+ P YF + + II ++ S + + T T IRTS+
Sbjct: 329 PFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQSQNAT---TSEIRTSAN 385
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-PQEYGP 194
++ +E+ L I++++ +T L + E ++ Y IG +Y H+D + PQ+
Sbjct: 386 TWLWY--NENPWLSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEPHFDFVEEPQKVFG 443
Query: 195 QKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLL 254
K R+ + L Y+ D+ GG T FPF + L V P +G L++Y+L
Sbjct: 444 WKGNRMLTALFYINDVALGGATAFPF---------------LQLAVPPVKGSLLVWYNLH 488
Query: 255 PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ D + H CPV+KG KW+ +W + Q
Sbjct: 489 RSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQ 521
>gi|325929527|ref|ZP_08190641.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
gi|325540037|gb|EGD11665.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
Length = 418
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 89/216 (41%), Gaps = 38/216 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPIRTSHGA--------- 277
Query: 146 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 193
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 278 -TLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 336
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
P R + VYL D+ GET FP G++V+PR G + F +L
Sbjct: 337 PTAGNRQRTVCVYLNDVGAAGETEFPV---------------AGVRVRPRPGTLVCFDNL 381
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
+G D S+H PV G KW+ T W R Q D
Sbjct: 382 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQRYRD 417
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 94/207 (45%), Gaps = 24/207 (11%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTL-ALRKGETVDNTQGIRTSSGVFISAAEDE 144
P + + + E+ + +A+ L+ S + +L E + + R S G F E
Sbjct: 33 PYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHI--STNFRISQGTFFEYHEHP 90
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF-DPQEYGPQ---KSQRV 200
+ + + + ++ L + E + Y IG Y H D+F + YG + RV
Sbjct: 91 --IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRV 148
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A+ + YL+++E GG T FPF + L V+P +G L +Y+L +G +D
Sbjct: 149 ATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGSLLFWYNLHRSGDLD 193
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H CPV+ G KW+A WIR Q
Sbjct: 194 YRTKHAGCPVLMGSKWIANVWIRLSNQ 220
>gi|410447164|ref|ZP_11301266.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
gi|409980151|gb|EKO36903.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
Length = 214
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 99/214 (46%), Gaps = 31/214 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P NF + +C + IN A+ L+ ST+ G + G RTS +I D +
Sbjct: 21 PIVYLVKNFLSDLECDAFINEAEGRLQDSTVI---GANDEIKLGARTSQNCWIE--HDAN 75
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS-----QRV 200
+ + ++++ + +P N E + + Y+ ++Y +D+FD +K+ QR+
Sbjct: 76 ELVHEVSKRLSILAQIPIRNAEQYQLACYEKDEEYKPRFDSFDFDTLEGKKNWEPGGQRM 135
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT-- 258
+ +VYL D++ GG T FP +G + P++GD ++ + + +
Sbjct: 136 LTIIVYLNDVQSGGGTDFP---------------KLGFTIPPKKGDVVVLNNTCDDDSQN 180
Query: 259 ----IDPTSIHGSCPVVKGEKWVATKWIRDQEQY 288
I P S+H PV+ G+KW+ T W R +Y
Sbjct: 181 GHPNIHPNSLHAGMPVLSGKKWIVTLWFRQNLRY 214
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 22/206 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + F + + + + + ++AK L +T+ + + Q R S ++S E+
Sbjct: 335 PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQ-YRVSKSAWLSGYENP- 392
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVA 201
+ I +I +T L E + Y +G +Y H+D E K R+A
Sbjct: 393 -VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIA 451
Query: 202 SFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDP 261
++L Y++D+ GG T+FP +G V P++G + +Y+L +G D
Sbjct: 452 TWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGEGDY 496
Query: 262 TSIHGSCPVVKGEKWVATKWIRDQEQ 287
++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 497 STRHAACPVLVGNKWVSNKWLHERGQ 522
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 101/217 (46%), Gaps = 27/217 (12%)
Query: 78 PFQV--LSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
PF+V LS P + + + + + + A ++ S + E NT RTS
Sbjct: 249 PFKVEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVYTYSDEDSKNTG--RTSMS 306
Query: 136 VFISAAEDESGTLDLIEEKIAKVT---MLPRINGEAFNILRYKIGQKYNSHYDAFDP--Q 190
F + + + + ++ +T +L + + +L Y +Y +H D F P
Sbjct: 307 AF--QTDHQYKAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYS 364
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
EY Q+ R+A+ L YL D+E+GG+T+FP +G+ P +G ++F
Sbjct: 365 EY-IQRGDRIATVLFYLNDVEQGGKTVFP---------------RLGIFRSPMKGSAVVF 408
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y++ + DP + HG CPV+ G KW ATKWI EQ
Sbjct: 409 YNMNSSLQGDPRTEHGGCPVLVGTKWAATKWIYSAEQ 445
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 71/143 (49%), Gaps = 23/143 (16%)
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS----QRVASFLVY 206
+ +++ +T L E+ ++ Y IG Y H+D +E +S R+A+ L Y
Sbjct: 396 VSQRVEDITGLNMATAESLQVVNYGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATILFY 455
Query: 207 LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVK--PRQGDGLLFYSLLPNGTIDPTSI 264
++D+ +GG T+FP G+KV P++G +Y+L NG D +
Sbjct: 456 MSDVSQGGATVFP-----------------GIKVSLWPKKGTAAFWYNLRKNGEGDYLTR 498
Query: 265 HGSCPVVKGEKWVATKWIRDQEQ 287
H +CPV+ G KWV KWI ++ Q
Sbjct: 499 HAACPVLTGSKWVCNKWIHERGQ 521
>gi|393774561|ref|ZP_10362923.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
gi|392720044|gb|EIZ77547.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
Length = 210
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 98/198 (49%), Gaps = 28/198 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
+F + +C+ ++ + + RPST+A G+ RTSS +SA + + +
Sbjct: 33 DFLSAPECEELVALIEAEHRPSTIADFTGD-----DAFRTSSTCDLSA---QVPAVADLA 84
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
K+A+++ + + E RY++GQ++ +H D F+P Q Y QR +F+VYL
Sbjct: 85 AKLARLSGIDPAHAEPLQGQRYEVGQQFKAHTDTFEPGTADYQRYCSASGQRTWTFMVYL 144
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
+++ GG T F + I ++P +G + + + P+ +P ++H +
Sbjct: 145 NEVDAGGATRF---------------REIDKLIQPERGKLVAWNNRKPDRQPNPATLHHA 189
Query: 268 CPVVKGEKWVATKWIRDQ 285
V +G K+V T+W R++
Sbjct: 190 MKVRRGRKYVITQWYRER 207
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 68/127 (53%), Gaps = 18/127 (14%)
Query: 159 TMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY---GPQKSQRVASFLVYLTDLEEGGE 215
T+ R E + + Y IG Y H+D F+ + G + RVA+ ++Y++D+EEGG
Sbjct: 398 TLFSRDEAEKYQLANYGIGGHYVPHHDYFEEFQTPSKGNRFGNRVATLMIYMSDVEEGGA 457
Query: 216 TMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEK 275
T+FP +G++V P++GD + +++++ + + + H CPV+ G K
Sbjct: 458 TVFP---------------SLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLYGSK 502
Query: 276 WVATKWI 282
W+A KW
Sbjct: 503 WIANKWF 509
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 95/211 (45%), Gaps = 33/211 (15%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGI------RTSSGVFIS 139
PR + + + P + + I +A LR +T+ + GI RTS ++
Sbjct: 379 PRIVMWYDLIFPSEIEKIKELATPRLRRATVK-------NPVTGILEIAFYRTSKSAWLP 431
Query: 140 AAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQK 196
+ E D I ++I VT L E + Y +G Y H+D +E + +
Sbjct: 432 HSMSE--ITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHYAPHFDFGRKREKDAFEVKN 489
Query: 197 SQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPN 256
R+A+ + YL+D++ GG T+F IG +V P++G +++LLPN
Sbjct: 490 GNRIATIIFYLSDVQAGGATVF---------------NRIGTRVVPKKGAAGFWFNLLPN 534
Query: 257 GTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
G D + H +CPV+ G KWV W ++ Q
Sbjct: 535 GEGDLRTRHAACPVLAGSKWVMNLWFHERGQ 565
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 53/209 (25%), Positives = 100/209 (47%), Gaps = 30/209 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
PR + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRLARATVR-------DPKTGVLTVANYRVSKSAWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ---KSQ 198
E + + + ++ +T L + E + Y +G +Y H+D F + + +
Sbjct: 389 EYDDPVIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTEGN 447
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+A++L Y++D+E GG T+FP D+ G + PR+G + +Y+L +G
Sbjct: 448 RLATYLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFRSGE 492
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 493 GDYRTRHAACPVLVGSKWVSNKWFHERGQ 521
>gi|296284739|ref|ZP_06862737.1| hypothetical protein CbatJ_14013 [Citromicrobium bathyomarinum
JL354]
Length = 210
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 93/202 (46%), Gaps = 28/202 (13%)
Query: 89 LYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTL 148
L F P+ C +I M + RPSTLA G+ RTS + + + L
Sbjct: 28 LQMRQFLDPDFCGELIAMIDADRRPSTLADHDGDMY-----FRTSETCDLPMDDPRTQRL 82
Query: 149 DLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASF 203
+ + +A+++ + +GE RY +GQ++ +H D F+P ++Y QR +F
Sbjct: 83 EAM---LAELSGIDPRHGEPLQGQRYAVGQEFKAHCDYFNPDGQDWEKYCSVAGQRTWTF 139
Query: 204 LVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTS 263
++YL + E GG T F K + +P G + + + P+ + +P +
Sbjct: 140 MIYLNEPEAGGATRF---------------KVLKKSFQPETGKLVCWNNRRPDQSTNPNT 184
Query: 264 IHGSCPVVKGEKWVATKWIRDQ 285
+H V KG K+V TKW R++
Sbjct: 185 MHHGMKVRKGTKYVITKWYREK 206
>gi|345324764|ref|XP_001505668.2| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
[Ornithorhynchus anatinus]
Length = 495
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 95/215 (44%), Gaps = 35/215 (16%)
Query: 102 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 161
S+ +LNLR + + V + +R S ++ E + I++++ ++T L
Sbjct: 240 SLEEFKRLNLRDFHKYM-GSQKVKMSDLVRNSQHTWLYQGEGAHQVMRSIQQRVLRLTRL 298
Query: 162 PRI---NGEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 204
P+ + E ++RY G Y++H D+ F E P + S R + L
Sbjct: 299 PQEIVEHSEPLQVVRYDQGGHYHAHMDSGPVFPETACSHTKFITNETAPFETSCRYVTVL 358
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 252
YL ++ GGET FP + D Q I L+VKP+QG + +Y+
Sbjct: 359 FYLNNVTGGGETTFPVADNRTYDEMSLIQNDIDLRDTRKHCDKGNLRVKPKQGTAVFWYN 418
Query: 253 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 282
L +G +D S+HG C V +G KW+A WI
Sbjct: 419 YLSDGQGWVGDLDEYSLHGGCLVTQGTKWIANNWI 453
>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 169
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 68/111 (61%), Gaps = 7/111 (6%)
Query: 80 QVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTL---ALRKGETVDNTQGIRTSSGV 136
+V+SW PR + F NF + E+C ++ +A+ L+ ST+ A KG D +RTSSG+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD----VRTSSGM 113
Query: 137 FISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF 187
F+++ E +S + IE++I+ + +P+ NGE +LRY+ Q Y H+D F
Sbjct: 114 FVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYF 164
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 97/207 (46%), Gaps = 26/207 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRK-GETVDNTQGIRTSSGVFISAAEDE 144
PR + + + + E+ ++I +A+ +T+ ++ GE + I S+ + +E
Sbjct: 346 PRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWL----KHEE 401
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP----QEYGPQKSQRV 200
+ I ++ +T L E + Y IG Y HYD Q++G R+
Sbjct: 402 HDYVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQQDFG--WGGRI 459
Query: 201 ASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTID 260
A++L Y++D+E GG T+FP N L + P++G +++L PNG +
Sbjct: 460 ATWLFYMSDVEAGGATVFPKLN---------------LSLWPQKGSAAFWFNLYPNGEGN 504
Query: 261 PTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H CPV+ G KWVA WI ++ Q
Sbjct: 505 EMTQHAGCPVLTGSKWVANYWIHERGQ 531
>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 294
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 94/196 (47%), Gaps = 32/196 (16%)
Query: 94 FATPEQCKSIINMA-KLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
F+ PE C ++I +A + + GE ++ RTSS F+ A ED L +
Sbjct: 110 FSGPE-CDALIALAGNYMIVSPVVGAGAGEVSES----RTSSSCFL-ARED----LPTVC 159
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFD-----PQEYGPQKSQRVASFLVYL 207
K+ +T P + E + RY QKY +H+DAFD + + QRV + LVYL
Sbjct: 160 HKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAFDLNTEDGRRFAQNGGQRVCTVLVYL 219
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
D+ GG T FP +G+KV+PR+G ++F+ +G +D +H +
Sbjct: 220 NDVPSGGCTAFPQ---------------LGMKVQPRKGMAVVFFPATLDGVLDSRLLHAA 264
Query: 268 CPVVKGEKWVATKWIR 283
P + KWV+ WIR
Sbjct: 265 EPAID-TKWVSQIWIR 279
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 96/196 (48%), Gaps = 28/196 (14%)
Query: 98 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA----EDESGTLDLIEE 153
E+ + I +AK L +T+ D G+ T + +S + ED+ + +
Sbjct: 4 EEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLEEDDDPVVARVNR 56
Query: 154 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQRVASFLVYLTDLE 211
++ +T L E + Y +G +Y H+D + P + G + + R+A+FL Y++D+E
Sbjct: 57 RMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 116
Query: 212 EGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVV 271
GG T+FP +G + P++G + +Y+LL +G D + H +CPV+
Sbjct: 117 AGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 161
Query: 272 KGEKWVATKWIRDQEQ 287
G KWV+ KW ++ Q
Sbjct: 162 VGCKWVSNKWFHERGQ 177
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 96/196 (48%), Gaps = 28/196 (14%)
Query: 98 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA----EDESGTLDLIEE 153
E+ + I +AK L +T+ D G+ T + +S + ED+ + +
Sbjct: 2 EEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLEEDDDPVVARVNR 54
Query: 154 KIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQRVASFLVYLTDLE 211
++ +T L E + Y +G +Y H+D + P + G + + R+A+FL Y++D+E
Sbjct: 55 RMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 114
Query: 212 EGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVV 271
GG T+FP +G + P++G + +Y+LL +G D + H +CPV+
Sbjct: 115 AGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 159
Query: 272 KGEKWVATKWIRDQEQ 287
G KWV+ KW ++ Q
Sbjct: 160 VGCKWVSNKWFHERGQ 175
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 98/208 (47%), Gaps = 30/208 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 339 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 391
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + +++ ++T L E + Y +G +Y H+D E K
Sbjct: 392 EDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTG 451
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G
Sbjct: 452 NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSG 496
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
D + H +CPV+ G KWV+ KW ++
Sbjct: 497 EGDYRTRHAACPVLVGCKWVSNKWFHER 524
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 98/208 (47%), Gaps = 30/208 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 340 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 392
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
ED+ + + +++ ++T L E + Y +G +Y H+D E K
Sbjct: 393 EDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTG 452
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G
Sbjct: 453 NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSG 497
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
D + H +CPV+ G KWV+ KW ++
Sbjct: 498 EGDYRTRHAACPVLVGCKWVSNKWFHER 525
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/208 (25%), Positives = 102/208 (49%), Gaps = 28/208 (13%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + + + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVR-------DPKTGVLTVASYRVSKSSWLE 388
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQR 199
E++ + + ++ ++T L E + Y +G +Y H+D + P + G + + R
Sbjct: 389 EEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNR 448
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+FL Y++D+E GG T+FP +G + P++G + +Y+LL +G
Sbjct: 449 LATFLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSGEG 493
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWV+ KW ++ Q
Sbjct: 494 DYRTRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|399057802|ref|ZP_10744231.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
gi|398041550|gb|EJL34606.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
Length = 210
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 96/199 (48%), Gaps = 28/199 (14%)
Query: 93 NFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIE 152
NF EQC ++ + + + RPST+A G+ RTSS +S + + +
Sbjct: 33 NFVAAEQCAELMALIEDSHRPSTIADYNGD-----DAFRTSSTCDLST---DVPVVANLA 84
Query: 153 EKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLVYL 207
+++++ + + E RY++GQ++ +H D F+P +Y QR +F++YL
Sbjct: 85 AALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPGNADYDKYCAVPGQRTWTFMIYL 144
Query: 208 TDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGS 267
++E GG T F + I ++P G + + + P+GT + ++H +
Sbjct: 145 NEVEAGGATRF---------------RVIDKMIQPEIGKLIAWNNRRPDGTPNAATLHHA 189
Query: 268 CPVVKGEKWVATKWIRDQE 286
V KG K+V T+W R++
Sbjct: 190 MKVRKGYKYVITQWYRERH 208
>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
'Black Sea 11']
Length = 263
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 94/200 (47%), Gaps = 27/200 (13%)
Query: 91 FPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDL 150
+ +F + ++C I+ + K L PS LA + D+ IRTSS ++ ++ +
Sbjct: 85 YDDFLSSQECDDIVALTKDKLAPSKLA--GAASADD---IRTSSTCELAFLGNK--LVKD 137
Query: 151 IEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDP-----QEYGPQKSQRVASFLV 205
++ +I L GE Y +G+ Y HYD F P + + + QR + ++
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGSPQYKTHCLSRGQRTWTCMI 197
Query: 206 YLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIH 265
YL D +GG T F + + V+P++G L + +LLP+G + SIH
Sbjct: 198 YLNDECDGGHTRF---------------TKLDIAVRPKKGMALFWNNLLPSGDPNLNSIH 242
Query: 266 GSCPVVKGEKWVATKWIRDQ 285
+ PV +G K V TKW R +
Sbjct: 243 FAEPVTRGHKTVITKWFRTK 262
>gi|355709034|gb|AES03459.1| prolyl 4-hydroxylase, transmembrane [Mustela putorius furo]
Length = 444
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 102 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 161
S+ +K++LR + KG +++ +R S ++ E + I +++ ++T L
Sbjct: 187 SLQEFSKMDLRDFHKYM-KGHKAASSELVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 245
Query: 162 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 204
P I E ++RY G Y++H D+ E P + S R + L
Sbjct: 246 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLIANESVPFETSCRYMTVL 305
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 252
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 306 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 365
Query: 253 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 282
LP+G +D S+HG C V G KW+A WI
Sbjct: 366 YLPDGQGWVGDVDDYSLHGGCLVTSGTKWIANNWI 400
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQE---YGPQKSQ 198
++E T+ I + + +T L E I Y IG Y H+D E + +
Sbjct: 390 DEEHPTVAKISNRCSALTNLSLSTVEELQIANYGIGGHYEPHFDYSRLAEVTSFDHWRGN 449
Query: 199 RVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGT 258
R+ + + YL+D+E GG T+F M A G K++P +G ++Y+L P+GT
Sbjct: 450 RILTVIFYLSDVEAGGGTVF-----MTA----------GTKLRPEKGAAAVWYNLHPDGT 494
Query: 259 IDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
D + H +CPV+ G KWVA KW ++ Q
Sbjct: 495 GDDETKHAACPVLTGNKWVANKWFHERGQ 523
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 92/210 (43%), Gaps = 26/210 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
P + + + + I + + L+ +T+ V N +RTS FI + +
Sbjct: 302 PLLVLYHDVIYQSEIDVIRKLTENRLKRATVTGHNESVVSN---VRTSQFTFIPVSAHK- 357
Query: 146 GTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEY------GPQKSQR 199
L I++++A +T L E Y IG Y H D F P+ R
Sbjct: 358 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNR 416
Query: 200 VASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTI 259
+A+ L YL+D+ +GG T FP + +KP++ +++L +G
Sbjct: 417 IATVLFYLSDVSQGGGTAFPQLRTL---------------LKPKKYAAAFWHNLHASGVG 461
Query: 260 DPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
D + HG+CP++ G KWV +WIR+ +Q D
Sbjct: 462 DVRTQHGACPIIAGSKWVQNRWIREVDQSD 491
>gi|21107513|gb|AAM36222.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 273
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 38/213 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDES 145
PR + + ++C+ ++ +A+ +LR S + + + IRTS G
Sbjct: 83 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTQRAPIRTSRG---------- 131
Query: 146 GTLDLIEEKIAKVTM---------LPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG--- 193
TLD I E A LP + E ++L Y G++Y +H D P
Sbjct: 132 ATLDPIIEDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADR 191
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
P R + VYL D+ GG+T FP G++V+PR G + F +L
Sbjct: 192 PTAGNRQRTVCVYLNDVGAGGDTEFPIA---------------GVRVRPRPGTLVCFDNL 236
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQE 286
+G D S+H PV G KW+ T W R Q
Sbjct: 237 HADGRPDADSLHAGLPVTAGSKWLGTLWFRQQR 269
>gi|223997846|ref|XP_002288596.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
CCMP1335]
gi|220975704|gb|EED94032.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
CCMP1335]
Length = 373
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 83/162 (51%), Gaps = 23/162 (14%)
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RTSS + + + + ++I ++T +P+ N E+F IL+YK G+ Y SH+D+ D
Sbjct: 218 RTSSNAWCRKECENLTGVKGVSKRIEEMTGIPQNNYESFQILQYKPGEYYKSHHDSSDAN 277
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
+ RV +F +YL D+EEGGET F N + VKP++G L++
Sbjct: 278 K-DKVTGHRVLTFFLYLNDVEEGGETHFTKLN---------------ISVKPKRGRALVW 321
Query: 251 YSLL---PNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
S+L PN T D H + V KG K+ A WI QYD
Sbjct: 322 PSVLNEDPNST-DNRMYHEAKSVEKGIKYAANHWIH---QYD 359
>gi|296474834|tpg|DAA16949.1| TPA: hypoxia-inducible factor prolyl 4-hydroxylase [Bos taurus]
Length = 494
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 102 SIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTML 161
S+ + ++LR +R +++Q +R S ++ E + I +++ ++T L
Sbjct: 237 SLQEFSNMDLRDFHKYMRS-HRAESSQLVRNSHHTWLYQGEGAHHVMRAIRQRVLRLTRL 295
Query: 162 -PRIN--GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFL 204
P I E ++RY G Y++H D+ E P + S R + L
Sbjct: 296 SPEIVELSEPLQVVRYGEGGHYHAHVDSGPVYPETICSHTKLVANESVPFETSCRYMTVL 355
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYS 252
YL ++ GGET+FP + D Q + L+VKPRQG + +Y+
Sbjct: 356 FYLNNVTGGGETVFPVADNRTYDEMSLIQDDVDLRDTRRHCDKGNLRVKPRQGTAVFWYN 415
Query: 253 LLPNGT-----IDPTSIHGSCPVVKGEKWVATKWI 282
LP+G +D S+HG C V +G KW+A WI
Sbjct: 416 YLPDGQGWVGDVDDYSLHGGCLVTRGTKWIANNWI 450
>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 77/157 (49%), Gaps = 20/157 (12%)
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQ 190
RT+ ++ + + + ++ E ++ + + E F +L Y IG Y H D F+
Sbjct: 363 RTTKAGWLDPSHNLIRRMGILTEDMSNLDL---ERSEDFQVLNYGIGGHYAVHPDFFEGS 419
Query: 191 EYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLF 250
P+ RVA+ L YL+D+ GG T+FP + L V P++G L++
Sbjct: 420 --NPELPDRVATLLFYLSDVPLGGATVFPL---------------LDLSVFPKKGAVLMW 462
Query: 251 YSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
Y+L G +IH +CPVV G +WV TKW+ Q Q
Sbjct: 463 YNLDHKGQGMEKTIHSACPVVVGSRWVMTKWVNQQPQ 499
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 63/122 (51%), Gaps = 16/122 (13%)
Query: 167 EAFNILRYKIGQKYNSHYDAF-DPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMN 225
E + Y +G Y H+D F DP Y ++ R+A+ + YL+++E+GG T FPF
Sbjct: 59 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPF----- 113
Query: 226 ADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
+ + VKP+ G+ L +Y+L + D + H CPV+KG KW+ WI +
Sbjct: 114 ----------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEV 163
Query: 286 EQ 287
Q
Sbjct: 164 TQ 165
>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
Length = 478
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 67/258 (25%), Positives = 106/258 (41%), Gaps = 54/258 (20%)
Query: 68 QAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNT 127
+ G D+ LS P+ F + + +I K ++PS + L G + D T
Sbjct: 139 ETGVDAGHRSVVTTLSMRPQVFRISQFMMGHETEKLIERNKPRIKPSEVGL-VGRSGDKT 197
Query: 128 QGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRING-----EAFNILRYKIGQKYNS 182
RTS+ + +A+ + + I + L +I+ + +L Y+ Q Y
Sbjct: 198 ---RTSTNAWDTASP-------VARDVIGRAFRLLKIDAHRKLEDGLQVLHYERPQWYKP 247
Query: 183 HYDAFDPQEYGP----------------QKSQRVASFLVYLTDLEEGGETMFPF------ 220
H D F + G + R A+ +YL + GGET+FP
Sbjct: 248 HVDYFTSRNAGGGGASEDAFSNAIPTANNGTNRFATVFLYLNNAGSGGETVFPLSTTHEI 307
Query: 221 -------ENGMN--------ADGSYDYQ-KCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+ G N AD ++ K L+V PR GD +LFYS + ++D S+
Sbjct: 308 YQGGRLTQAGTNRTPGFIRDADAAWVCDTKSEALRVTPRTGDSVLFYSQRGDASLDGYSL 367
Query: 265 HGSCPVVKGEKWVATKWI 282
HGSCP+ GEKW A W+
Sbjct: 368 HGSCPMGDGEKWAANLWV 385
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 98/208 (47%), Gaps = 30/208 (14%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAA---- 141
P + + N + E+ + I +AK L +T+ D G+ T + +S +
Sbjct: 355 PHIVRYYNVLSDEEIEKIKELAKPKLARATVR-------DPKTGVLTVANYRVSKSSWLE 407
Query: 142 EDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKS---- 197
E++ + + +++ +T L E + Y +G +Y H+D +E K
Sbjct: 408 EEDDLVVAKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTG 467
Query: 198 QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNG 257
RVA+FL Y++D+E GG T+FP D+ G + P++G + +Y+L +G
Sbjct: 468 NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSG 512
Query: 258 TIDPTSIHGSCPVVKGEKWVATKWIRDQ 285
D + H +CPV+ G KWV+ KW ++
Sbjct: 513 EGDYRTRHAACPVLVGCKWVSNKWFHER 540
>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
Length = 487
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 106/225 (47%), Gaps = 15/225 (6%)
Query: 76 NIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSG 135
++ + +S P F ++ ++ ++ +L PS + L+ G RTS+
Sbjct: 265 DLVMETISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRTSTT 324
Query: 136 VFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQ 195
++ ++ + I+++ A + +P + E+ +LRY+ Q Y+ H D F + +
Sbjct: 325 YWLESSSHP--VVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRNS 382
Query: 196 KS----------QRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCI-GLKVKPRQ 244
R+ + Y++D+ +GG T F G+ + + C GL V P++
Sbjct: 383 ADVLKKIEHGYKNRMITVFWYMSDVAKGGHTNFARAGGLPPPPTN--KGCTQGLSVVPKK 440
Query: 245 GDGLLFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQYD 289
++FYS+LPNG DP S+H CPV +G K KW+ ++ + D
Sbjct: 441 RKVVVFYSMLPNGEGDPMSLHAGCPVEEGIKMSGNKWVWNKPRSD 485
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 103/218 (47%), Gaps = 39/218 (17%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLA-LRKGETVDNTQGIRTSSGVFISAAEDE 144
PR + + + + + + + +A+ LR +T+ R G+ T R S ++ A E
Sbjct: 307 PRIVRYHDVLSNREMEKVKELARPRLRRATVHDPRTGQLT--TAPYRVSKSAWLGAFE-- 362
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD--------AFDPQEYGPQK 196
+D I ++I +T L E + Y +G +Y H+D AF +E G
Sbjct: 363 HPIVDQINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGQKDEPDAF--EELG--T 418
Query: 197 SQRVASFLVY-------LTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLL 249
R+A++L+Y ++D++ GG T+F IG V P++G +
Sbjct: 419 GNRIATWLLYVSAAVLRMSDVQAGGATVF---------------TDIGASVLPQKGSAVF 463
Query: 250 FYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L P+G D + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 464 WYNLRPSGDGDYRTRHAACPVLLGNKWVSNKWIHERGQ 501
>gi|224012759|ref|XP_002295032.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
CCMP1335]
gi|220969471|gb|EED87812.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
CCMP1335]
Length = 194
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 94/201 (46%), Gaps = 26/201 (12%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISA-AEDE 144
P + NF T E+ +I + K + E ++ G RTS + +
Sbjct: 2 PWLVSLENFLTDEEADYLIEVGKRQ------QYQLSEQRKDSLGTRTSYSAWCRRDCWKD 55
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYGPQKSQRVASFL 204
T+ + ++IAKVT + +LRY+ GQK+ H D G + QR+ +FL
Sbjct: 56 DATVSSVVDRIAKVTKVETKQLSNLQLLRYEEGQKFKQHTDFAAMLSRGRAQGQRLMTFL 115
Query: 205 VYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSI 264
+YL+D+EEGGET FP+ G+ ++PR+G +L+ +++ N D I
Sbjct: 116 IYLSDVEEGGETSFPYS---------------GVTIQPRKGHAVLWPNVM-NDDPDAKEI 159
Query: 265 ---HGSCPVVKGEKWVATKWI 282
H S PV+KG K + +I
Sbjct: 160 RADHMSLPVLKGVKHAVSIYI 180
>gi|159474434|ref|XP_001695330.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275813|gb|EDP01588.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1887
Score = 80.1 bits (196), Expect = 1e-12, Method: Composition-based stats.
Identities = 57/211 (27%), Positives = 90/211 (42%), Gaps = 25/211 (11%)
Query: 83 SWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAE 142
S PR L F P C ++ +A L +R + R S F +
Sbjct: 1687 SLSPRVLVVDGFLPPGLCDALCAVAAPRL------IRSRVSTGAETPSRVSQSTFFTGDS 1740
Query: 143 DESGTLDLIEEKIAKVTMLPRING---------EAFNILRYKIGQKYNSHYDAFDPQEYG 193
+ +E ++ + P + EA ++ Y +G Y+ HYD + G
Sbjct: 1741 ARLPEVVAVEARLQALMERPEVTAGGRPTLVKSEALQVVSYDVGGFYSEHYD----NKTG 1796
Query: 194 PQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSL 253
S R A+ ++YL D + GG T FP N GL+V P +G L+F+S
Sbjct: 1797 GVIS-RAATIIIYLQDTQAGGSTHFP-----NQQLRLMRVARPGLRVYPAKGRALIFWSR 1850
Query: 254 LPNGTIDPTSIHGSCPVVKGEKWVATKWIRD 284
LP+G+ D S+H + PV G KW+ T+W ++
Sbjct: 1851 LPDGSEDLASLHSAEPVRAGSKWICTRWFKE 1881
>gi|395516293|ref|XP_003762325.1| PREDICTED: transmembrane prolyl 4-hydroxylase [Sarcophilus
harrisii]
Length = 517
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 111/268 (41%), Gaps = 45/268 (16%)
Query: 59 DEYKWMPHGQAGD----DSVTNIPFQVLSWMP----RALYFPNFATPEQ--CKSIINMAK 108
D + + H Q G + +T+ WM R +Y A P+ S+ +
Sbjct: 208 DVFNLLDHNQDGQLQLKEVLTHTRLGNGRWMTPENIREMYAAVKADPDGDGVLSLEEFKQ 267
Query: 109 LNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRI---N 165
LNLR + + V + +R S ++ E + I +++ ++T LPR +
Sbjct: 268 LNLRDFHKYM-GSQKVKVSDLVRNSQHTWLYQGEGAHQVMRSIRQRVLRLTRLPRQIVEH 326
Query: 166 GEAFNILRYKIGQKYNSHYDA-------------FDPQEYGP-QKSQRVASFLVYLTDLE 211
E ++RY G Y++H D+ E P + S R + L YL ++
Sbjct: 327 SEPMQVVRYDQGGHYHAHMDSGPVFPETACSHTKLIANETSPFETSCRYVTVLFYLNNVT 386
Query: 212 EGGETMFPFENGMNADGSYDYQKCI------------GLKVKPRQGDGLLFYSLLPNGT- 258
GGET FP + D Q + L+VKPRQG + +Y+ L +G
Sbjct: 387 GGGETTFPVADNRTYDELSLIQNDVDLRDTRKHCDKGNLRVKPRQGTAVFWYNYLSDGQG 446
Query: 259 ----IDPTSIHGSCPVVKGEKWVATKWI 282
+D S+HG C V +G KW+A WI
Sbjct: 447 WVGELDEYSLHGGCLVTQGTKWIANNWI 474
>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
Length = 546
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 19/159 (11%)
Query: 131 RTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYDAF--D 188
RTS V++ +E+ TL L + ++ T L + E F ++ Y IG + SH+D D
Sbjct: 381 RTSKSVWLDNDANEA-TLKLTQ-RLGDATGLDISHSEPFQVINYGIGGIFESHFDTSLQD 438
Query: 189 PQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGL 248
+ R+A+ L YL D+ +GG T FP G+N + V P+ G L
Sbjct: 439 ENRFLDGYMDRLATTLFYLNDVPQGGATHFP---GLN------------ITVFPKFGTAL 483
Query: 249 LFYSLLPNGTIDPTSIHGSCPVVKGEKWVATKWIRDQEQ 287
+Y+L G + ++H CPV+ G KWV +KWI D+ Q
Sbjct: 484 FWYNLDTKGLLRLRTMHTGCPVIVGSKWVVSKWIDDKGQ 522
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 101/205 (49%), Gaps = 22/205 (10%)
Query: 86 PRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGET-VDNTQGIRTSSGVFISAAEDE 144
P + + + + E+ + I +AK L +T+ R +T V T R S ++ +D
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATV--RDPKTGVLTTASYRVSKSSWLEETDDP 393
Query: 145 SGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNSHYD-AFDPQEYGPQ-KSQRVAS 202
+ + ++ +T L E + Y +G +Y H+D + P + G + + R+A+
Sbjct: 394 --VVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451
Query: 203 FLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPT 262
FL Y++D+E GG T+FP +G + P++G + +Y+LL +G D
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 263 SIHGSCPVVKGEKWVATKWIRDQEQ 287
+ H +CPV+ G KWV+ KW ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521
>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
pv. vasculorum NCPPB 702]
Length = 301
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 87/195 (44%), Gaps = 20/195 (10%)
Query: 98 EQCKSIINMAKLNLRPSTLALRKGETVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAK 157
++C+ ++ +A+ +LR S + + + +RTS G + ++ + + ++A
Sbjct: 123 DECRLLMLLARPHLRDSQV-IDPNDASTQRAPVRTSRGATLDPIIEDFAA-RVAQARLAA 180
Query: 158 VTMLPRINGEAFNILRYKIGQKYNSHYDAFDPQEYG---PQKSQRVASFLVYLTDLEEGG 214
L + E ++L Y G++Y +H D P P R + VYL ++ GG
Sbjct: 181 CAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNVVDAGG 240
Query: 215 ETMFPFENGMNADGSYDYQKCIGLKVKPRQGDGLLFYSLLPNGTIDPTSIHGSCPVVKGE 274
ET FP G++V+PR G + F +L +G + S+H PV G
Sbjct: 241 ETEFPLA---------------GVRVQPRPGALVCFDNLHADGRPNADSLHAGLPVTAGS 285
Query: 275 KWVATKWIRDQEQYD 289
KW+ T W R Q D
Sbjct: 286 KWLGTLWFRQQRYRD 300
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.135 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,862,698,921
Number of Sequences: 23463169
Number of extensions: 201574936
Number of successful extensions: 394911
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1426
Number of HSP's successfully gapped in prelim test: 565
Number of HSP's that attempted gapping in prelim test: 389983
Number of HSP's gapped (non-prelim): 2170
length of query: 289
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 148
effective length of database: 9,050,888,538
effective search space: 1339531503624
effective search space used: 1339531503624
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)