BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022502
(296 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 503 bits (1294), Expect = e-140, Method: Compositional matrix adjust.
Identities = 237/291 (81%), Positives = 262/291 (90%), Gaps = 4/291 (1%)
Query: 10 FFFLLSFSLLIRKSFS----STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKS 65
F FLL SL+ KS S T+II+PSKVKQ+SWKPRAFVYEGFLTDLECDHLI+LAKS
Sbjct: 7 FVFLLLISLIFHKSSSYPGSPTSIIDPSKVKQVSWKPRAFVYEGFLTDLECDHLISLAKS 66
Query: 66 QLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVL 125
+LKRSAVADN SG+SKLS+VRTSSG FI KGKD IIAGIE+KI+TWTFLPKENGED+QVL
Sbjct: 67 ELKRSAVADNESGKSKLSEVRTSSGMFIAKGKDPIIAGIEEKISTWTFLPKENGEDLQVL 126
Query: 126 RYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPAT 185
RYEHGQKY+PHYDYF+DK+NI RGGHR+ATVLMYLSDV KGGETVFPNAEEPPRR+ +
Sbjct: 127 RYEHGQKYDPHYDYFADKINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATES 186
Query: 186 NDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDS 245
++DLSECAKKGI+VKPRRGDALLFFSLH AIPDP SLH+GCPVIEGEKWSATKWIHVDS
Sbjct: 187 HEDLSECAKKGISVKPRRGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIHVDS 246
Query: 246 FDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
FDK +E GG+CTD N SCERWAALGECT NPEYMVGS +LPG+CRRSCKVC
Sbjct: 247 FDKNIEAGGNCTDKNESCERWAALGECTNNPEYMVGSPELPGYCRRSCKVC 297
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 499 bits (1285), Expect = e-139, Method: Compositional matrix adjust.
Identities = 235/297 (79%), Positives = 263/297 (88%), Gaps = 8/297 (2%)
Query: 8 LNFFFLLSFSLLIRK--------SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHL 59
+N + L F LLI K + S++++INPSKVKQISWKPRAFVYEGFLTDLECDHL
Sbjct: 1 MNRVWFLLFLLLISKCDHVWSSYAGSASSVINPSKVKQISWKPRAFVYEGFLTDLECDHL 60
Query: 60 INLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENG 119
I+LAKS+LKRSAVADNLSGES+LSDVRTSSG FI K KD I+AGIEDKI++WTFLPKENG
Sbjct: 61 ISLAKSELKRSAVADNLSGESQLSDVRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENG 120
Query: 120 EDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPR 179
EDIQV RYEHGQKY+PHYDYF+DKVNI RGGHR+ATVLMYL+DVAKGGETVFP+AEEPPR
Sbjct: 121 EDIQVSRYEHGQKYDPHYDYFTDKVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPR 180
Query: 180 RRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATK 239
RR T+ DLSECAKKGIAVKPRRGDALLFFSLHTNA PD SLH+GCPVIEGEKWSATK
Sbjct: 181 RRGAETSSDLSECAKKGIAVKPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATK 240
Query: 240 WIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WIHVDSFDK V GGDC+DN+ SCERWA+LGECTKNPEYM+GS+ +PG+CR+SCK C
Sbjct: 241 WIHVDSFDKTVGAGGDCSDNHVSCERWASLGECTKNPEYMIGSSDIPGYCRKSCKAC 297
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 492 bits (1267), Expect = e-137, Method: Compositional matrix adjust.
Identities = 234/299 (78%), Positives = 264/299 (88%), Gaps = 9/299 (3%)
Query: 6 LSLNFFFLLSFSLLIRK--------SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECD 57
+S +FLL F LLI K + S+++I+NPSKVKQISWKPRAFVYEGFLTDLECD
Sbjct: 1 MSSRVWFLL-FLLLISKCHQVWGSYAGSASSIVNPSKVKQISWKPRAFVYEGFLTDLECD 59
Query: 58 HLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE 117
HLI+LAKS+LKRSAVADNLSGES+LSDVRTSSG FI K KD II+GIEDKI++WTFLPKE
Sbjct: 60 HLISLAKSELKRSAVADNLSGESQLSDVRTSSGMFISKNKDPIISGIEDKISSWTFLPKE 119
Query: 118 NGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEP 177
NGEDIQVLRYEHGQKY+PHYDYF+DKVNI RGGHR+ATVLMYL++V KGGETVFP+AEEP
Sbjct: 120 NGEDIQVLRYEHGQKYDPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEP 179
Query: 178 PRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSA 237
PRRR T+ DLSECAKKGIAVKP RGDALLFFSLHTNA PD SLH+GCPVIEGEKWSA
Sbjct: 180 PRRRGTETSSDLSECAKKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSA 239
Query: 238 TKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
TKWIHVDSFDK V GGDC+D++ SCERWA+LGECTKNPEYM+GS+ +PG+CR+SCK C
Sbjct: 240 TKWIHVDSFDKTVGAGGDCSDHHVSCERWASLGECTKNPEYMIGSSDVPGYCRKSCKSC 298
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 232/300 (77%), Positives = 262/300 (87%), Gaps = 4/300 (1%)
Query: 1 MSPTRLSLNFFFLLSFSLLIRKSFS----STAIINPSKVKQISWKPRAFVYEGFLTDLEC 56
M+ T F FLLS ++ KS S S++IINP+KVKQ+SWKPRAFVYEGFLTDLEC
Sbjct: 1 MATTIYPRQFLFLLSIFSILHKSISYPGTSSSIINPAKVKQVSWKPRAFVYEGFLTDLEC 60
Query: 57 DHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPK 116
DHLI+LAKS+LKRSAVADN SG+SKLS+VRTSSG FI K KD I+AGIEDKIATWTFLP+
Sbjct: 61 DHLISLAKSELKRSAVADNESGKSKLSEVRTSSGMFITKAKDPIVAGIEDKIATWTFLPR 120
Query: 117 ENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEE 176
ENGEDIQVLRYEHGQKY+PHYDYFSDKVNI RGGHR+ATVLMYL+DV KGGETVFP+AEE
Sbjct: 121 ENGEDIQVLRYEHGQKYDPHYDYFSDKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEE 180
Query: 177 PPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWS 236
PRR+ +++DLSECA+KGIAVKPRRGDALLFFSL+ A+PD S+H+GCPVIEGEKWS
Sbjct: 181 LPRRKASVSHEDLSECARKGIAVKPRRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKWS 240
Query: 237 ATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
ATKWIHVDSFDK +E GG+CTD N SC RWAALGECTKN EYMVGS+ LPG+CRRSCKVC
Sbjct: 241 ATKWIHVDSFDKNLEAGGNCTDQNESCGRWAALGECTKNVEYMVGSSGLPGYCRRSCKVC 300
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 220/272 (80%), Positives = 250/272 (91%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S++AII+PSKVKQ+SWKPRAFVYEGFLT+LECDHLI++AKS+LKRSAVADNLSGESKLS+
Sbjct: 568 SASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE 627
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
VRTSSG FIPK KD I+AGIEDKI++WTFLPKENGEDIQVLRYEHGQKY+PHYDYF+DKV
Sbjct: 628 VRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 687
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
NI RGGHR+ATVLMYL+DV KGGETVFP+AEE PR + TN++LSECA+KGIAVKPRRG
Sbjct: 688 NIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRG 747
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCE 264
DALLFFSL+ NAIPD +SLH+GCPVIEGEKWSATKWIHVDSFDK+V +GGDC D + +CE
Sbjct: 748 DALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKVVGDGGDCNDKHENCE 807
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
RWA LGECT NPEYMVGS LPG+C +SCK C
Sbjct: 808 RWATLGECTSNPEYMVGSPGLPGYCMKSCKEC 839
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 221/271 (81%), Positives = 251/271 (92%)
Query: 26 STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
++AII+P+KVKQ+SWKPRAFVY+GFLTDLECDHLI++AKS+LKRSAVADNLSGESKLS+V
Sbjct: 31 TSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEV 90
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
RTSSG FI K KDAI++GIEDKI++WTFLPKENGEDIQVLRYEHGQKY+PHYDYF+DKVN
Sbjct: 91 RTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVN 150
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
I RGGHR+ATVLMYL++V KGGETVFPNAEE PR + T++DLSEC KKG+AVKPRRGD
Sbjct: 151 IARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGKKGVAVKPRRGD 210
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCER 265
ALLFFSLH NAIPD +SLH+GCPVIEGEKWSATKWIHVDSFDK V GGDCTD + SCER
Sbjct: 211 ALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKTVGAGGDCTDQHESCER 270
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAALGECTKNPEYMVG++ LPG+CR+SCK C
Sbjct: 271 WAALGECTKNPEYMVGTSGLPGYCRKSCKTC 301
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 227/293 (77%), Positives = 255/293 (87%), Gaps = 1/293 (0%)
Query: 4 TRLSLNFFFLLSFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLA 63
+ L L FFLL +R+S SS+AIINPSK KQISWKPRAFVYEGFLTD EC+HLI+LA
Sbjct: 3 SSLQLASFFLLFIIAFVRES-SSSAIINPSKAKQISWKPRAFVYEGFLTDEECNHLISLA 61
Query: 64 KSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQ 123
KS+LKRSAVADN SG SK S+VRTSSG FIPK KD I++GIE+KIATWTFLPKENGE+IQ
Sbjct: 62 KSELKRSAVADNESGNSKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQ 121
Query: 124 VLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTP 183
VLRYE GQKYEPHYDYF DKVNI RGGHRLATVLMYL++V KGGETVFP AEE PRRR+
Sbjct: 122 VLRYEEGQKYEPHYDYFVDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSM 181
Query: 184 ATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
+D LSECAKKGI VKPR+GDALLF+SLH NA PDP+SLH GCPVI+GEKWSATKWIHV
Sbjct: 182 IADDSLSECAKKGIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWSATKWIHV 241
Query: 244 DSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
DSFDK V+ G+C+D + +CERWAALGECTKNPEYM+GSA LPG+CR+SCKVC
Sbjct: 242 DSFDKTVDTEGNCSDRDENCERWAALGECTKNPEYMLGSAGLPGYCRKSCKVC 294
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 479 bits (1232), Expect = e-133, Method: Compositional matrix adjust.
Identities = 221/273 (80%), Positives = 251/273 (91%), Gaps = 2/273 (0%)
Query: 26 STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
++AII+P+KVKQ+SWKPRAFVY+GFLTDLECDHLI++AKS+LKRSAVADNLSGESKLS+V
Sbjct: 31 TSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEV 90
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
RTSSG FI K KDAI++GIEDKI++WTFLPKENGEDIQVLRYEHGQKY+PHYDYF+DKVN
Sbjct: 91 RTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVN 150
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAE--EPPRRRTPATNDDLSECAKKGIAVKPRR 203
I RGGHR+ATVLMYL++V KGGETVFPNAE E PR + T++DLSEC KKG+AVKPRR
Sbjct: 151 IARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRR 210
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASC 263
GDALLFFSLH NAIPD +SLH+GCPVIEGEKWSATKWIHVDSFDK V GGDCTD + SC
Sbjct: 211 GDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKTVGAGGDCTDQHESC 270
Query: 264 ERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
ERWAALGECTKNPEYMVG++ LPG+CR+SCK C
Sbjct: 271 ERWAALGECTKNPEYMVGTSGLPGYCRKSCKTC 303
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 223/288 (77%), Positives = 256/288 (88%), Gaps = 5/288 (1%)
Query: 14 LSFSLLIRKSFSS-----TAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
L+ L ++FSS +AII+PSKVKQ+SWKPRAFVYEGFLT+LECDHLI++AKS+LK
Sbjct: 14 LALMLQWHEAFSSYAGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELK 73
Query: 69 RSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYE 128
RSAVADNLSGESKLS+VRTSSG FIPK KD I+AGIEDKI++WTFLPKENGEDIQVLRYE
Sbjct: 74 RSAVADNLSGESKLSEVRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYE 133
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD 188
HGQKY+PHYDYF+DKVNI RGGHR+ATVLMYL+DV KGGETVFP+AEE PR + TN++
Sbjct: 134 HGQKYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNEN 193
Query: 189 LSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
LSECA+KGIAVKPRRGDALLFFSL+ NAIPD +SLH+GCPVIEGEKWSAT+WIHVDSFDK
Sbjct: 194 LSECAQKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIHVDSFDK 253
Query: 249 IVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+V +GGDC D + +CERWA LGECT NPEYMVGS LPG+C +SCK C
Sbjct: 254 VVGDGGDCNDKHENCERWATLGECTSNPEYMVGSPGLPGYCMKSCKEC 301
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 222/288 (77%), Positives = 252/288 (87%), Gaps = 5/288 (1%)
Query: 14 LSFSLLIRKSFSS-----TAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
L+ L ++FSS +AII+PSKVKQ+SWKPRAFVYEGFLT+LECDHLI++AKS+LK
Sbjct: 14 LALMLQWHEAFSSYAGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELK 73
Query: 69 RSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYE 128
RSAVADNLSGESKLS+VRTSSG FIPK KD I+AG+EDKI++WT LPKENGEDIQVLRYE
Sbjct: 74 RSAVADNLSGESKLSEVRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYE 133
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD 188
HGQKY+PHYDYF+DKVNI RGGHR+ATVLMYL+DV KGGETVFPNAEE PR R T +D
Sbjct: 134 HGQKYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKED 193
Query: 189 LSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
LSECA+KGIAVKPRRGDALLFFSL+ NAIPD +SLH+GCPVIEGEKWSATKWIHVDSFDK
Sbjct: 194 LSECAQKGIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDK 253
Query: 249 IVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+V +GGDC D +C+RWA LGECT NP YMVGS LPG+C +SCK C
Sbjct: 254 MVADGGDCNDKQENCDRWATLGECTSNPNYMVGSPGLPGYCMKSCKAC 301
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 221/272 (81%), Positives = 247/272 (90%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S+++IINPSKVKQISW PRAFVY+GFLTDLECDHLI+LAKS+LKRSAVADNLSG+S+LSD
Sbjct: 27 SASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD 86
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
VRTSSG FI K KD I++GIED+I+ WTFLPKENGEDIQVLRYEHGQKY+PHYDYF+DKV
Sbjct: 87 VRTSSGMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 146
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
NIV+GGHRLATVLMYL++V KGGETVFP AEEPPRRR + DLSECAKKGIAVKPRRG
Sbjct: 147 NIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRG 206
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCE 264
DALLFFSL TNAIPD SLH+GCPV+EGEKWSATKWIHVDSFDKIV GG C+D + SCE
Sbjct: 207 DALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSFDKIVGAGGGCSDQHDSCE 266
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
RWA+LGECT NP YMVGS+ LPG+CR+SCK C
Sbjct: 267 RWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 222/268 (82%), Positives = 243/268 (90%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
IINPSKVKQ+SWKPRAFVYEGFLT LECDHLI+LAKS+LKRSAVADNL G+SKLS+VRTS
Sbjct: 30 IINPSKVKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTS 89
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SG FI K KD I+AGIEDKI+ WTFLPKENGED+QVLRYEHGQKY+PHYDYF+DKVNIVR
Sbjct: 90 SGMFISKKKDPIVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVR 149
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGHR+ATVL+YL++V +GGETVFP AEEPPRRR TN DLSECAKKGIAVKPRRGDALL
Sbjct: 150 GGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGDALL 209
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAA 268
FFSLHT AIPD SLH+GCPVIEGEKWSATKWIHVDSFDK V GGDC+D + SC+RWA+
Sbjct: 210 FFSLHTTAIPDTDSLHAGCPVIEGEKWSATKWIHVDSFDKTVGAGGDCSDQHESCQRWAS 269
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
LGECT NPEYMVGS+ LPG CRRSCK C
Sbjct: 270 LGECTNNPEYMVGSSDLPGSCRRSCKAC 297
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 220/272 (80%), Positives = 246/272 (90%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S+++IINPSKVKQISW PRAFVY+GFLTDLECDHLI+LAKS+LKRSAVADNLSG+S+LSD
Sbjct: 27 SASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD 86
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
VRTSSG I K KD I++GIED+I+ WTFLPKENGEDIQVLRYEHGQKY+PHYDYF+DKV
Sbjct: 87 VRTSSGMLISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 146
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
NIV+GGHRLATVLMYL++V KGGETVFP AEEPPRRR + DLSECAKKGIAVKPRRG
Sbjct: 147 NIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRG 206
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCE 264
DALLFFSL TNAIPD SLH+GCPV+EGEKWSATKWIHVDSFDKIV GG C+D + SCE
Sbjct: 207 DALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSFDKIVGAGGGCSDQHDSCE 266
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
RWA+LGECT NP YMVGS+ LPG+CR+SCK C
Sbjct: 267 RWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 220/272 (80%), Positives = 246/272 (90%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S+++IINPSKVKQISW PRAFVY+GFLTDLECDHLI+LAKS+LKRSAVADNLSG+S+LSD
Sbjct: 27 SASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD 86
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
VRTSSG FI K KD I++GIED+I+ WTFLPKENGEDIQVLRYEHGQKY+PHYDYF+DKV
Sbjct: 87 VRTSSGMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 146
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
NIV+GGHRLATVLMYL++V KGGETVFP AEEPPRRR + DLSECAKKGIAVKPRRG
Sbjct: 147 NIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRG 206
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCE 264
DALLFFSL TNAIPD SLH+GCPV+EGEKWSATKWIHVDS DKIV GG C+D + SCE
Sbjct: 207 DALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSLDKIVGAGGGCSDQHDSCE 266
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
RWA+LGECT NP YMVGS+ LPG+CR+SCK C
Sbjct: 267 RWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 472 bits (1214), Expect = e-131, Method: Compositional matrix adjust.
Identities = 227/295 (76%), Positives = 254/295 (86%), Gaps = 5/295 (1%)
Query: 7 SLNFFFLLSFSLLIRKSFSSTAI-----INPSKVKQISWKPRAFVYEGFLTDLECDHLIN 61
SL F LL S I + SS A ++ +KV+QISWKPRAFVYEGFL++ ECDHLI+
Sbjct: 4 SLQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLIS 63
Query: 62 LAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGED 121
LAKS+LKRSAVADN+SG+S+LS+VRTSSG FI KGKD I+AGIEDKIA WTFLPK+NGED
Sbjct: 64 LAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGED 123
Query: 122 IQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRR 181
+QVLRYE GQKY+ HYDYF DKVNI RGGHR+ATVLMYLSDV KGGETVFP AEEP RR+
Sbjct: 124 MQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRK 183
Query: 182 TPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
TNDDLSECA+KGIAVKPR+GDALLFFSLH AIPDP+SLH GCPVIEGEKWSATKWI
Sbjct: 184 PLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWI 243
Query: 242 HVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
HVDSFDKI++ GG+CTD N SCERWAALGECTKNPEYM+GS+ LPG CRRSCKVC
Sbjct: 244 HVDSFDKILKPGGNCTDENDSCERWAALGECTKNPEYMLGSSDLPGACRRSCKVC 298
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 218/288 (75%), Positives = 250/288 (86%), Gaps = 9/288 (3%)
Query: 14 LSFSLLIRKSFSS-----TAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
L+ L ++FSS +AII+PSKVKQ+SWKPRAFVYEGFLT+LECDHLI++AKS+LK
Sbjct: 14 LALMLQWHEAFSSYAGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELK 73
Query: 69 RSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYE 128
RSAVADNLSGESKLS+VRTSSG FIPK KD I+AG+EDKI++WT LPKENGEDIQVLRYE
Sbjct: 74 RSAVADNLSGESKLSEVRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYE 133
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD 188
HGQKY+PHYDYF+DKVNI RGGHR+ATVLMYL+DV KGGETVFPNAE ++ T +D
Sbjct: 134 HGQKYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAE----LKSSETKED 189
Query: 189 LSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
LSECA+KGIAVKPRRGDALLFFSL+ NAIPD +SLH+GCPVIEGEKWSATKWIHVDSFDK
Sbjct: 190 LSECAQKGIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDK 249
Query: 249 IVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+V +GGDC D +C+RWA LGECT NP YMVGS LPG+C +SCK C
Sbjct: 250 MVADGGDCNDKQENCDRWATLGECTSNPNYMVGSPGLPGYCMKSCKAC 297
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 463 bits (1191), Expect = e-128, Method: Compositional matrix adjust.
Identities = 219/274 (79%), Positives = 248/274 (90%), Gaps = 4/274 (1%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S++AII+PSKVKQ+SWKPRAFVY+GFLT+LECDHLI+LAKS+LKRSAVADNLSG+SKLSD
Sbjct: 31 SASAIIDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSD 90
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
VRTSSG FI K KD I+AGIEDKI++WTFLPKENGEDIQVLRYEHGQKY+PHYD+F+DKV
Sbjct: 91 VRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDFFADKV 150
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNA--EEPPRRRTPATNDDLSECAKKGIAVKPR 202
NI RGGHR+ATVLMYL++V +GGETVFPNA EE PR R T DDLSECAKKGIAVKPR
Sbjct: 151 NIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSECAKKGIAVKPR 210
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNAS 262
RGDALLFFSL+ NA+PD +SLH+GCPVIEGEKWSATKWIHVDSFD+ + GGDCTD++ S
Sbjct: 211 RGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWIHVDSFDR--KAGGDCTDHHES 268
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C WAA+GECT NPEYMVGSA LPG+C RSCK C
Sbjct: 269 CASWAAVGECTNNPEYMVGSAGLPGYCMRSCKAC 302
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 219/298 (73%), Positives = 253/298 (84%), Gaps = 2/298 (0%)
Query: 1 MSPTRLSLNFFFLLSFSLLIRKSFSST--AIINPSKVKQISWKPRAFVYEGFLTDLECDH 58
M+ L ++FF + S L S S+ +NPSKVKQ+S KPRAFVYEGFLT+LECDH
Sbjct: 1 MARRGLLISFFAIFSVLLQSSTSLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDH 60
Query: 59 LINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKEN 118
+++LAK+ LKRSAVADN SGESK S+VRTSSGTFI KGKD I++GIEDKI+TWTFLPKEN
Sbjct: 61 MVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKEN 120
Query: 119 GEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPP 178
GEDIQVLRYEHGQKY+ H+DYF DKVNIVRGGHR+AT+LMYLS+V KGGETVFP+AE P
Sbjct: 121 GEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPS 180
Query: 179 RRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSAT 238
RR +DLS+CAK+GIAVKPR+GDALLFF+LH +AIPDP+SLH GCPVIEGEKWSAT
Sbjct: 181 RRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSAT 240
Query: 239 KWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
KWIHVDSFD+IV G+CTD N SCERWA LGECTKNPEYMVG+ +LPG+CRRSCK C
Sbjct: 241 KWIHVDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 219/298 (73%), Positives = 253/298 (84%), Gaps = 2/298 (0%)
Query: 1 MSPTRLSLNFFFLLSFSLLIRKSFSST--AIINPSKVKQISWKPRAFVYEGFLTDLECDH 58
M+ L ++FF + S L S S+ +NPSKVKQ+S KPRAFVYEGFLT+LECDH
Sbjct: 1 MARRGLLISFFAIFSVLLQSSTSLISSSSVFVNPSKVKQVSSKPRAFVYEGFLTELECDH 60
Query: 59 LINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKEN 118
+++LAK+ LKRSAVADN SGESK S+VRTSSGTFI KGKD I++GIEDKI+TWTFLPKEN
Sbjct: 61 MVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKEN 120
Query: 119 GEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPP 178
GEDIQVLRYEHGQKY+ H+DYF DKVNIVRGGHR+AT+LMYLS+V KGGETVFP+AE P
Sbjct: 121 GEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPS 180
Query: 179 RRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSAT 238
RR +DLS+CAK+GIAVKPR+GDALLFF+LH +AIPDP+SLH GCPVIEGEKWSAT
Sbjct: 181 RRVLSENEEDLSDCAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSAT 240
Query: 239 KWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
KWIHVDSFD+IV G+CTD N SCERWA LGECTKNPEYMVG+ +LPG+CRRSCK C
Sbjct: 241 KWIHVDSFDRIVTPSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 459 bits (1181), Expect = e-127, Method: Compositional matrix adjust.
Identities = 224/295 (75%), Positives = 250/295 (84%), Gaps = 7/295 (2%)
Query: 7 SLNFFFLLSFSLLIRKSFSSTAI-----INPSKVKQISWKPRAFVYEGFLTDLECDHLIN 61
SL F LL S I + SS A ++ +KV+QISWKPRAFVYEGFL++ ECDHLI+
Sbjct: 4 SLQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLIS 63
Query: 62 LAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGED 121
LAKS+LKRSAVADN+SG+S+LS+VRTSSG FI KGKD I+AGIEDKIA WTFLPK+NGED
Sbjct: 64 LAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGED 123
Query: 122 IQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRR 181
+QVLRYE GQKY+ HYDYF DKVNI RGGHR+ATVLMYLSDV KGGETVFP AE
Sbjct: 124 MQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAE--VSSS 181
Query: 182 TPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
T TNDDLSECA+KGIAVKPR+GDALLFFSLH AIPDP+SLH GCPVIEGEKWSATKWI
Sbjct: 182 TLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWI 241
Query: 242 HVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
HVDSFDKI++ GG+CTD N SCERWAALGECTKNPEYM+GS+ LPG CRRSCKVC
Sbjct: 242 HVDSFDKILKPGGNCTDENDSCERWAALGECTKNPEYMLGSSDLPGACRRSCKVC 296
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/295 (73%), Positives = 251/295 (85%), Gaps = 5/295 (1%)
Query: 7 SLNFFFLLSFSLLIRKSF-----SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLIN 61
+L F FL+ S IR+S S++A ++PSKVKQISWKPRAFVYEGFLTDLECDHL++
Sbjct: 6 NLLFIFLILTSSFIRESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVS 65
Query: 62 LAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGED 121
+A+S+LKRS VADN SG+SKLS VRTSSG FI K KD I++GIEDKI+ WTFLPKENGED
Sbjct: 66 IARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGED 125
Query: 122 IQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRR 181
IQVLRYEHGQKYE HYDYF DKVNI GGHRLATVLMYLS+V +GGETVFP AE+P RR
Sbjct: 126 IQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRR 185
Query: 182 TPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
T++DLSECAKKG+AVKP++GDALLFFSL NAIPD SLH GCPV+EGEKWSATKWI
Sbjct: 186 AYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWI 245
Query: 242 HVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
HVDSF K + + G+CTD N SCERWAALGECTKNPEYMVGS ++PG+CRRSC++C
Sbjct: 246 HVDSFSKNLGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 210/268 (78%), Positives = 238/268 (88%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+NP+KVKQISW PRAFVYEGFLTDLECDHLI+LAK++LKRS+VADNLSG+SK+S+VRTS
Sbjct: 35 IVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTS 94
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SG FI K KD I++GIEDKIA WTFLPK+NGEDIQVLRYE+GQKY+ H+DYF+DKVNI R
Sbjct: 95 SGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIAR 154
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGHR+ATVLMYLSDV KGGETVFP+AEE RR+ TN+DLS+CAKKGIAVKPR+GDALL
Sbjct: 155 GGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALL 214
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAA 268
FFSLH NAIPD SLH GCPVIEGEKWSATKWI VDSFD +V + +C D N SCERWA
Sbjct: 215 FFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWAE 274
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
LGECT NPEYMVGS +LPG+CR+SCK C
Sbjct: 275 LGECTNNPEYMVGSPELPGYCRKSCKAC 302
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 211/269 (78%), Positives = 239/269 (88%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
INPSKVKQ+S KPRAFVYEGFLT+LECDH+++LAK+ LKRSAVADN SGESK S+VRT
Sbjct: 30 VFINPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRT 89
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSGTFIPKGKD I++GIEDKI+TWTFLPKENGEDIQVLRYEHGQKY+ H+DYF DKVNIV
Sbjct: 90 SSGTFIPKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIV 149
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
RGGHR+ATVLMYLS+V KGGETVFP+AE P R +DLS+CAK+GIAVKPR+GDAL
Sbjct: 150 RGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDAL 209
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWA 267
LFF+LH +AIPDP+SLH GCPVIEGEKWSATKWIHVDSFDKIV G+CT+ + SCERWA
Sbjct: 210 LFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDKIVTPSGNCTNMHESCERWA 269
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
LGECTKNPEYMVG+ +LPG+CR SCK C
Sbjct: 270 VLGECTKNPEYMVGTTELPGYCRHSCKAC 298
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/292 (74%), Positives = 247/292 (84%), Gaps = 6/292 (2%)
Query: 9 NFFFLLSFSLLIRKSFSSTAII---NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKS 65
F L+S S LI S SS I NPSKV+QISWKPRAFVYEGFLTD ECDHLI++AK+
Sbjct: 8 QFICLISISCLINGSLSSNDSIFKLNPSKVRQISWKPRAFVYEGFLTDEECDHLISIAKT 67
Query: 66 QLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVL 125
+LKRSAVADN SG+S++S+VRTSSG FI K KDAI+ IE+K+ATWTFLP ENGEDIQVL
Sbjct: 68 ELKRSAVADNESGKSQVSEVRTSSGAFISKAKDAIVQRIEEKLATWTFLPIENGEDIQVL 127
Query: 126 RYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTP-A 184
RYE GQKYE H+D+FSDKVNI RGGHR ATVLMYLS+V KGG+TVFPNAE R++ A
Sbjct: 128 RYEEGQKYENHFDFFSDKVNIARGGHRYATVLMYLSNVEKGGDTVFPNAELSERQKAAIA 187
Query: 185 TNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVD 244
NDDLSECAK+GI+VKPR+GDALLFFSL A PD +SLH GCPVIEGEKWSATKWIHVD
Sbjct: 188 ANDDLSECAKRGISVKPRKGDALLFFSLTPTATPDQLSLHGGCPVIEGEKWSATKWIHVD 247
Query: 245 SFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
SFDKI+E+G C D+N +CERWAALGECTKNPEYMVG++ LPG+CRRSCKVC
Sbjct: 248 SFDKILEDG--CNDHNQNCERWAALGECTKNPEYMVGTSSLPGYCRRSCKVC 297
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 207/269 (76%), Positives = 234/269 (86%), Gaps = 1/269 (0%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+NP+KVKQISW PRAFVYEGFLTDLECDHLI+LAK++LKRS+VADNLSG+SK+S+VRTS
Sbjct: 35 IVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTS 94
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SG FI K KD I++GIEDKIA WTFLPK+NGEDIQVLRYE+GQKY+ H+DYF+DKVNI R
Sbjct: 95 SGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIAR 154
Query: 149 GGHRLATVLMYLSDVAKGGETVF-PNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
GGHR+ATVLMYLSDV KGGETVF E RR+ TN+DLS+CAKKGIAVKPR+GDAL
Sbjct: 155 GGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDAL 214
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWA 267
LFFSLH NAIPD SLH GCPVIEGEKWSATKWI VDSFD +V + +C D N SCERWA
Sbjct: 215 LFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVRDHTNCGDENPSCERWA 274
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
LGECT NPEYMVGS +LPG+CR+SCK C
Sbjct: 275 ELGECTNNPEYMVGSPELPGYCRKSCKAC 303
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 442 bits (1137), Expect = e-122, Method: Compositional matrix adjust.
Identities = 216/299 (72%), Positives = 250/299 (83%), Gaps = 3/299 (1%)
Query: 1 MSPTRLSLNFFFLLSFSLLIRKS---FSSTAIINPSKVKQISWKPRAFVYEGFLTDLECD 57
MS +RL L F + LL + S ++IINPSKVKQ+S KPRAFVYEGFLTDLECD
Sbjct: 1 MSMSRLGLLLFVAILLVLLQSSTCLISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECD 60
Query: 58 HLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE 117
HLI+LAK L+RSAVADN +GES++SDVRTSSGTFI KGKD I++GIEDK++TWTFLPKE
Sbjct: 61 HLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKE 120
Query: 118 NGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEP 177
NGED+QVLRYEHGQKY+ H+DYF DKVNI RGGHR+ATVL+YLS+V KGGETVFP+A+E
Sbjct: 121 NGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEF 180
Query: 178 PRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSA 237
RR DDLS+CAKKGIAVKP++G+ALLFF+L +AIPDP SLH GCPVIEGEKWSA
Sbjct: 181 SRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSA 240
Query: 238 TKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
TKWIHVDSFDKI+ G+CTD N SCERWA LGEC KNPEYMVG+ ++PG CRRSCK C
Sbjct: 241 TKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 439 bits (1129), Expect = e-121, Method: Compositional matrix adjust.
Identities = 215/299 (71%), Positives = 249/299 (83%), Gaps = 3/299 (1%)
Query: 1 MSPTRLSLNFFFLLSFSLLIRKS---FSSTAIINPSKVKQISWKPRAFVYEGFLTDLECD 57
MS +RL L F + LL + S ++IINPSKVKQ+S KPRAFVY GFLTDLECD
Sbjct: 1 MSMSRLGLLLFVAILLVLLQSSTCLISSPSSIINPSKVKQVSSKPRAFVYGGFLTDLECD 60
Query: 58 HLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE 117
HLI+LAK L+RSAVADN +GES++SDVRTSSGTFI KGKD I++GIEDK++TWTFLPKE
Sbjct: 61 HLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKE 120
Query: 118 NGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEP 177
NGED+QVLRYEHGQKY+ H+DYF DKVNI RGGHR+ATVL+YLS+V KGGETVFP+A+E
Sbjct: 121 NGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEF 180
Query: 178 PRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSA 237
RR DDLS+CAKKGIAVKP++G+ALLFF+L +AIPDP SLH GCPVIEGEKWSA
Sbjct: 181 SRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSA 240
Query: 238 TKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
TKWIHVDSFDKI+ G+CTD N SCERWA LGEC KNPEYMVG+ ++PG CRRSCK C
Sbjct: 241 TKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 214/296 (72%), Positives = 248/296 (83%), Gaps = 3/296 (1%)
Query: 4 TRLSLNFFFLLSFSLLIRKS---FSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLI 60
+RL L F + LL + S ++IINPSKVKQ+S KPRAFVYEGFLTDLECDHLI
Sbjct: 2 SRLGLLLFVAILLVLLQSSTCLISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECDHLI 61
Query: 61 NLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGE 120
+LAK L+RSAVADN +GES++SDVRTSSGTFI KGKD I++GIEDK++TWTFLPKENGE
Sbjct: 62 SLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGE 121
Query: 121 DIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRR 180
D+QVLRYEHGQKY+ H+DYF DKVNI RGGHR+ATVL+YLS+V KGGETVFP+A+E RR
Sbjct: 122 DLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRR 181
Query: 181 RTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKW 240
DDLS+CAKKGIAVKP++G+ALLFF+L +AIPDP SLH GCPVIEGEKWSATKW
Sbjct: 182 SLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKW 241
Query: 241 IHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
IHVDSFDKI+ G+CTD N SCERWA LGEC KNPEYMVG+ ++PG CRRSCK C
Sbjct: 242 IHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 297
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 215/299 (71%), Positives = 247/299 (82%), Gaps = 3/299 (1%)
Query: 1 MSPTRLSLNFF---FLLSFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECD 57
MS +RL L F FL+ S ++IINPSKVKQ+S KPRAFVYEGFLTDLECD
Sbjct: 1 MSMSRLGLLLFVAIFLVLLQSSTSLISSPSSIINPSKVKQVSAKPRAFVYEGFLTDLECD 60
Query: 58 HLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE 117
HLI+LAK L+RSAVADN +GES++SDVRTSSGTFI KGKD I++GIEDK++TWTFLPKE
Sbjct: 61 HLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKE 120
Query: 118 NGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEP 177
NGED+QVLRYE GQKY+ H+DYF DKVNI RGGHR+ATVL+YLS+V KGGETVFP+A+E
Sbjct: 121 NGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEY 180
Query: 178 PRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSA 237
RR DDLS+CAKKGIAVKP++G+ALLFF+L +AIPDP SLH GCPVIEGEKWSA
Sbjct: 181 SRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSA 240
Query: 238 TKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
TKWIHVDSFDKI+ G+CTD N SCERWA LGEC KNPEYMVG+ +LPG CR SCK C
Sbjct: 241 TKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPELPGNCRHSCKAC 299
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 186/268 (69%), Positives = 226/268 (84%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P +QISWKPR F+Y+ FL+D E +HL++LA+++LKRSAVADNLSG+S+LSD RTS
Sbjct: 52 VVYPHHSRQISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDARTS 111
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SGTFI K +D I+AGIE+KIA WTFLPKENGEDIQVLRY+HG+KYE HYDYFSD VN +R
Sbjct: 112 SGTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLR 171
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGHR+ATVLMYL+DVA+GGETVFP AEE T + LSECAKKG+AVKPR+GDALL
Sbjct: 172 GGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSECAKKGVAVKPRKGDALL 231
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAA 268
FF+L +A D +SLH+GCPVI+GEKWSATKWI V SFDK+ G+CTD+N SCE+WAA
Sbjct: 232 FFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIRVASFDKVYHTQGNCTDDNESCEKWAA 291
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
LGEC KNPEYM+G+A LPG+CR+SC +C
Sbjct: 292 LGECIKNPEYMIGTAALPGYCRKSCNIC 319
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 186/268 (69%), Positives = 226/268 (84%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P +QISWKPR F+Y+ FL+D E +HL++LA+++LKRSAVADNLSG+S+LSD RTS
Sbjct: 52 VVYPHHSRQISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDARTS 111
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SGTFI K +D I+AGIE+KIA WTFLPKENGEDIQVLRY+HG+KYE HYDYFSD VN +R
Sbjct: 112 SGTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLR 171
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGHR+ATVLMYL+DVA+GGETVFP AEE T + LSECAKKG+AVKPR+GDALL
Sbjct: 172 GGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSECAKKGVAVKPRKGDALL 231
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAA 268
FF+L +A D +SLH+GCPVI+GEKWSATKWI V SFDK+ G+CTD+N SCE+WAA
Sbjct: 232 FFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIRVASFDKVYHTQGNCTDDNESCEKWAA 291
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
LGEC KNPEYM+G+A LPG+CR+SC +C
Sbjct: 292 LGECIKNPEYMIGTAALPGYCRKSCNIC 319
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/299 (67%), Positives = 236/299 (78%), Gaps = 24/299 (8%)
Query: 1 MSPTRLSLNFFFLLSFSLLIRKS---FSSTAIINPSKVKQISWKPRAFVYEGFLTDLECD 57
MS +RL L F + LL + S ++IINPSKVKQ+S KPRAFVYEGFLTDLECD
Sbjct: 1 MSMSRLGLLLFVAILLVLLQSSTCLISSPSSIINPSKVKQVSSKPRAFVYEGFLTDLECD 60
Query: 58 HLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE 117
HLI+LAK L+RSAVADN +GES++SDVRTSSGTFI KGKD I++GIEDK++TWTFLPKE
Sbjct: 61 HLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKE 120
Query: 118 NGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEP 177
NGED+QVLRYEHGQKY+ H+DYF DKVNI RGGHR+ATVL+YLS+V KGGETVFP+A+
Sbjct: 121 NGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQ-- 178
Query: 178 PRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSA 237
+ +KP++G+ALLFF+L +AIPDP SLH GCPVIEGEKWSA
Sbjct: 179 -------------------VCLKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSA 219
Query: 238 TKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
TKWIHVDSFDKI+ G+CTD N SCERWA LGEC KNPEYMVG+ ++PG CRRSCK C
Sbjct: 220 TKWIHVDSFDKILTHDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 278
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/269 (69%), Positives = 220/269 (81%), Gaps = 4/269 (1%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
A++ P +QISWKPR F+Y+ FL+D E +HLI+LA+++LKRSAVADN+SG+S LSDVRT
Sbjct: 44 AVVYPHHSRQISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSDVRT 103
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSGTF+ KG+D I+ GIEDKIA WTFLPKENGEDIQVLRY+HG+KYEPHYDYF+D VN +
Sbjct: 104 SSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTI 163
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
RGGHR ATVL+YL+DVA+GGETVFP AEE A + SECA+KGIAVKPR+GDAL
Sbjct: 164 RGGHRYATVLLYLTDVAEGGETVFPLAEE----VDDAKDATFSECAQKGIAVKPRKGDAL 219
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWA 267
LFF+L + DPVSLH GC VI GEKWSATKWI V SFDK+ G+CTD N SC +WA
Sbjct: 220 LFFNLKPDGTTDPVSLHGGCAVIRGEKWSATKWIRVASFDKVHYPQGNCTDENESCSKWA 279
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
ALGEC KNPEYMVG+ LPG+CRRSC VC
Sbjct: 280 ALGECIKNPEYMVGTTALPGYCRRSCNVC 308
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/269 (69%), Positives = 222/269 (82%), Gaps = 4/269 (1%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
A++ P +QIS KPR F+Y+ FL+D E +HLI+LA+++LKRSAVADN+SG+S LS+VRT
Sbjct: 44 AVVYPHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRT 103
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSGTF+ KG+D I+ GIEDKIA WTFLPKENGEDIQVLRY+HG+KYEPHYDYF+D VN V
Sbjct: 104 SSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTV 163
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
RGGHR ATVL+YL+DV +GGETVFP AEEP A + LSECA+KGIAV+PR+GDAL
Sbjct: 164 RGGHRYATVLLYLTDVPEGGETVFPLAEEP----DDAKDATLSECAQKGIAVRPRKGDAL 219
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWA 267
LFF+L+ + D VSLH GCPVI+GEKWSATKWI V SFDK+ G+CTD N SC +WA
Sbjct: 220 LFFNLNPDGTTDSVSLHGGCPVIKGEKWSATKWIRVASFDKVHHPQGNCTDENESCAKWA 279
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
ALGEC KNPEYMVG+ LPG+CRRSC VC
Sbjct: 280 ALGECIKNPEYMVGTTALPGYCRRSCNVC 308
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 182/268 (67%), Positives = 220/268 (82%), Gaps = 4/268 (1%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P +QISW PRAF+Y FL+D E +HL++LA+++LKRSAVAD SG+S+LS+VRTS
Sbjct: 45 VVYPHHSRQISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSEVRTS 104
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SGTFI KGKD I+AGIEDKIA WTFLPKENGED+QVLRY+ G+KYEPHYD+F+D VN +
Sbjct: 105 SGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTIL 164
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGHR+ATVL+YL+DVA+GGETVFP A + R + + LSECA+KGIAVKPR+GDALL
Sbjct: 165 GGHRVATVLLYLTDVAEGGETVFPLA----KGRKGSHHKGLSECAQKGIAVKPRKGDALL 220
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAA 268
FF+L +A DP SLH GC VI+GEKWSATKWI V SFDK+ G+CTDN+ SC +WAA
Sbjct: 221 FFNLRPDAATDPTSLHGGCEVIKGEKWSATKWIRVASFDKVYHSPGNCTDNSNSCSQWAA 280
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
LGECTKNP YMVG+A LPG CRRSC VC
Sbjct: 281 LGECTKNPAYMVGTAVLPGHCRRSCNVC 308
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 382 bits (982), Expect = e-104, Method: Compositional matrix adjust.
Identities = 179/274 (65%), Positives = 220/274 (80%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
S +++ P +QISWKPR F+Y+ FL+D E +HL++LA+++LKRSAVADN SG+S L
Sbjct: 40 SVYPASVVYPHHSRQISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTL 99
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
S+VRTS GTFI KGKD I+AGIEDKIA WTFLPKENGED+QVLRY+ G+K EP +D+F+D
Sbjct: 100 SEVRTSYGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTD 159
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
VN VRGGHR+ATVL+YL+DVA+GGETVFP A++ + LSECA+KGIAVKPR
Sbjct: 160 TVNTVRGGHRVATVLLYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSECAQKGIAVKPR 219
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNAS 262
+GDALLFF+L +A DP+SLH GC VI+GEKW+ATKWI V SFDK+ G+C+DNN S
Sbjct: 220 KGDALLFFNLRPDAATDPLSLHGGCTVIKGEKWTATKWIRVASFDKVYHMPGNCSDNNDS 279
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C RWAALGEC KNP YM+G+A LPG CRRSC VC
Sbjct: 280 CVRWAALGECIKNPPYMIGTAALPGHCRRSCNVC 313
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/272 (65%), Positives = 213/272 (78%), Gaps = 6/272 (2%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S +I++P+KV Q+SWKPRAF+Y+GF++ ECDH++ +AK +L++S VADN SG+S LS+
Sbjct: 18 SDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSN 77
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
+RTSSG F+ KG+D +I IE++IA WTFLPKENGE IQVLRYE G+KYEPHYDYF DK
Sbjct: 78 IRTSSGMFLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKY 137
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
N GGHR+ATVLMYLSD KGGETVFP++EE T +D S+CAKKGIAVKPR+G
Sbjct: 138 NQALGGHRIATVLMYLSDAVKGGETVFPSSEE----DTTVKDDSWSDCAKKGIAVKPRKG 193
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCE 264
DALLF+SLH +A PD SLH GCPVIEGEKWSATKWIHV F K +EG C D N C
Sbjct: 194 DALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIHVLPFGKPKKEG--CADENEKCG 251
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA GEC KNP YMVG+ + PG CR+SCKVC
Sbjct: 252 EWAAYGECDKNPSYMVGTQEWPGACRKSCKVC 283
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/272 (65%), Positives = 214/272 (78%), Gaps = 7/272 (2%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S +I++P+KV Q+SWKPRAF+Y+GF++ ECDH++ +AK +L++S VADN SG+S LS+
Sbjct: 32 SDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSN 91
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
+RTSSG F+ KG+D +I IE++IA WTFLPKENGE IQVLRYE G+KYEPHYDYF DK
Sbjct: 92 IRTSSGMFLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKY 151
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
N GGHR+ATVLMYLSDV KGGETVFP++E+ T +D S+CAKKGIAVKPR+G
Sbjct: 152 NQALGGHRIATVLMYLSDVVKGGETVFPSSED-----TTVKDDSWSDCAKKGIAVKPRKG 206
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCE 264
DALLF+SLH +A PD SLH GCPVIEGEKWSATKWIHV F K +EG C D N C
Sbjct: 207 DALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIHVLPFGKPKKEG--CADENEKCG 264
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA GEC KNP YMVG+ + PG CR+SCKVC
Sbjct: 265 EWAAYGECDKNPSYMVGTQEWPGACRKSCKVC 296
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 178/278 (64%), Positives = 210/278 (75%), Gaps = 6/278 (2%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L+ SS+ I+P++V QISW+PRAFVY FLTD ECDH I LAK +L++S VADN SG
Sbjct: 46 LLTDRSSSSPTIDPTRVTQISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESG 105
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+S S+VRTSSG F K +D ++A +E +IA WTFLP+ENGE IQ+L YEHGQKYEPH+D
Sbjct: 106 KSVESEVRTSSGMFFRKAQDQVVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFD 165
Query: 139 YFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
YF DKVN GGHR+ATVLMYLSDV KGGETVFPN+E ++T A DD S+CAKKG A
Sbjct: 166 YFHDKVNQELGGHRVATVLMYLSDVEKGGETVFPNSEA---KKTQAKGDDWSDCAKKGYA 222
Query: 199 VKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTD 258
VKPR+GDALLFFSLH +A DP+SLH CPVIEGEKWSATKWIHV SF+ C D
Sbjct: 223 VKPRKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFETT---SSVCKD 279
Query: 259 NNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
N +C +WA GEC KNP YM+GS G CR+SCKVC
Sbjct: 280 QNPNCPQWATAGECEKNPLYMMGSEDSVGHCRKSCKVC 317
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/267 (64%), Positives = 213/267 (79%), Gaps = 4/267 (1%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+P++VKQ+SWKPRAF+Y FL+D ECDH+I+LAK +L++S VADN SG+S S++RTSS
Sbjct: 1 IDPTRVKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSS 60
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ KG+D II+ IED+IA WTFLPKENGE IQVLRY+ G+KYEPH+DYF DK N G
Sbjct: 61 GMFLMKGQDDIISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALG 120
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLSDV KGGETVFP++E+ R +D S C K G+AVKPR+GDALLF
Sbjct: 121 GHRIATVLMYLSDVVKGGETVFPSSED----RGGPKDDSWSACGKTGVAVKPRKGDALLF 176
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAAL 269
FSLH +A+PD SLH+GCPVIEGEKWSATKWIHV +F+K + G C + SCE WAA
Sbjct: 177 FSLHPSAVPDESSLHTGCPVIEGEKWSATKWIHVAAFEKPRPKNGACVNEVDSCEEWAAY 236
Query: 270 GECTKNPEYMVGSAQLPGFCRRSCKVC 296
GEC KNP YMVG+ + PG+CR++C VC
Sbjct: 237 GECQKNPAYMVGTKEWPGYCRKACHVC 263
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 368 bits (945), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 176/269 (65%), Positives = 210/269 (78%), Gaps = 17/269 (6%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
A++ P +QIS KPR F+Y+ FL+D E +HLI+LA+++LKRSAVADN+SG+S LS+
Sbjct: 44 AVVYPHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--- 100
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
D I+ GIEDKIA WTFLPKENGEDIQVLRY+HG+KYEPHYDYF+D VN V
Sbjct: 101 ----------DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTV 150
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
RGGHR ATVL+YL+DV +GGETVFP AEEP A + LSECA+KGIAV+PR+GDAL
Sbjct: 151 RGGHRYATVLLYLTDVPEGGETVFPLAEEP----DDAKDATLSECAQKGIAVRPRKGDAL 206
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWA 267
LFF+L+ + D VSLH GCPVI+GEKWSATKWI V SFDK+ G+CTD N SC +WA
Sbjct: 207 LFFNLNPDGTTDSVSLHGGCPVIKGEKWSATKWIRVASFDKVHHPQGNCTDENESCAKWA 266
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
ALGEC KNPEYMVG+ LPG+CRRSC VC
Sbjct: 267 ALGECIKNPEYMVGTTALPGYCRRSCNVC 295
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 367 bits (942), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 175/281 (62%), Positives = 216/281 (76%), Gaps = 6/281 (2%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L ++ S+ ++PS V Q+SWKPRAF+YEGFLT ECDHLI++AK +L++S VADN SG
Sbjct: 39 LKSENVPSSVGVDPSHVTQLSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESG 98
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+S S+VRTSSG F+ K +D ++A IE +IA WTFLP ENGE +Q+L YE GQKYEPH+D
Sbjct: 99 KSIPSEVRTSSGMFLQKAQDDVVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFD 158
Query: 139 YFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
YF DKVN GGHR+ATVLMYLS+V +GGETVFPNAE + A N+ LS+CAK G +
Sbjct: 159 YFHDKVNQQLGGHRIATVLMYLSNVEEGGETVFPNAEA---KLQLANNESLSDCAKGGYS 215
Query: 199 VKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEE---GGD 255
VKP++GDALLFFSLH +A D +SLH CPVIEGEKWSATKWIHV SFD+I ++ GD
Sbjct: 216 VKPKKGDALLFFSLHPDASTDSLSLHGSCPVIEGEKWSATKWIHVRSFDRIRKDDPPSGD 275
Query: 256 CTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C D+NA C +WA GEC KNP YMVGS + G+CR+SC VC
Sbjct: 276 CVDDNALCAQWALAGECKKNPLYMVGSKDMKGYCRKSCNVC 316
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 166/280 (59%), Positives = 212/280 (75%), Gaps = 3/280 (1%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S++ K+ +S+ +P++V Q+SW PRAF+Y+GFL+D ECDH I LAK +L++S VADN
Sbjct: 38 SVIKMKTSASSFGFDPTRVTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADND 97
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
SGES S+VRTSSG F+ K +D I+A +E K+A WTF+P+ENGE +Q+L YE+GQKYEPH
Sbjct: 98 SGESVESEVRTSSGMFLSKRQDDIVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPH 157
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+DYF D+ N+ GGHR+ATVLMYLS+V KGGETVFP + + T +D +ECAK+G
Sbjct: 158 FDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKTTQLKDDSWTECAKQG 214
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDC 256
AVKPR+GDALLFF+LH NA D SLH CPV+EGEKWSAT+WIHV SFD+ + C
Sbjct: 215 YAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVRSFDRAFSKQSGC 274
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
D N SCE+WA GEC KNP YMVGS + G+CR+SC VC
Sbjct: 275 VDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCNVC 314
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 360 bits (925), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 170/273 (62%), Positives = 207/273 (75%), Gaps = 5/273 (1%)
Query: 26 STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
S+ +P++V Q+SW PRAF+Y+GFL+D ECDHLI LAK +L++S VADN SG+S +S+V
Sbjct: 47 SSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEV 106
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
RTSSG F+ K +D I+AGIE +IA WTFLP ENGE +Q+L YE+GQKYEPH+DYF DK N
Sbjct: 107 RTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKAN 166
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
V GGHR+ATVLMYLSDV KGGET+FPNA+ + ++ SECA KG AVKPR+GD
Sbjct: 167 QVMGGHRIATVLMYLSDVEKGGETIFPNAKA---KLLQPKDESWSECAHKGYAVKPRKGD 223
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEE--GGDCTDNNASC 263
ALLFFSLH +A D SLH CPVIEGEKWSATKWIHV F K +++ GDC D N +C
Sbjct: 224 ALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQVDSGDCVDENENC 283
Query: 264 ERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
RWA +GEC KNP YMVG + G C +SC VC
Sbjct: 284 PRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVC 316
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 360 bits (924), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 173/281 (61%), Positives = 210/281 (74%), Gaps = 5/281 (1%)
Query: 18 LLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLS 77
L ++K S+ I +P++V Q+SW PRAF+Y+GFL+ ECDHLI+LA+ +L++S VADN S
Sbjct: 32 LRLKKGVVSSRIFDPTRVTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNES 91
Query: 78 GESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHY 137
G+S S+VRTSSG FI K +D I+A IE +IA WTFLP+ENGE +Q+L YEHGQKYEPH+
Sbjct: 92 GKSIESEVRTSSGMFIAKAQDEIVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHF 151
Query: 138 DYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGI 197
DYF DK N GGHR+ATVLMYLS+V KGGETVFPNAE + + D S+CAK G
Sbjct: 152 DYFHDKANQELGGHRVATVLMYLSNVEKGGETVFPNAE---GKLSQPKEDSWSDCAKGGY 208
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEE--GGD 255
AVKP +GDALLFFSLH +A D SLH CPVIEGEKWSATKWIHV SF+K ++ GD
Sbjct: 209 AVKPEKGDALLFFSLHPDATTDSDSLHGSCPVIEGEKWSATKWIHVRSFEKSFKQLGKGD 268
Query: 256 CTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C D N C WA GEC KNP YM+GS G+CR+SCKVC
Sbjct: 269 CVDENDHCPLWAKAGECKKNPLYMIGSGGANGYCRKSCKVC 309
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 166/268 (61%), Positives = 207/268 (77%), Gaps = 5/268 (1%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
+P++V Q+SW+PRAF+Y GFL+ ECDHL+NLAK ++++S VADN SG+S +S VRTSSG
Sbjct: 33 DPARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSG 92
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ K +D I++GIE ++A WTFLP+EN E IQ+L YE GQKY+ H+DYF DK N+ RGG
Sbjct: 93 TFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGG 152
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
HR+ATVLMYL+DV KGGETVFPNA R ++ S+CA+ G+AVKP++GDALLFF
Sbjct: 153 HRVATVLMYLTDVKKGGETVFPNA---AGRHLQLKDETWSDCARSGLAVKPKKGDALLFF 209
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD--CTDNNASCERWAA 268
SLH NA DP SLH CPVIEGEKWSATKWIHV SFD + D C+D N C RWAA
Sbjct: 210 SLHVNATTDPASLHGSCPVIEGEKWSATKWIHVRSFDNPPDVSLDLPCSDENERCTRWAA 269
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+GEC +NP+YMVG+ GFCR+SC VC
Sbjct: 270 VGECYRNPKYMVGTKDSLGFCRKSCGVC 297
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 359 bits (922), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 165/280 (58%), Positives = 211/280 (75%), Gaps = 3/280 (1%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S++ K+ +S+ +P++V Q+SW PR F+YEGFL+D ECDH I LAK +L++S VADN
Sbjct: 54 SVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADND 113
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
SGES S+VRTSSG F+ K +D I++ +E K+A WTFLP+ENGE +Q+L YE+GQKYEPH
Sbjct: 114 SGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPH 173
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+DYF D+ N+ GGHR+ATVLMYLS+V KGGETVFP + + T +D +ECAK+G
Sbjct: 174 FDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDSWTECAKQG 230
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDC 256
AVKPR+GDALLFF+LH NA D SLH CPV+EGEKWSAT+WIHV SF++ + C
Sbjct: 231 YAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGC 290
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
D N SCE+WA GEC KNP YMVGS + G+CR+SCK C
Sbjct: 291 MDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 330
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 359 bits (921), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 165/280 (58%), Positives = 211/280 (75%), Gaps = 3/280 (1%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S++ K+ +S+ +P++V Q+SW PR F+YEGFL+D ECDH I LAK +L++S VADN
Sbjct: 38 SVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADND 97
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
SGES S+VRTSSG F+ K +D I++ +E K+A WTFLP+ENGE +Q+L YE+GQKYEPH
Sbjct: 98 SGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPH 157
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+DYF D+ N+ GGHR+ATVLMYLS+V KGGETVFP + + T +D +ECAK+G
Sbjct: 158 FDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDSWTECAKQG 214
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDC 256
AVKPR+GDALLFF+LH NA D SLH CPV+EGEKWSAT+WIHV SF++ + C
Sbjct: 215 YAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGC 274
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
D N SCE+WA GEC KNP YMVGS + G+CR+SCK C
Sbjct: 275 MDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 172/282 (60%), Positives = 214/282 (75%), Gaps = 5/282 (1%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S+L K + +P++V Q+SW+PRAF+Y+GFL++ ECDHLI LAK +L++S VADN
Sbjct: 36 SVLGLKPRGFASGFDPTRVTQLSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNE 95
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
SG+S +S+VRTSSG F+ K +D I+A IE +IA WTFLP ENGE IQ+L YE+G+KYEPH
Sbjct: 96 SGKSIMSEVRTSSGMFLLKAQDEIVADIEARIAAWTFLPVENGESIQILHYENGEKYEPH 155
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+DYF DKVN + GGHR+ATVLMYL+ V +GGETVFPN+E R + +D S+CAKKG
Sbjct: 156 FDYFHDKVNQLLGGHRIATVLMYLATVEEGGETVFPNSE---GRFSQPKDDSWSDCAKKG 212
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG--G 254
AV P++GDALLFFSLH +A DP SLH CPVI GEKWSATKWIHV SFDK + G G
Sbjct: 213 YAVNPKKGDALLFFSLHPDATTDPSSLHGSCPVIAGEKWSATKWIHVRSFDKPSKRGAQG 272
Query: 255 DCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+C D + C +WAA+GEC KNP YMVGS GFCR+SC VC
Sbjct: 273 ECVDEDEHCPKWAAVGECEKNPVYMVGSENSDGFCRKSCGVC 314
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 169/273 (61%), Positives = 206/273 (75%), Gaps = 5/273 (1%)
Query: 26 STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
S+ +P++V Q+SW PRAF+Y+GFL+D ECDHLI LAK +L++S VADN SG+S +S+V
Sbjct: 47 SSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEV 106
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
RTSSG F+ K +D I+AGIE +IA WTFLP ENGE +Q+L YE+GQKYEPH+DYF DK N
Sbjct: 107 RTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKAN 166
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
V GGHR+ATVLMYLSDV KGGET+F NA+ + ++ SECA KG AVKPR+GD
Sbjct: 167 QVMGGHRIATVLMYLSDVEKGGETIFSNAKA---KLLQPKDESWSECAHKGYAVKPRKGD 223
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEE--GGDCTDNNASC 263
ALLFFSLH +A D SLH CPVIEGEKWSATKWIHV F K +++ GDC D N +C
Sbjct: 224 ALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQVDSGDCVDENENC 283
Query: 264 ERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
RWA +GEC KNP YMVG + G C +SC VC
Sbjct: 284 PRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVC 316
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 358 bits (918), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 165/280 (58%), Positives = 210/280 (75%), Gaps = 3/280 (1%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S++ K+ +S+ +P++V Q+SW PR F+YEGFL+D ECDH I LAK +L++S VADN
Sbjct: 38 SVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADND 97
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
SGES S+VRTSSG F+ K +D I+ +E K+A WTFLP+ENGE +Q+L YE+GQKYEPH
Sbjct: 98 SGESVESEVRTSSGMFLSKRQDDIVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPH 157
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+DYF D+ N+ GGHR+ATVLMYLS+V KGGETVFP + + T +D +ECAK+G
Sbjct: 158 FDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDSWTECAKQG 214
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDC 256
AVKPR+GDALLFF+LH NA D SLH CPV+EGEKWSAT+WIHV SF++ + C
Sbjct: 215 YAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFNKQSGC 274
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
D N SCE+WA GEC KNP YMVGS + G+CR+SCK C
Sbjct: 275 MDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 357 bits (916), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 167/270 (61%), Positives = 208/270 (77%), Gaps = 6/270 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFLTD EC+HLI+LAK +L++S VADN SG+S +S+VRTSS
Sbjct: 40 FDPSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSS 99
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D ++A IE++IA WTFLP +NGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 100 GMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALG 159
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLSDV KGGET+FP AE + +D S+CAK G AVKP +GDALLF
Sbjct: 160 GHRIATVLMYLSDVGKGGETIFPEAE---GKLLQPKDDTWSDCAKNGYAVKPVKGDALLF 216
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNNASCERW 266
FSLH +A D SLH CPVIEG+KWSATKWIHV SFD V++G C D N C +W
Sbjct: 217 FSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQGASTDGCEDENVLCPQW 276
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SC VC
Sbjct: 277 AAVGECAKNPNYMVGTNEAPGFCRKSCNVC 306
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 357 bits (916), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 167/270 (61%), Positives = 208/270 (77%), Gaps = 6/270 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFLTD EC+HLI+LAK +L++S VADN SG+S +S+VRTSS
Sbjct: 40 FDPSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSS 99
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D ++A IE++IA WTFLP +NGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 100 GMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALG 159
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLSDV KGGET+FP AE + +D S+CAK G AVKP +GDALLF
Sbjct: 160 GHRIATVLMYLSDVGKGGETIFPEAE---GKLLQPKDDTWSDCAKNGYAVKPVKGDALLF 216
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNNASCERW 266
FSLH +A D SLH CPVIEG+KWSATKWIHV SFD V++G C D N C +W
Sbjct: 217 FSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQGASTDGCEDENVLCPQW 276
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SC VC
Sbjct: 277 AAVGECAKNPNYMVGTNEAPGFCRKSCNVC 306
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 357 bits (915), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 167/270 (61%), Positives = 208/270 (77%), Gaps = 5/270 (1%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFLTD EC+HLI+LAK +L++S VADN SG+S +S+VRTSS
Sbjct: 40 FDPSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSS 99
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D ++A IE++IA WTFLP +NGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 100 GMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALG 159
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLSDV KGGET+FP AE + +D S+CAK G AVKP +GDALLF
Sbjct: 160 GHRIATVLMYLSDVGKGGETIFPEAE--VGKLLQPKDDTWSDCAKNGYAVKPVKGDALLF 217
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNNASCERW 266
FSLH +A D SLH CPVIEG+KWSATKWIHV SFD V++G C D N C +W
Sbjct: 218 FSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQGASTDGCEDENVLCPQW 277
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SC VC
Sbjct: 278 AAVGECAKNPNYMVGTNEAPGFCRKSCNVC 307
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 356 bits (914), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 167/270 (61%), Positives = 210/270 (77%), Gaps = 6/270 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFL++ ECDH+I LAK +L++S VADN SG+S S+VRTSS
Sbjct: 30 FDPSRVVQLSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSS 89
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D ++A IE++IA WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 90 GMFLEKRQDEVVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALG 149
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLS+V KGGET+FPNAE + T ++ SECAK G AVKP +GDALLF
Sbjct: 150 GHRIATVLMYLSNVEKGGETIFPNAE---GKLTQHKDETASECAKNGYAVKPMKGDALLF 206
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG--GD-CTDNNASCERW 266
FSLH +A DP SLH CPVIEG+KWSATKWIHV SF+ ++G GD C D N C +W
Sbjct: 207 FSLHPDATTDPDSLHGSCPVIEGQKWSATKWIHVRSFENPGKQGASGDGCEDENVLCAQW 266
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SC +C
Sbjct: 267 AAVGECAKNPNYMVGTKEAPGFCRKSCNLC 296
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 356 bits (914), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 168/273 (61%), Positives = 207/273 (75%), Gaps = 5/273 (1%)
Query: 26 STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
S+ +P++V Q+SW PRAF+Y+GFL++ ECDHLI LAK +L++S VADN SG+S +SD+
Sbjct: 48 SSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSDI 107
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
RTSSG F+ K +D I+AGIE +IA WTFLP ENGE +Q+L YE+GQKYEPH+DYF DK N
Sbjct: 108 RTSSGMFLNKAQDEIVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYFHDKAN 167
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
V GGHR+ATVLMYLSDV KGGET+FPNAE + ++ SECA KG AVKP++GD
Sbjct: 168 QVMGGHRIATVLMYLSDVEKGGETIFPNAEA---KLLQPKDESWSECAHKGYAVKPQKGD 224
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEE--GGDCTDNNASC 263
ALLFFSLH +A D SLH CPVIEGEKWSATKWIHV F+K ++ G+C D N +C
Sbjct: 225 ALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVSDFEKPFKQVDNGECVDENENC 284
Query: 264 ERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
RWA +GEC KNP YMVG + G C +SC VC
Sbjct: 285 PRWAKVGECDKNPLYMVGGEGVRGSCMKSCNVC 317
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 356 bits (914), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 168/270 (62%), Positives = 209/270 (77%), Gaps = 6/270 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFL+D ECDHLI LAK +L++S VADN SG+S S+VRTSS
Sbjct: 35 FDPSRVVQLSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSEVRTSS 94
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D ++ GIE++IA WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 95 GMFLEKKQDEVVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALG 154
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLS+V KGGET+FPNAE + +D S+CA+ G AVKP +GDALLF
Sbjct: 155 GHRIATVLMYLSNVEKGGETIFPNAE---GKLLQPKDDTWSDCARNGYAVKPVKGDALLF 211
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNNASCERW 266
FSLH +A D SLH CPVIEG+KWSATKWIHV SFD V++ G C D+N C +W
Sbjct: 212 FSLHPDATTDSESLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNVLCPQW 271
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SCKVC
Sbjct: 272 AAVGECAKNPNYMVGTKEAPGFCRKSCKVC 301
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 356 bits (914), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 166/271 (61%), Positives = 207/271 (76%), Gaps = 5/271 (1%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
+P++V Q+SW+PRAF+Y GFL+ ECDHL+NLAK ++++S VADN SG+S +S VRT
Sbjct: 30 GFYDPARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRT 89
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSGTF+ K +D I++GIE ++A WTFLP+EN E IQ+L YE GQKY+ H+DYF DK N+
Sbjct: 90 SSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLK 149
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
RGGHR+ATVLMYL+DV KGGETVFPNA R ++ S+CA+ G+AVKP++GDAL
Sbjct: 150 RGGHRVATVLMYLTDVKKGGETVFPNAAG---RHLQLKDETWSDCARSGLAVKPKKGDAL 206
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD--CTDNNASCER 265
LFFSLH NA DP SLH CPVIEGEKWSATKWIHV SFD + D C+D N C R
Sbjct: 207 LFFSLHVNATTDPASLHGSCPVIEGEKWSATKWIHVRSFDNPPDVSLDLPCSDENERCTR 266
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA+GEC +NP+YMVG+ GFCR+SC VC
Sbjct: 267 WAAVGECYRNPKYMVGTKDSLGFCRKSCGVC 297
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 355 bits (912), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 165/273 (60%), Positives = 208/273 (76%), Gaps = 12/273 (4%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+VK +SW+PR FVY+GFL+D ECDHL+ LAK +++RS VADN SG+S +S+VRTSS
Sbjct: 39 FNSSRVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSS 98
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE++IA WTFLP+EN E++Q+LRYEHGQKYEPH+DYF DK+N VRG
Sbjct: 99 GMFLNKRQDPVVSRIEERIAAWTFLPQENAENMQILRYEHGQKYEPHFDYFHDKINQVRG 158
Query: 150 GHRLATVLMYLSDVAKGGETVFPNA---EEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
GHR ATVLMYLS V KGGETVFPNA E P+ +D SECA +G+AVKP +GDA
Sbjct: 159 GHRYATVLMYLSTVDKGGETVFPNAKGWESQPK------DDTFSECAHQGLAVKPVKGDA 212
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK---IVEEGGDCTDNNASC 263
+LFFSLH + +PDP+SLH CPVI+GEKWSA KWIHV S++ + ++ C D + C
Sbjct: 213 VLFFSLHVDGVPDPLSLHGSCPVIQGEKWSAPKWIHVRSYENPPVVPKDTRGCADKSEHC 272
Query: 264 ERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA GEC KNP YMVG+ PG CR+SC VC
Sbjct: 273 AEWAAAGECGKNPVYMVGAEGAPGQCRKSCNVC 305
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 167/274 (60%), Positives = 207/274 (75%), Gaps = 5/274 (1%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
++ + ++V Q+SW+PRAF+Y GFL+ ECDHL+ LAK +L++S VADN SG+S +S
Sbjct: 21 AAGGFYDQARVTQLSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQ 80
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
VRTSSGTF+ K +D II+GIE ++A WTFLP+EN E IQVL YE GQKY+ H+DYF DK
Sbjct: 81 VRTSSGTFLNKHEDEIISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKN 140
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
N GGHR+ATVLMYL+DV KGGETVFPNAE R ++ SECA+ G+AVKPR+G
Sbjct: 141 NQKLGGHRVATVLMYLTDVKKGGETVFPNAEG---RHLQHKDETWSECARSGLAVKPRKG 197
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK--IVEEGGDCTDNNAS 262
DALLFFSLH NA DP SLH CPVIEGEKWSATKWIHV SFD IV C+D+N
Sbjct: 198 DALLFFSLHINATTDPSSLHGSCPVIEGEKWSATKWIHVRSFDNPPIVRMDVRCSDDNEL 257
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C +WAA+GEC +NP+YM+G+ GFCR+SC +C
Sbjct: 258 CSKWAAVGECYRNPKYMIGTKDTLGFCRKSCGIC 291
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 354 bits (908), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 166/271 (61%), Positives = 205/271 (75%), Gaps = 9/271 (3%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+V+ +SW+PR FVY+GFL+D ECDHL+ L K +++RS VADN SG+S +S+VRTSS
Sbjct: 52 FNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSS 111
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE +IA WTFLP+EN E+IQ+LRYEHGQKYEPH+DYF DKVN G
Sbjct: 112 GMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALG 171
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR ATVLMYLS V KGGETVFPNAE +D SECA+KG+AVKP +GD +LF
Sbjct: 172 GHRYATVLMYLSTVEKGGETVFPNAEG---WENQPKDDTFSECAQKGLAVKPVKGDTVLF 228
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD----KIVEEGGDCTDNNASCER 265
FSLH + +PDP+SLH CPVIEGEKWSA KWI + S++ V EG C+DN+A C +
Sbjct: 229 FSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSKVTEG--CSDNSARCAK 286
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WA GEC KNP YMVG+ LPG CR+SC VC
Sbjct: 287 WAEAGECEKNPVYMVGAEGLPGNCRKSCGVC 317
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 354 bits (908), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 169/278 (60%), Positives = 206/278 (74%), Gaps = 5/278 (1%)
Query: 21 RKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
+K ++ +P++V Q+SW PRAF+Y+GFL+D ECDHL+NLA+ +L++S VADN SG+S
Sbjct: 32 KKILQKKSVFDPTRVTQLSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKS 91
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
S+VRTSSG FI K +D I+ IE +IA WTFLP+ENGE IQ+L YEHGQKYEPH+DYF
Sbjct: 92 IESEVRTSSGMFIGKSQDEIVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYF 151
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
DK N GGHR+ TVLMYLS+V KGGETVFPN+E + +D S+CAK G AVK
Sbjct: 152 HDKANQELGGHRVVTVLMYLSNVGKGGETVFPNSE---GKTIQPKDDSWSDCAKNGYAVK 208
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG--GDCTD 258
P++GDALLFFSLH +A D SLH CPVIEGEKWSATKWIHV SF+K ++ G C D
Sbjct: 209 PQKGDALLFFSLHPDATTDTNSLHGSCPVIEGEKWSATKWIHVRSFEKSLKHAASGGCID 268
Query: 259 NNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
N +C WA GEC KNP YMVGS G CR+SCKVC
Sbjct: 269 ENENCPLWAKAGECQKNPVYMVGSEGSYGSCRKSCKVC 306
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 353 bits (907), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 170/291 (58%), Positives = 215/291 (73%), Gaps = 8/291 (2%)
Query: 13 LLSFSLLIRKSFSSTA----IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
L+FSL F++ A I +P++V Q+SW+PRAF+Y+GFL+D ECDHLI+LAK +L+
Sbjct: 6 FLAFSLCFLSVFTAFAFFSLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLE 65
Query: 69 RSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYE 128
+S VADN SG+S S+VRTSSG F+ K +D ++AG+E +IA WT LP ENGE IQ+L YE
Sbjct: 66 KSMVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYE 125
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD 188
+GQKYEPH+D+F DKVN GGHR+ATVLMYLS+V KGGET+FPN+E + A ++
Sbjct: 126 NGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDES 185
Query: 189 LSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
S+C++KG AVK ++GDALLFFSL+ +A D SLH CPVI GEKWSATKWIHV SF+K
Sbjct: 186 WSDCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEK 245
Query: 249 I---VEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
I V G C D N +C WA GEC KNP YMVGS G+CR+SCK C
Sbjct: 246 ITSRVSRQG-CVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC 295
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 353 bits (906), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 166/271 (61%), Positives = 205/271 (75%), Gaps = 9/271 (3%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+V+ +SW+PR FVY+GFL+D ECDHL+ L K +++RS VADN SG+S +S+VRTSS
Sbjct: 46 FNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSS 105
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE +IA WTFLP+EN E+IQ+LRYEHGQKYEPH+DYF DKVN G
Sbjct: 106 GMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALG 165
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR ATVLMYLS V KGGETVFPNAE +D SECA+KG+AVKP +GD +LF
Sbjct: 166 GHRYATVLMYLSTVEKGGETVFPNAEG---WENQPKDDTFSECAQKGLAVKPVKGDTVLF 222
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD----KIVEEGGDCTDNNASCER 265
FSLH + +PDP+SLH CPVIEGEKWSA KWI + S++ V EG C+DN+A C +
Sbjct: 223 FSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSKVTEG--CSDNSARCAK 280
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WA GEC KNP YMVG+ LPG CR+SC VC
Sbjct: 281 WAEAGECEKNPVYMVGAEGLPGNCRKSCGVC 311
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 170/285 (59%), Positives = 215/285 (75%), Gaps = 7/285 (2%)
Query: 15 SFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
S S+L K+ SS I +P++V Q+SW+PRAF+Y+GFL+D ECDHLI+LAK +L++S VAD
Sbjct: 33 SGSVLRLKTDSSPLIFDPTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVAD 92
Query: 75 NLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYE 134
N SG+S S+VRTSSG F+ K +D ++AG+E +IA WT LP ENGE IQ+L YE+GQKYE
Sbjct: 93 NDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYE 152
Query: 135 PHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK 194
PH+D+F DKVN GGHR+ATVLMYLS+V KGGET+FPN+E + + A ++ S+C++
Sbjct: 153 PHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSE---FKESQAKDESWSDCSR 209
Query: 195 KGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI---VE 251
KG AVK ++GDALLFFSL+ +A D SLH CPVI GEKWSATKWIHV SF+KI V
Sbjct: 210 KGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRVS 269
Query: 252 EGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
G C D N +C WA GEC KNP YMVGS G+CR+SCK C
Sbjct: 270 RQG-CVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC 313
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 165/270 (61%), Positives = 207/270 (76%), Gaps = 6/270 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFL D ECDHLI LAK +L++S VADN SG+S S+VRTSS
Sbjct: 30 FDPSRVVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSS 89
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 90 GMFLEKKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALG 149
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLS+V KGGET+FPNAE + +D S+CA+ G AVKP +GDALLF
Sbjct: 150 GHRIATVLMYLSNVEKGGETIFPNAE---GKLLQPKDDTWSDCARNGYAVKPVKGDALLF 206
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGG---DCTDNNASCERW 266
FSLH ++ D SLH CPVIEG+KWSATKWIHV SFD V++ G C D+N C +W
Sbjct: 207 FSLHPDSTTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQW 266
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SCKVC
Sbjct: 267 AAVGECAKNPNYMVGTKEAPGFCRKSCKVC 296
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 351 bits (900), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 175/290 (60%), Positives = 216/290 (74%), Gaps = 10/290 (3%)
Query: 11 FFLLSFSLLIRKSFSSTA--IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
F SFSLL+ S S++ +P++V Q+SW PRAF+Y GFL+D ECDHLINLAK +L+
Sbjct: 6 FLAFSFSLLLIFSQISSSSFTFDPTRVTQLSWTPRAFLYNGFLSDEECDHLINLAKGKLE 65
Query: 69 RS-AVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRY 127
+S VAD+ SGES S+ RTSSG F+ K +D I+A +E K+ATWTFLP+ENGE +Q+L Y
Sbjct: 66 KSMVVADDNSGESIDSEERTSSGVFLTKRQDDIVANVEAKLATWTFLPEENGEALQILHY 125
Query: 128 EHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATND 187
E+GQKY+PH+DY+ DK + GGHR+ATVLMYLS+V KGGETVFP + +TP D
Sbjct: 126 ENGQKYDPHFDYYYDKETLKLGGHRIATVLMYLSNVTKGGETVFPMW----KGKTPQLKD 181
Query: 188 DL-SECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SECAK+G AVKPR+GDALLFF+LH NA DP SLH CPVIEGEKWSAT+WIHV SF
Sbjct: 182 DTWSECAKQGYAVKPRKGDALLFFNLHPNATTDPTSLHGSCPVIEGEKWSATRWIHVRSF 241
Query: 247 DKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
K +G C D++ SCE WA GEC KNP YM+GS G+CR+SCK C
Sbjct: 242 GKKQSDG--CVDDHESCEIWAKAGECEKNPMYMMGSETDLGYCRKSCKAC 289
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 350 bits (898), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 164/270 (60%), Positives = 208/270 (77%), Gaps = 6/270 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFL+D ECDHLI LAK +L++S VADN SG+S S+VRTSS
Sbjct: 31 FDPSRVVQLSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSS 90
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ + +D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 91 GMFLERKQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALG 150
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLS+V KGGET+FPNAE + ++ S+CA+ G AVKP +GDALLF
Sbjct: 151 GHRIATVLMYLSNVEKGGETIFPNAE---GKLLQPKDNTWSDCARNGYAVKPVKGDALLF 207
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNNASCERW 266
FSLH +A D SLH CPVIEG+KWSATKWIHV SFD V++ G C D+N C +W
Sbjct: 208 FSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNILCPQW 267
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SCKVC
Sbjct: 268 AAVGECAKNPNYMVGTKEAPGFCRKSCKVC 297
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 350 bits (897), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 164/270 (60%), Positives = 206/270 (76%), Gaps = 6/270 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFL D ECDHLI LAK +L++S VADN SG+S S+VRTSS
Sbjct: 30 FDPSRVVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSS 89
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 90 GMFLEKKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALG 149
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR+ATVLMYLS+V KGGET+FPNAE + +D S+CA+ G AVKP +GDALLF
Sbjct: 150 GHRIATVLMYLSNVEKGGETIFPNAE---GKLLQPKDDTWSDCARNGYAVKPVKGDALLF 206
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGG---DCTDNNASCERW 266
FSLH ++ D SLH CP IEG+KWSATKWIHV SFD V++ G C D+N C +W
Sbjct: 207 FSLHPDSTTDSDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQW 266
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA+GEC KNP YMVG+ + PGFCR+SCKVC
Sbjct: 267 AAVGECAKNPNYMVGTKEAPGFCRKSCKVC 296
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 348 bits (892), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 163/271 (60%), Positives = 200/271 (73%), Gaps = 5/271 (1%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
+P++V Q+SW+PRAF+Y GFL+D ECDHLINLAK +++S VADN SG+S +S VRT
Sbjct: 28 GFYDPARVTQLSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRT 87
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSG F+ K +D I++ IE ++A WTFLP+EN E +QVLRYE GQKY+ H+DYF DK N+
Sbjct: 88 SSGAFLAKHEDEIVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVK 147
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
GG R ATVLMYL+DV KGGETVFPNAE + T SEC++ G+AVKP++GDAL
Sbjct: 148 HGGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDET---WSECSRSGLAVKPKKGDAL 204
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK--IVEEGGDCTDNNASCER 265
LFF LH NA D SLH CPVIEGEKWSATKWIHV SFD V C+D+N C +
Sbjct: 205 LFFGLHLNATTDTSSLHGSCPVIEGEKWSATKWIHVRSFDNPPNVRMDAPCSDDNELCPK 264
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA+GEC KNP YMVG+ GFCR+SC +C
Sbjct: 265 WAAIGECYKNPTYMVGTKDTNGFCRKSCGLC 295
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 166/271 (61%), Positives = 206/271 (76%), Gaps = 13/271 (4%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRS-AVADNLSGESKLSDVRTS 88
++P+++ Q+SW PRAF+Y+GFL+D ECDHLI LAK +L++S VAD SGES+ S+VRTS
Sbjct: 27 VDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTS 86
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SG F+ K +D I+A +E K+A WTFLP+ENGE +Q+L YE+GQKY+PH+DYF DK +
Sbjct: 87 SGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALEL 146
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD-LSECAKKGIAVKPRRGDAL 207
GGHR+ATVLMYLS+V KGGETVFPN + +TP DD S+CAK+G AVKPR+GDAL
Sbjct: 147 GGHRIATVLMYLSNVTKGGETVFPNW----KGKTPQLKDDSWSKCAKQGYAVKPRKGDAL 202
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF--DKIVEEGGDCTDNNASCER 265
LFF+LH N DP SLH CPVIEGEKWSAT+WIHV SF K+V C D++ SC+
Sbjct: 203 LFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV-----CVDDHESCQE 257
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WA GEC KNP YMVGS GFCR+SCK C
Sbjct: 258 WADAGECEKNPMYMVGSETSLGFCRKSCKAC 288
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 337 bits (863), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 156/269 (57%), Positives = 200/269 (74%), Gaps = 5/269 (1%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S V ISWKPR F Y+GFL+D ECDHL+ L K +LKRS VADN SG+S +S+VRTSS
Sbjct: 43 FNASSVTIISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSEVRTSS 102
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++GIE++IA WT LP+EN E+IQ+LRYE+GQKY+PH+DYF DKVN ++G
Sbjct: 103 GMFLDKQQDPVVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQG 162
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR ATVL YLS V KGGETVFPNAE + +D S+CAKKG+AVK +GD++LF
Sbjct: 163 GHRYATVLTYLSTVEKGGETVFPNAE---GWESQPKDDSFSDCAKKGLAVKAVKGDSVLF 219
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI--VEEGGDCTDNNASCERWA 267
F+L + PDP+SLH CPVIEGEKWSA KWIHV S+D +++ +C+D + +C WA
Sbjct: 220 FNLQPDGTPDPLSLHGSCPVIEGEKWSAPKWIHVRSYDNASSMKQSEECSDLSENCAAWA 279
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A GEC N YM+G+ PG C++SC C
Sbjct: 280 ASGECNNNAVYMIGTEDAPGQCQKSCNAC 308
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 337 bits (863), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 162/258 (62%), Positives = 194/258 (75%), Gaps = 8/258 (3%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P F+Y+ FLTD ECDHLI LA+ +L++S VADN SG+S +S++RTSSG F+ K +D I+
Sbjct: 28 PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSEIRTSSGMFLNKAQDEIV 87
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLS 161
A +ED+IA WTFLP ENGE +QVL YE GQKYEPH+DYF DK+N GGHR+ATVLMYLS
Sbjct: 88 ASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVLMYLS 147
Query: 162 DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPV 221
DV KGGETVFPNAE + + +D SECAK G +VKP +GDALLFFSL +A D
Sbjct: 148 DVVKGGETVFPNAE---TKDSQPKDDSWSECAKGGYSVKPNKGDALLFFSLRPDATTDQS 204
Query: 222 SLHSGCPVIEGEKWSATKWIHVDSFD---KIVEEGGDCTDNNASCERWAALGECTKNPEY 278
SLH CPVIEGEKWSATKWIHV SF+ + + EG C D N SC WA++GEC KNP Y
Sbjct: 205 SLHGSCPVIEGEKWSATKWIHVRSFEVSNRKISEG--CVDENDSCTHWASIGECKKNPTY 262
Query: 279 MVGSAQLPGFCRRSCKVC 296
MVGS PG CR+SC+VC
Sbjct: 263 MVGSPDSPGACRKSCQVC 280
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 336 bits (862), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 163/276 (59%), Positives = 202/276 (73%), Gaps = 7/276 (2%)
Query: 24 FSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS 83
F + +P++V Q+SW PRAF+Y+ FLTD ECDHLI L+K +L++S VADN SG+S S
Sbjct: 40 FGAKVKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQS 99
Query: 84 DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK 143
+VRTSSG F+ K +D I++GIE +IA WTFLP ENGE +QVL Y +G+KYEPH+D+F DK
Sbjct: 100 EVRTSSGMFLNKQQDEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDK 159
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
N GGHR+ATVLMYLS+V KGGET+FP+AE + + ++ SECA KG AVKPR+
Sbjct: 160 ANQRLGGHRVATVLMYLSNVEKGGETIFPHAE---GKLSQPKDESWSECAHKGYAVKPRK 216
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNN 260
GDALLFFSLH +A D SLH CPVIEGEKWSATKWIHV F+K V + + C D N
Sbjct: 217 GDALLFFSLHLDATTDSKSLHGSCPVIEGEKWSATKWIHVADFEKPVRQALEDRVCADEN 276
Query: 261 ASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+C RWA +GEC KNP YMVG G C +SC VC
Sbjct: 277 ENCARWAKVGECEKNPLYMVGKGG-NGKCMKSCNVC 311
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 336 bits (861), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 162/292 (55%), Positives = 205/292 (70%), Gaps = 9/292 (3%)
Query: 11 FFLLSFSLLIRK----SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQ 66
FLL+ +L R +P+ V Q+S +PRAF+Y GFL+D ECDH+++LAK
Sbjct: 7 LFLLAAIVLSRAVSHGHGGGGGFYDPASVTQLSSRPRAFLYSGFLSDTECDHIVSLAKGS 66
Query: 67 LKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLR 126
+++S VADN SG+S S RTSSGTF+ K +D I++ IE ++A WTFLP+EN E +QVLR
Sbjct: 67 MEKSMVADNDSGKSVASQARTSSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLR 126
Query: 127 YEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATN 186
YE GQKY+ H+DYF D+ N+ GG R+ATVLMYL+DV KGGETVFPNAE + T
Sbjct: 127 YETGQKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDVKKGGETVFPNAEGSHLQYKDET- 185
Query: 187 DDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
SEC++ G+AVKP++GDALLFF+LH NA D SLH CPVIEGEKWSATKWIHV SF
Sbjct: 186 --WSECSRSGLAVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIHVRSF 243
Query: 247 DKI--VEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
D V C+D+ C RWAA+GEC +NP YMVG+ GFCR+SC +C
Sbjct: 244 DNPPDVRTDAPCSDDKELCPRWAAIGECHRNPTYMVGTKDTLGFCRKSCGIC 295
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 158/271 (58%), Positives = 198/271 (73%), Gaps = 5/271 (1%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
+P+ V Q+S +PRAF+Y GFL+D ECDHL++LAK +++S VADN SG+S S RT
Sbjct: 28 GFYDPASVTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQART 87
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSGTF+ K +D I++ IE ++A WTFLP+EN E +QVLRYE GQKY+ H+DYF D+ N+
Sbjct: 88 SSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLK 147
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
GG R+ATVLMYL+DV KGGETVFPNAE + T SEC++ G+AVKP++GDAL
Sbjct: 148 LGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDET---WSECSRSGLAVKPKKGDAL 204
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK--IVEEGGDCTDNNASCER 265
LFF+LH NA D SLH CPVIEGEKWSATKWIHV SFD V C+D+ C R
Sbjct: 205 LFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVRTDAPCSDDKELCPR 264
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA+GEC +NP YMVG+ GFCR+SC +C
Sbjct: 265 WAAIGECHRNPTYMVGTKDTLGFCRKSCGIC 295
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 156/254 (61%), Positives = 193/254 (75%), Gaps = 9/254 (3%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+V+ +SW+PR FVY+GFL+D ECDHL+ L K +++RS VADN SG+S +S+VRTSS
Sbjct: 52 FNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSS 111
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE +IA WTFLP+EN E+IQ+LRYEHGQKYEPH+DYF DKVN G
Sbjct: 112 GMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALG 171
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR ATVLMYLS V KGGETVFPNAE +D SECA+KG+AVKP +GDA+LF
Sbjct: 172 GHRYATVLMYLSTVEKGGETVFPNAEG---WENQPKDDTFSECAQKGLAVKPVKGDAVLF 228
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD----KIVEEGGDCTDNNASCER 265
FSLH + +PDP+SLH CPVIEGEKWSA KWI + S++ V EG C+DN+A C +
Sbjct: 229 FSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSKVTEG--CSDNSARCAK 286
Query: 266 WAALGECTKNPEYM 279
WA GEC KNP YM
Sbjct: 287 WAEAGECEKNPVYM 300
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 333 bits (853), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 161/288 (55%), Positives = 206/288 (71%), Gaps = 11/288 (3%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S++ K+ +S+ +P++V Q+SW PR F+YEGFL+D ECDH I LAK +L++S VADN
Sbjct: 38 SVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADND 97
Query: 77 SGESKLSD----VRTSSGTFIPKGK----DAIIAGIEDKIATWTFLPKENGEDIQVLRYE 128
SGES S+ V S +FI D I++ +E K+A WTFLP+ENGE +Q+L YE
Sbjct: 98 SGESVESEDSVSVVRQSSSFIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYE 157
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD 188
+GQKYEPH+DYF D+ N+ GGHR+ATVLMYLS+V KGGETVFP + + T +D
Sbjct: 158 NGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWK---GKATQLKDDS 214
Query: 189 LSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
+ECAK+G AVKPR+GDALLFF+LH NA D SLH CPV+EGEKWSAT+WIHV SF++
Sbjct: 215 WTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFER 274
Query: 249 IVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+ C D N SCE+WA GEC KNP YMVGS + G+CR+SCK C
Sbjct: 275 AFNKQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 333 bits (853), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 155/254 (61%), Positives = 192/254 (75%), Gaps = 9/254 (3%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+V+ +SW+PR FVY+GFL+D ECDHL+ L K +++RS VADN SG+S +S+VRTSS
Sbjct: 52 FNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSS 111
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE +IA WTFLP+EN E+IQ+LRYEHGQKYEPH+DYF DKVN G
Sbjct: 112 GMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALG 171
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
GHR ATVLMYLS V KGGETVFPNAE +D SECA+KG+AVKP +GD +LF
Sbjct: 172 GHRYATVLMYLSTVEKGGETVFPNAEG---WENQPKDDTFSECAQKGLAVKPVKGDTVLF 228
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD----KIVEEGGDCTDNNASCER 265
FSLH + +PDP+SLH CPVIEGEKWSA KWI + S++ V EG C+DN+A C +
Sbjct: 229 FSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSKVTEG--CSDNSARCAK 286
Query: 266 WAALGECTKNPEYM 279
WA GEC KNP YM
Sbjct: 287 WAEAGECEKNPVYM 300
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 164/262 (62%), Positives = 198/262 (75%), Gaps = 13/262 (4%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRS-AVADNLSGESKLSDVRTSSGTFIPKGK 97
SW PRAF+Y+GFL+D ECDHLI LAK +L++S VAD SGES+ S+VRTSSG F+ K +
Sbjct: 1 SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQ 60
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I+A +E K+A WTFLP+ENGE +Q+L YE+GQKY+PH+DYF DK + GGHR+ATVL
Sbjct: 61 DDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVL 120
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDD-LSECAKKGIAVKPRRGDALLFFSLHTNA 216
MYLS+V KGGETVFPN + +TP DD S+CAK+G AVKPR+GDALLFF+LH N
Sbjct: 121 MYLSNVTKGGETVFPNW----KGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNG 176
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF--DKIVEEGGDCTDNNASCERWAALGECTK 274
DP SLH CPVIEGEKWSAT+WIHV SF K+V C D++ SC+ WA GEC K
Sbjct: 177 TTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV-----CVDDHESCQEWADAGECEK 231
Query: 275 NPEYMVGSAQLPGFCRRSCKVC 296
NP YMVGS GFCR+SCK C
Sbjct: 232 NPMYMVGSETSLGFCRKSCKAC 253
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 323 bits (828), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 157/272 (57%), Positives = 198/272 (72%), Gaps = 8/272 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAK-SQLKRSAVADNLSGESKLSDVRTS 88
+P++ +SW+PRAF+Y+GFLT+ ECDHL+ LA+ L++S V D +G+S +S+VRTS
Sbjct: 33 FDPTRAVHVSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSEVRTS 92
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF--SDKVNI 146
SGTF+ K +D ++A IE +IA WT LP+ENGE IQVLRYE+GQKYEPH D+ + K +
Sbjct: 93 SGTFLAKKQDQVVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHH 152
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
RGGHR+ATVLMYLSDV GGETVFPN++ + +D SECA++G AVKP +GDA
Sbjct: 153 SRGGHRVATVLMYLSDVKMGGETVFPNSDA---KTLQPKDDTQSECARRGYAVKPVKGDA 209
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD--KIVEEGGDCTDNNASCE 264
+LFFSLH N D SLH GCPVIEGEKWSATKWIHV FD + V C D++ C
Sbjct: 210 VLFFSLHPNGTTDRDSLHGGCPVIEGEKWSATKWIHVRPFDNRRRVPSTAGCGDDDELCP 269
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
R AA GEC +NP YMVG+A PGFCR+SC C
Sbjct: 270 RLAANGECDRNPRYMVGTAGSPGFCRKSCNAC 301
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 150/269 (55%), Positives = 194/269 (72%), Gaps = 5/269 (1%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKS-QLKRSAVADNLSGESKLSDVRTS 88
+ S+ +SW PR F+YEGFL+D+EC+HLI LAK +++RS V + SGES +S RTS
Sbjct: 32 FDASRTVDVSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTS 91
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SG F+ + +D ++A IE++IA WT P ENGE +Q+LRY G+KYEPH+DY + R
Sbjct: 92 SGMFLIRKQDEVVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASAR 151
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGHR+ATVLMYLS+V GGETVFP+AE R + ++ S+CA++G AVKP +G A+L
Sbjct: 152 GGHRIATVLMYLSNVKMGGETVFPDAEA---RLSQPKDETWSDCAEQGFAVKPTKGSAVL 208
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD-CTDNNASCERWA 267
FFSL+ NA DP SLH CPVI+GEKWSATKWIHV S+D+ D C D +A C WA
Sbjct: 209 FFSLYPNATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDENGRRSSDKCEDQHALCSSWA 268
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A GEC KNP YMVG+++ PGFCR+SC VC
Sbjct: 269 AAGECAKNPGYMVGTSESPGFCRKSCNVC 297
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 150/269 (55%), Positives = 193/269 (71%), Gaps = 5/269 (1%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKS-QLKRSAVADNLSGESKLSDVRTS 88
+ S+ +SW PR F+YEGFL+D EC+HLI LAK +++RS V + SGES +S RTS
Sbjct: 32 FDASRAVDVSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTS 91
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
SG F+ + +D ++A IE++IA WT P ENGE +Q+LRY G+KYEPH+DY + R
Sbjct: 92 SGMFLIRKQDEVVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASAR 151
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGHR+ATVLMYLS+V GGETVFP+AE R + ++ S+CA++G AVKP +G A+L
Sbjct: 152 GGHRIATVLMYLSNVKMGGETVFPDAEA---RLSQPKDETWSDCAEQGFAVKPTKGSAVL 208
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD-CTDNNASCERWA 267
FFSL+ NA DP SLH CPVI+GEKWSATKWIHV S+D+ D C D +A C WA
Sbjct: 209 FFSLYPNATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDENGRRSSDKCEDEHALCSSWA 268
Query: 268 ALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A GEC KNP YMVG+++ PGFCR+SC VC
Sbjct: 269 AAGECAKNPGYMVGTSESPGFCRKSCNVC 297
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 317 bits (813), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 157/280 (56%), Positives = 194/280 (69%), Gaps = 5/280 (1%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S++ K+ S I+P++V Q+S KPRAF+Y+GFL+ EC HLIN AK +L +S VA
Sbjct: 42 SVIRMKTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAG- 100
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
+G+S S RTS+G F+ K +D I+A IE +IA WTFLP +NGE IQ+LRYE+GQKYEPH
Sbjct: 101 TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPH 160
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+D+F D NI GGHR+AT+LMYLS+V KGGETVFPN+ P + + DLSEC K G
Sbjct: 161 FDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNS---PVKLSEEEKADLSECGKVG 217
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDC 256
V+P+ GDALLFFS++ N PD S H CPVIEGEKWSATKWIH+ D+ C
Sbjct: 218 YGVRPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPA-C 276
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
D N C WA GEC KNP YM+GS GFCR SCKVC
Sbjct: 277 VDENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVC 316
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 148/207 (71%), Positives = 169/207 (81%), Gaps = 2/207 (0%)
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
FIPKGKDAII+ IEDKIA WTFLPKENGED+QVLRYE G+KY+PH+D+F DKVNIVRGGH
Sbjct: 2 FIPKGKDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGH 61
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPAT--NDDLSECAKKGIAVKPRRGDALLF 209
R+ATVLMYL+DV+KGGETVFP+AEE RR + +D LS+CAK+G AVKP+RGDALLF
Sbjct: 62 RVATVLMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAKRGTAVKPKRGDALLF 121
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAAL 269
FSL T A PD SLH+GCPVIEGEKWS TKWIHV+SFDK + +C D N C WAA
Sbjct: 122 FSLTTQAKPDTRSLHAGCPVIEGEKWSVTKWIHVESFDKPRQSSDNCVDQNPRCGEWAAY 181
Query: 270 GECTKNPEYMVGSAQLPGFCRRSCKVC 296
GEC NP YM+GS LPG CR+SCKVC
Sbjct: 182 GECNNNPIYMLGSPDLPGACRKSCKVC 208
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 150/249 (60%), Positives = 182/249 (73%), Gaps = 3/249 (1%)
Query: 48 EGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDK 107
+GFL+ ECDHLI L K +L++S VADN SG+S +S++RTSSG F+ + +D I IE +
Sbjct: 1 KGFLSHEECDHLIALGKDKLEKSMVADNESGKSVMSEIRTSSGMFLERRQDETITRIEKR 60
Query: 108 IATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGG 167
IA WTFLP+ENGE IQ+L YE GQKY+ HYDYF DK N GGHR+ATVLMYLSDV KGG
Sbjct: 61 IAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKKGG 120
Query: 168 ETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGC 227
ETVFP+AE + +D S+CA+ G AVKPR+GDALLFFS H NA DP SLH+ C
Sbjct: 121 ETVFPDAE---GKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASC 177
Query: 228 PVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPG 287
PVIEGEKWSAT+WIHV SF K +C D +C WA+ GEC KN YMVG+ + G
Sbjct: 178 PVIEGEKWSATRWIHVRSFAKKERNKDECVDEEDNCSFWASNGECEKNVLYMVGNNETLG 237
Query: 288 FCRRSCKVC 296
+CR+SCKVC
Sbjct: 238 YCRKSCKVC 246
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 306 bits (785), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 151/234 (64%), Positives = 180/234 (76%), Gaps = 4/234 (1%)
Query: 19 LIR-KSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLS 77
+IR K+ + T +P++ Q+SW+PRAFVY+GFL+D ECDHLINLAK +L +S VA++ +
Sbjct: 1 IIRSKTGAFTKAFDPTRAAQLSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDET 60
Query: 78 GESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHY 137
GES S RTSSG FI K +D I+ GIE +IA WTFLP+ENGE IQ+LRYEHGQKYE H
Sbjct: 61 GESMESQERTSSGMFIFKTEDEIVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHI 120
Query: 138 DYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGI 197
DYF DK N GGHR ATVLMYLSDV KGGETVFP +E + A +D S+CAKKG
Sbjct: 121 DYFVDKANQEEGGHRAATVLMYLSDVKKGGETVFPTSE---AEGSQAKDDSWSDCAKKGY 177
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVE 251
AVKP +GDALLFFSLH +A PDP SLH+ CPVIEGEKWSATKWIHV SF + V+
Sbjct: 178 AVKPNKGDALLFFSLHPDATPDPGSLHASCPVIEGEKWSATKWIHVRSFSEPVK 231
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 151/286 (52%), Positives = 190/286 (66%), Gaps = 22/286 (7%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKS-QLKRSAVADNLSGESKLSDVRTS 88
+ S+ +SW+PRAF+YEGFL+D ECDHLI+LAK ++++S V D SGES S VRTS
Sbjct: 37 FDASRAVDVSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTS 96
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLP-----------------KENGEDIQVLRYEHGQ 131
SG F+ K +D ++A IE++IA WT LP ENGE +Q+LRY G+
Sbjct: 97 SGMFLDKKQDEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGE 156
Query: 132 KYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
KYEPH+DY S + R G R+ATVLMYLS+V GGET+FP+ E R + ++ S+
Sbjct: 157 KYEPHFDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEA---RLSQPKDETWSD 213
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVE 251
CA++G AVKP +G A+LFFSLH NA D SLH CPVIEGEKWSATKWIHV S+
Sbjct: 214 CAEQGFAVKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWIHVRSYSYRRR 273
Query: 252 EGGDCTDNNASCERWAALGECTKNPEYMVGSAQL-PGFCRRSCKVC 296
G C D + C WAA GEC KNP YMVG++ PGFCR+SC VC
Sbjct: 274 SAGKCEDEHVLCSSWAAAGECAKNPGYMVGTSDSPPGFCRKSCNVC 319
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 142/228 (62%), Positives = 171/228 (75%), Gaps = 6/228 (2%)
Query: 72 VADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQ 131
VADN SG+S +S+VRTSSG F+ K +D ++A IE++IA WTFLP +NGE IQ+L Y++G+
Sbjct: 2 VADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNGE 61
Query: 132 KYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
KYEPHYDYF DK N GGHR+ATVLMYLSDV KGGET+FP AE + +D S+
Sbjct: 62 KYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAE---GKLLQPKDDTWSD 118
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVE 251
CAK G AVKP +GDALLFFSLH +A D SLH CPVIEG+KWSATKWIHV SFD V+
Sbjct: 119 CAKNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVK 178
Query: 252 EGGD---CTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+G C D N C +WAA+GEC KNP YMVG+ + PGFCR+SC VC
Sbjct: 179 QGASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 226
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 140/221 (63%), Positives = 171/221 (77%), Gaps = 9/221 (4%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+VK +SW PR FVY+GFL+D ECDHL+ LAK +++RS VADN SG+S S+VRTSS
Sbjct: 40 FNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSS 99
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE++IA WTFLP+EN E++QVLRYE GQKYEPH+DYF D+VN RG
Sbjct: 100 GMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARG 159
Query: 150 GHRLATVLMYLSDVAKGGETVFPNA---EEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
GHR ATVLMYLS V +GGETVFPNA E P+ T SECA KG+AVKP +GDA
Sbjct: 160 GHRYATVLMYLSTVREGGETVFPNAKGWESQPKDAT------FSECAHKGLAVKPVKGDA 213
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
+LFFSLH + PDP+SLH CPVI GEKWSA KWIHV S++
Sbjct: 214 VLFFSLHADGTPDPLSLHGSCPVIRGEKWSAPKWIHVRSYE 254
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 293 bits (750), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 146/270 (54%), Positives = 184/270 (68%), Gaps = 10/270 (3%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+++++SW+P A VY GFLT ECDHL LA L RS V D +G S SD+RTSSG F+
Sbjct: 55 RIERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSDIRTSSGMFL 114
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR--GGH 151
+G+D ++A IE +IA+WT +P+ +GE QVLRYE GQ+Y PH+DYF D+ N R GG
Sbjct: 115 LRGEDDVVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQDEFNQKREKGGQ 174
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ATVLMYL+DV +GGET+FP+AE P DD S CA +AVKPR+GDAL F S
Sbjct: 175 RVATVLMYLTDVEEGGETIFPDAEAGA---NPGGGDDASSCAAGKLAVKPRKGDALFFRS 231
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV----DSFDKIVE-EGGDCTDNNASCERW 266
LH N D +S H+GCPV++G K+SATKW+HV DS V E G C D NA+CE W
Sbjct: 232 LHHNGTSDAMSSHAGCPVVKGVKFSATKWMHVAPIEDSATASVRFEPGVCKDVNAACEGW 291
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A+ GECTKNP +MVG + G C RSC C
Sbjct: 292 ASSGECTKNPSFMVGRGRANGNCMRSCGAC 321
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 145/271 (53%), Positives = 189/271 (69%), Gaps = 13/271 (4%)
Query: 33 SKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTF 92
S++ ++S KP+A++Y GFL ECD++ AK +L++S V DN +G+S S++RTS G F
Sbjct: 3 SRIVKLSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSNIRTSDGMF 62
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGG 150
+ +D II IE +IA WT +P ENGE IQVLRYE GQKYEPH D FSDK N +GG
Sbjct: 63 FDRHEDDIIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSDKFNTEESKGG 122
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+ATVLMYLSDV +GGETVFP + + P + P SECA++G+AVK R+GDALLF+
Sbjct: 123 QRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPK----WSECAQRGVAVKARKGDALLFW 178
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD-----KIVEEGGDCTDNNASCER 265
SL ++ D +SLH GCPVI+G KWSATKW+H+ SFD K E G C D N CE
Sbjct: 179 SLDIDSNVDELSLHGGCPVIKGTKWSATKWMHLKSFDTANSFKFPE--GVCDDVNEQCEG 236
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WA+ GEC KNP+YM+G+ + G+C R+C C
Sbjct: 237 WASTGECEKNPKYMIGNGKTDGYCVRACGKC 267
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 135/265 (50%), Positives = 176/265 (66%), Gaps = 16/265 (6%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
KV+Q+SW+PRAFVY FL+D EC+HL LA+ +L +S V DN +G+S S VRTSSGTF+
Sbjct: 37 KVEQVSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDSTVRTSSGTFL 96
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--IVRGGH 151
+G+D ++ IE +I+ T +P+ENGE IQ+L+Y GQKYEPH DYF DK N GG
Sbjct: 97 ARGEDEVVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFHDKYNSRTENGGQ 156
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+AT+LMYLS +GGETVFP AE+ + SECA+KG+AVK +G ALLF+S
Sbjct: 157 RVATILMYLSTPEEGGETVFPYAEK------KVEGEGWSECARKGLAVKAVKGSALLFYS 210
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGE 271
L N D S H CP + GEKWSAT+WIHV +F +G C D N CE WA +GE
Sbjct: 211 LKPNGEEDQASTHGSCPTLAGEKWSATRWIHVGAFQPGGAKG--CKDENEKCEEWAVMGE 268
Query: 272 CTKNPEYMVGSAQLPGFCRRSCKVC 296
C NP +M + C++SC++C
Sbjct: 269 CQNNPAFMKSN------CKKSCELC 287
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 270 bits (691), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 136/275 (49%), Positives = 178/275 (64%), Gaps = 11/275 (4%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
I+ ++++SW P A VY GFLT+ EC+H+ LA ++LK S V D +G S++RT
Sbjct: 19 GTIDAGAIERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTVVDASTGGDASSEIRT 78
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSG F+ + +D +I IE +IA WT +P+ +GE QVLRYE Q+Y HYDYF DK N+
Sbjct: 79 SSGMFLGRAEDDVIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKFNVK 138
Query: 148 R--GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
R GG R+ TVLMYLSDV +GGETVFP E+ TPA + SECA+ +AV+PR+GD
Sbjct: 139 REKGGQRMGTVLMYLSDVEEGGETVFPKFEDG----TPA-GSEASECARNKLAVRPRKGD 193
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV----DSFDKIVEEGGDCTDNNA 261
AL F SL + +PD S H+GCPVI G K+SATKW+HV D + ++ G C D +A
Sbjct: 194 ALFFRSLRHDGVPDTFSEHAGCPVIRGVKFSATKWMHVSPIEDGSNGLLLPPGVCKDLHA 253
Query: 262 SCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+C WA GEC KN YMVG + G C RSC C
Sbjct: 254 ACVAWAKSGECEKNKNYMVGRGRSKGNCMRSCGAC 288
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 130/215 (60%), Positives = 166/215 (77%), Gaps = 2/215 (0%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P + ISW+PRA V+ FL+ ECDHLI LA+ +KRSAV DN +G+SK S VRTSSGT
Sbjct: 43 PVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGT 102
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
F+ +G+D II+ IE++IA +TF+PKE+GE +QVL YE GQKY+ H+DYF DKVN GG
Sbjct: 103 FLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQ 162
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ATVLMYLSDV +GGETVFP+A+ + D+LSECAKKG++VKPR+GDALLF+S
Sbjct: 163 RVATVLMYLSDVEEGGETVFPSAKV--NSSSVPWWDELSECAKKGVSVKPRKGDALLFWS 220
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ +A DP SLH GCPVI+G KWSATKW+H+ +
Sbjct: 221 MSPDAELDPFSLHGGCPVIKGNKWSATKWMHLREY 255
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 267 bits (682), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 134/220 (60%), Positives = 162/220 (73%), Gaps = 12/220 (5%)
Query: 71 AVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHG 130
VAD SGES+ S+VRTSSG F+ K +D I+A +E K+A WTFLP+ENGE +Q+L YE+G
Sbjct: 2 VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 61
Query: 131 QKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD-L 189
QKY+PH+DYF DK + GGHR+ATVLMYLS+V KGGETVFPN + +TP DD
Sbjct: 62 QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNW----KGKTPQLKDDSW 117
Query: 190 SECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF--D 247
S+CAK+G AVKPR+GDALLFF+LH N DP SLH CPVIEGEKWSAT+WIHV SF
Sbjct: 118 SKCAKQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKK 177
Query: 248 KIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPG 287
K+V C D++ SC+ WA GEC KNP YMVG + G
Sbjct: 178 KLV-----CVDDHESCQEWADAGECEKNPMYMVGVGKKTG 212
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 124/203 (61%), Positives = 153/203 (75%), Gaps = 6/203 (2%)
Query: 97 KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATV 156
+D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N GGHR+ATV
Sbjct: 193 QDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATV 252
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
LMYLS+V KGGET+FPNAE + ++ S+CA+ G AVKP +GDALLFFSLH +A
Sbjct: 253 LMYLSNVEKGGETIFPNAEG---KLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 309
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNNASCERWAALGECT 273
D SLH CPVIEG+KWSATKWIHV SFD V++ G C D+N C +WAA+GEC
Sbjct: 310 TTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNILCPQWAAVGECA 369
Query: 274 KNPEYMVGSAQLPGFCRRSCKVC 296
KNP YMVG+ + PGFCR+SCKVC
Sbjct: 370 KNPNYMVGTKEAPGFCRKSCKVC 392
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 131/280 (46%), Positives = 176/280 (62%), Gaps = 20/280 (7%)
Query: 33 SKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTF 92
S++ +SW+PR F+Y+G LT ECD+LI +A+ +L+RS V+D +GE +SD+RTSSG F
Sbjct: 48 SRMVVLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGEGGVSDIRTSSGMF 107
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHR 152
+G++ ++ IE ++A WT LP ENGE IQVLRYE QKY+PH+DYFS + GG+R
Sbjct: 108 YTRGENDVVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFEGRDANGGNR 167
Query: 153 LATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL 212
+ATVLMYL+ +GGETVFP P + T + SEC KG+AVKP +GDA+LF+S+
Sbjct: 168 MATVLMYLATPEEGGETVFPKIPVPAGQ----TRANFSECGMKGLAVKPVKGDAVLFWSI 223
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD----------------C 256
+ +P SLH CPVI G KWSATKWIHV + E+ + C
Sbjct: 224 RPDGRFEPGSLHGSCPVIRGVKWSATKWIHVGPYSMGAEKAVEVTRVIYAPPPPPAVPGC 283
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+ + C+ WA GEC NP YMVG PG C +C C
Sbjct: 284 INTHKLCDHWAESGECESNPGYMVGQLGSPGACNLACNRC 323
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 266 bits (680), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 186/301 (61%), Gaps = 23/301 (7%)
Query: 12 FLLSFSLLIRKSFSSTAIINPSK---VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
F SF S +S +PS+ V+ + R F+Y FLTD ECDH+I LA+ +
Sbjct: 40 FAASFG---NSSCASEPACDPSRSPRVQVLHEDARIFLYHNFLTDEECDHIIKLAEPTMA 96
Query: 69 RSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYE 128
RS V + SG+SK+ +VRTS GTF+ +G D++IA IE +IA WT +P NGE +QVL+YE
Sbjct: 97 RSGVVETDSGKSKIDNVRTSKGTFLNRGHDSVIADIEARIAKWTLMPAGNGEGLQVLKYE 156
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDD 188
HGQ+YE HYDYF K GG+R TVLMYL+DV +GGET FPN P P +
Sbjct: 157 HGQEYEGHYDYFFHKAGTANGGNRYLTVLMYLNDVEEGGETCFPNIPSPNGDNGP----E 212
Query: 189 LSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD- 247
SECA+K +A KP++G+A+LF S+ + SLH+ CPVI+G KWSA KW+HV +
Sbjct: 213 FSECARKVLAAKPKKGNAVLFHSIKPTGELERRSLHTACPVIKGVKWSAPKWVHVGHYAV 272
Query: 248 --------KIVEEGG----DCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKV 295
+ + +G +C + +A+C+ WA GEC KNP +MVG+ Q PG C ++C
Sbjct: 273 GGEKPQHIQQIPQGDSTYPECKNKDAACDSWAGNGECEKNPVFMVGTKQRPGHCIKACGK 332
Query: 296 C 296
C
Sbjct: 333 C 333
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 129/215 (60%), Positives = 165/215 (76%), Gaps = 2/215 (0%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P + ISW+PRA V+ FL+ ECDHLI LA+ +KRSAV DN +G+SK S VRTSSGT
Sbjct: 43 PVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGT 102
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
F+ +G+D II+ IE++IA +TF+PKE+GE +QVL YE GQKY+ H+DYF DKVN GG
Sbjct: 103 FLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQ 162
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ATVLMYLSDV +GGETVFP+A+ + D+LSEC KKG++VKPR+GDALLF+S
Sbjct: 163 RVATVLMYLSDVEEGGETVFPSAK--VNSSSVPWWDELSECGKKGVSVKPRKGDALLFWS 220
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ +A DP SLH GCPVI+G KWSATKW+H+ +
Sbjct: 221 MSPDAELDPFSLHGGCPVIKGNKWSATKWMHLREY 255
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 265 bits (676), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 123/203 (60%), Positives = 152/203 (74%), Gaps = 6/203 (2%)
Query: 97 KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATV 156
+D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N GGHR+ATV
Sbjct: 9 QDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATV 68
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
LMYLS+V KGGET+FPNAE + +D S+CA+ G AVKP +GDALLFFSLH ++
Sbjct: 69 LMYLSNVEKGGETIFPNAE---GKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 125
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGG---DCTDNNASCERWAALGECT 273
D SLH CP IEG+KWSATKWIHV SFD V++ G C D+N C +WAA+GEC
Sbjct: 126 TTDSDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQWAAVGECA 185
Query: 274 KNPEYMVGSAQLPGFCRRSCKVC 296
KNP YMVG+ + PGFCR+SCKVC
Sbjct: 186 KNPNYMVGTKEAPGFCRKSCKVC 208
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 265 bits (676), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/268 (51%), Positives = 171/268 (63%), Gaps = 17/268 (6%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V ++SW PRAF+ +GFL + EC+HLI+ AK + +S V DN +G+S S VRTS+GTF
Sbjct: 82 EVIEVSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDSTVRTSTGTFF 141
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGH 151
+ +D +I GIE +I+ T LP+ NGE +Q+L YE GQKYE H+D+F DK N GG
Sbjct: 142 GREEDEVIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGGQ 201
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ATVLMYL+ +GGETVFP A T SECA+ G AVK RRGDALLF+S
Sbjct: 202 RIATVLMYLTTAEEGGETVFPMAA------NKVTGPQWSECARGGAAVKSRRGDALLFYS 255
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG---GDCTDNNASCERWAA 268
L N DP SLH CP +GEKWSATKWIHV F E+ G+C D + C WAA
Sbjct: 256 LLPNGETDPTSLHGSCPTTKGEKWSATKWIHVGPFGGSSEQQRAKGECIDADERCSGWAA 315
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
GEC KNP YM+ S CR SC C
Sbjct: 316 DGECKKNPGYMMSS------CRLSCHTC 337
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 265 bits (676), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 133/268 (49%), Positives = 179/268 (66%), Gaps = 17/268 (6%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V +SW PRAF+ + FL+D ECD+++ A+ ++ +S+V DN SG+S S++RTS+GT+
Sbjct: 41 EVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWF 100
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGH 151
KG+D++I+ IE ++A T +P EN E +QVL Y GQKYEPHYDYF D VN GG
Sbjct: 101 AKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ T+LMYL+ V +GGETV PNAE+ T D SECAK+G+AVKP +GDAL+F+S
Sbjct: 161 RVVTMLMYLTTVEEGGETVLPNAEQ------KVTGDGWSECAKRGLAVKPIKGDALMFYS 214
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF---DKIVEEGGDCTDNNASCERWAA 268
L + DP SLH CP ++G+KWSATKWIHV K+ +C D + C+ WA
Sbjct: 215 LKPDGSNDPASLHGSCPTLKGDKWSATKWIHVAPIGGKKKLNLGTPECHDEDERCQEWAF 274
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
GEC KNP +M AQ C+RSCK C
Sbjct: 275 FGECEKNPGFM--DAQ----CKRSCKKC 296
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 134/268 (50%), Positives = 177/268 (66%), Gaps = 17/268 (6%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V +SW PRAF+ +GFL+D EC+H+I AK ++ +S+V DN SG+S S++RTS+G ++
Sbjct: 53 EVIHLSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSEIRTSTGAWL 112
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV--RGGH 151
KG+D II+ IE ++A T +P EN E +QVL Y GQKYEPHYDYF D VN GG
Sbjct: 113 AKGEDEIISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ TVLMYL+ V +GGETV P+A++ + + SECAK+G+AVKP +GDAL+F+S
Sbjct: 173 RVVTVLMYLTTVEEGGETVLPHADQ------KVSGEGWSECAKRGLAVKPVKGDALMFYS 226
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF--DKIVEEGG-DCTDNNASCERWAA 268
L + DP SLH CP ++G+KWSATKWIHV K V G +C D+ C WA
Sbjct: 227 LKPDGSNDPASLHGSCPTLKGDKWSATKWIHVGPIGGKKAVSLGTPECHDSMEQCTEWAF 286
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSCKVC 296
GEC KNP YM + C RSCK C
Sbjct: 287 FGECEKNPGYMREN------CARSCKTC 308
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 123/202 (60%), Positives = 151/202 (74%), Gaps = 6/202 (2%)
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N GGHR+ATVL
Sbjct: 4 DEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 63
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLS+V KGGET+FPNAE + +D S+CA+ G AVKP +GDALLFFSLH ++
Sbjct: 64 MYLSNVEKGGETIFPNAE---GKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDST 120
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGG---DCTDNNASCERWAALGECTK 274
D SLH CP IEG+KWSATKWIHV SFD V++ G C D+N C +WAA+GEC K
Sbjct: 121 TDSDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQWAAVGECAK 180
Query: 275 NPEYMVGSAQLPGFCRRSCKVC 296
NP YMVG+ + PGFCR+SCKVC
Sbjct: 181 NPNYMVGTKEAPGFCRKSCKVC 202
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 123/209 (58%), Positives = 161/209 (77%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISWKPRAFVY FLT EC++LINLAK ++++S V D+ +G+SK S VRTSSGTF+P+G+
Sbjct: 83 ISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSKVRTSSGTFLPRGR 142
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I+ IE +IA ++F+P E+GE +Q+L YE GQ+YEPH+DYF D+ N GG R+ATVL
Sbjct: 143 DKIVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYFMDEYNTKNGGQRIATVL 202
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP+AE P N +LSEC K G++VKP+ GDALLF+S++ +
Sbjct: 203 MYLSDVEEGGETVFPSAEGNI-SAVPWWN-ELSECGKGGLSVKPKMGDALLFWSMNPDGS 260
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
PDP SLH GCPVI G KWS+TKW+ V+ +
Sbjct: 261 PDPSSLHGGCPVIRGNKWSSTKWMRVNEY 289
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 127/221 (57%), Positives = 161/221 (72%), Gaps = 14/221 (6%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
++ S+VK +SW+PR FVY+GFL+D ECDHL+ LAK K + VA N S S RTSS
Sbjct: 43 LSSSRVKALSWQPRIFVYKGFLSDDECDHLVTLAK---KGTMVAHNRS--SYYRQTRTSS 97
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE++IA WT LP+EN E +Q+ RY+HGQKY+PH+DYF DK++ RG
Sbjct: 98 GMFLRKRQDPVVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHTRG 157
Query: 150 GHRLATVLMYLSDVAKGGETVFPNA---EEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G R ATVLMYLS V KGGETVFP A E P+ +D SECA KG+AVKP +GDA
Sbjct: 158 GPRYATVLMYLSTVDKGGETVFPKAKGWESQPK------DDTFSECAHKGLAVKPVKGDA 211
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
+LFFSLH + PDP++LH CPVI+GEKWSA WIHV SF+
Sbjct: 212 VLFFSLHVDGGPDPLTLHGSCPVIQGEKWSAPNWIHVRSFE 252
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 119/209 (56%), Positives = 161/209 (77%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK ++++S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 110 ISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTVVDSTTGKSKDSRVRTSSGMFLRRGR 169
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P E+GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 170 DKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRMATIL 229
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + +++LSECA+KG+AVKP+ GDALLF+S++ +A
Sbjct: 230 MYLSDVEEGGETIFPDANV--NSSSLPWHNELSECARKGLAVKPKMGDALLFWSMNPDAT 287
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI G KWS+TKW+HV +
Sbjct: 288 LDPLSLHGGCPVIRGNKWSSTKWMHVGEY 316
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 122/205 (59%), Positives = 149/205 (72%), Gaps = 5/205 (2%)
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
P+ KD I++ IE ++A WTFLP+EN E +QVLRYE GQKY+ H+DYF D+ N+ GG R+
Sbjct: 14 PQPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 73
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
ATVLMYL+DV KGGETVFPNAE + T SEC++ G+AVKP++GDALLFF+LH
Sbjct: 74 ATVLMYLTDVNKGGETVFPNAEGSHLQYKDET---WSECSRSGLAVKPKKGDALLFFNLH 130
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI--VEEGGDCTDNNASCERWAALGE 271
NA D SLH CPVIEGEKWSATKWIHV SFD V C+D+ C RWAA+GE
Sbjct: 131 VNATADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVRTDAPCSDDKELCPRWAAIGE 190
Query: 272 CTKNPEYMVGSAQLPGFCRRSCKVC 296
C +NP YMVG+ GFCR+SC +C
Sbjct: 191 CHRNPTYMVGTKDTLGFCRKSCGIC 215
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 121/209 (57%), Positives = 158/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC++LI LAK Q+ +S+V D+ +G+S S VRTSSG F+ +GK
Sbjct: 82 LSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESRVRTSSGMFLKRGK 141
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I+ IE +IA +TF+P+ENGE +Q+L YE GQKYEPHYDYF D+ N GG R+ATVL
Sbjct: 142 DKIVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVL 201
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A P N DLS+CA+KG++VKP+ GDALLF+S+ +A
Sbjct: 202 MYLSDVEEGGETVFP-AANANFSSVPWWN-DLSQCARKGLSVKPKMGDALLFWSMRPDAT 259
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G KWS+TKW+H+ +
Sbjct: 260 LDPSSLHGGCPVIKGNKWSSTKWMHLREY 288
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 257 bits (656), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 124/270 (45%), Positives = 175/270 (64%), Gaps = 12/270 (4%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD--NLSGESKLSDVRTSSGTF 92
+K +SW PR F+Y+ LT+ ECDH++ A +L RS V D N GES +SD+RTS G F
Sbjct: 16 LKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGES-VSDIRTSYGMF 74
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHR 152
+G+D ++ +E +++ W+ +P +GE IQVLRYE+G++Y+PH+DYF D +++ GG+R
Sbjct: 75 FDRGEDEVVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNGGNR 134
Query: 153 LATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL 212
LAT+LMYL++ GGETVFPN + PP + A SECA +G+AVKPR+GDA+LFFSL
Sbjct: 135 LATILMYLAEPEFGGETVFPNVKAPPEQTLEA---GYSECATQGLAVKPRKGDAVLFFSL 191
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK------IVEEGGDCTDNNASCERW 266
T D SLH CP ++G K++ATKW HV + ++ C D +C W
Sbjct: 192 RTEGTLDKGSLHGSCPTLKGFKFAATKWYHVAHYAMGGERAPVLPASAGCKDEKDACVGW 251
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A GEC NP +MVG+ + PG C +C C
Sbjct: 252 AEGGECESNPGFMVGTKEQPGACLLACGRC 281
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 257 bits (656), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 124/209 (59%), Positives = 156/209 (74%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAFVY FL+ EC++LI+LAK + +S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 79 LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P ++GE +QVL YE GQKYEPHYDYF D+ N GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A P N +LSEC KKG++VKPR GDALLF+S+ +A
Sbjct: 199 MYLSDVEEGGETVFP-AANMNFSSVPWYN-ELSECGKKGLSVKPRMGDALLFWSMRPDAT 256
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI G KWS+TKWIHV +
Sbjct: 257 LDPTSLHGGCPVIRGNKWSSTKWIHVGEY 285
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 256 bits (654), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 120/209 (57%), Positives = 158/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ ECD+LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 102 ISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 161
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P E+GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 162 DKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLL 221
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + ++LSECA+KG+AVKP+ GDALLF+S+ +A
Sbjct: 222 MYLSDVEEGGETIFPDANV--NSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDAT 279
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI+G KWS+TKW+HV +
Sbjct: 280 LDPLSLHGGCPVIKGNKWSSTKWMHVREY 308
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/209 (58%), Positives = 156/209 (74%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAFVY FL+ EC++LI+LAK + +S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 79 LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P ++GE +QVL YE GQKYEPHYDYF D+ N GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A P N +LSEC KKG++VKPR GDALLF+S+ +A
Sbjct: 199 MYLSDVEEGGETVFP-AANMNFSSVPWYN-ELSECGKKGLSVKPRMGDALLFWSMRPDAT 256
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI G KWS+TKW+HV +
Sbjct: 257 LDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 255 bits (652), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 122/209 (58%), Positives = 156/209 (74%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAFVY FL+ EC++LI+LAK + +S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 79 LSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGR 138
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P ++GE +Q+L YE GQKYEPHYDYF D+ N GG R+AT+L
Sbjct: 139 DKIIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQRMATML 198
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A P N +LSEC KKG++VKPR GDALLF+S+ +A
Sbjct: 199 MYLSDVEEGGETVFP-AANMNFSSVPWYN-ELSECGKKGLSVKPRMGDALLFWSMRPDAT 256
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI G KWS+TKW+HV +
Sbjct: 257 LDPTSLHGGCPVIRGNKWSSTKWMHVGEY 285
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 122/212 (57%), Positives = 162/212 (76%), Gaps = 2/212 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW+PRAFVY FLT EC++LI+LAK +++S V D+ +G+SK S VRTSSGTF+P
Sbjct: 76 VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFLP 135
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D + IE +++ ++F+P E+GE +QVL YE GQKYEPH+DYF D+ N GG R+A
Sbjct: 136 RGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIA 195
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVLMYLSDV +GGETVFP A + P N +LS+C KKG++VKP+RGDALLF+S+
Sbjct: 196 TVLMYLSDVEEGGETVFP-AAKGNFSSVPWWN-ELSDCGKKGLSVKPKRGDALLFWSMKP 253
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A DP SLH GCPVI+G KWSATKW+ V+ +
Sbjct: 254 DASLDPSSLHGGCPVIKGNKWSATKWVRVEEY 285
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 125/216 (57%), Positives = 157/216 (72%), Gaps = 10/216 (4%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAF+Y FLT +EC+HLI +AK L +S V D+ +G+SK S VRTSSGTF+
Sbjct: 2 VEVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFLV 61
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D II IE +IA +TF+P E GE +QVL+Y +KYEPHYDYF D N GG R+A
Sbjct: 62 RGQDHIIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQRIA 121
Query: 155 TVLMYLSDVAKGGETVFP----NAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
TVLMYLSDV KGGETVFP NA E P D SECAK+G++V+PR GDALLF+
Sbjct: 122 TVLMYLSDVEKGGETVFPASKVNASEVP------DWDQRSECAKRGLSVRPRMGDALLFW 175
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
S+ +A DP SLH CPVI+G KWSATKW+HV+ +
Sbjct: 176 SMKPDAKLDPTSLHGACPVIQGTKWSATKWLHVEKY 211
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 122/212 (57%), Positives = 162/212 (76%), Gaps = 2/212 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW+PRAFVY FLT EC++LI+LAK +++S V D+ +G+SK S VRTSSGTF+P
Sbjct: 76 VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFLP 135
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D + IE +++ ++F+P E+GE +QVL YE GQKYEPH+DYF D+ N GG R+A
Sbjct: 136 RGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRIA 195
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVLMYLSDV +GGETVFP A + P N +LS+C KKG++VKP+RGDALLF+S+
Sbjct: 196 TVLMYLSDVEEGGETVFP-AAKGNFSSVPWWN-ELSDCGKKGLSVKPKRGDALLFWSMKP 253
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A DP SLH GCPVI+G KWSATKW+ V+ +
Sbjct: 254 DASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 285
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 118/209 (56%), Positives = 159/209 (76%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK ++ +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 107 ISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSRVRTSSGMFLQRGR 166
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P E+GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 167 DKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRMATIL 226
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSD+ +GGET+FP+A + ++LSECA+KG+AVKP+ GDALLF+S+ +A
Sbjct: 227 MYLSDIEEGGETIFPDANV--NSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDAT 284
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI+G KWS+TKW+HV +
Sbjct: 285 LDPLSLHGGCPVIKGNKWSSTKWLHVGEY 313
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 127/226 (56%), Positives = 162/226 (71%), Gaps = 2/226 (0%)
Query: 21 RKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
RK + S A + +SW+PRAFVY FL+ EC+HLINLAK L +S+V D+ +G+S
Sbjct: 55 RKIYESLAEKKEQWTEILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKS 114
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
S VRTSSG F+ +GKD II IE +IA +TF+P ENGE +QVL Y G+KYEPHYDYF
Sbjct: 115 TESRVRTSSGMFLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYF 174
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
D+ N GG R+ATVLMYLSDV +GGETVFP A + P N DLSECA+KG+++K
Sbjct: 175 LDEFNTKNGGQRVATVLMYLSDVEEGGETVFP-AAKANFSSVPWWN-DLSECARKGLSLK 232
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P+ GDALLF+S+ +A D SLH GCPVI G KWS+TKW+H++ +
Sbjct: 233 PKMGDALLFWSMRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEY 278
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 122/209 (58%), Positives = 158/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC++LI+LAK +K+S V D+ +G SK S VRTSSGTF+ +G+
Sbjct: 92 LSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLRRGQ 151
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I+ IE +I+ +TF+P ENGE +QVL YE GQKYEPH+DYF D N GG R+ATVL
Sbjct: 152 DKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNTKNGGQRIATVL 211
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP+A + P N +LSECAK+GI+VKP+ GDALLF+S+ +
Sbjct: 212 MYLSDVEEGGETVFPSA-KVNSSSIPFYN-ELSECAKRGISVKPKMGDALLFWSMRPDGT 269
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G+KWS+TKWI V +
Sbjct: 270 LDPTSLHGGCPVIKGDKWSSTKWIRVHEY 298
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 117/182 (64%), Positives = 137/182 (75%), Gaps = 5/182 (2%)
Query: 117 ENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEE 176
ENGE IQ+L YE+G+KYEPHYDYF D+ N GGHR+ATVLMYLSDV KGGET+FPNAE
Sbjct: 7 ENGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFPNAES 66
Query: 177 PPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWS 236
+ + ++ SECA KG AVKPR+GDALLFFSLH NA D SLH CPVIEGEKWS
Sbjct: 67 ---KLSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKWS 123
Query: 237 ATKWIHVDSFDKIV--EEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCK 294
ATKWIHV F+K + ++ GDCTD N +C RWA LGEC KNP YM+G + G+C +SC
Sbjct: 124 ATKWIHVSDFEKAIKQDDNGDCTDENENCSRWAKLGECVKNPLYMIGGKGVKGYCMKSCN 183
Query: 295 VC 296
VC
Sbjct: 184 VC 185
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 253 bits (646), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 122/212 (57%), Positives = 160/212 (75%), Gaps = 2/212 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAFVY FLT EC++LI++AK + +S V D+ +G+SK S VRTSSGTF+
Sbjct: 78 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLA 137
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D I+ IE KIA +TF+P E+GE +QVL YE GQKYEPHYDYF D+ N GG R+A
Sbjct: 138 RGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIA 197
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVLMYL+DV +GGETVFP A + P N +LS+C KKG+++KP+RGDALLF+S+
Sbjct: 198 TVLMYLTDVEEGGETVFP-AAKGNFSNVPWYN-ELSDCGKKGLSIKPKRGDALLFWSMKP 255
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 256 DATLDASSLHGGCPVIKGNKWSSTKWIRVNEY 287
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 253 bits (645), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 121/209 (57%), Positives = 158/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC++LI+LAK +K+S V D+ +G SK S VRTSSGTF+ +G+
Sbjct: 98 LSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLRRGQ 157
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +I+ +TF+P ENGE +QVL YE GQKYEPH+DYF D N GG R+AT+L
Sbjct: 158 DKVIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNTKNGGQRIATLL 217
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP+A + P N +LSECAK+GI+VKP+ GDALLF+S+ +
Sbjct: 218 MYLSDVEEGGETVFPSA-KVNSSSIPFYN-ELSECAKRGISVKPKMGDALLFWSMRPDGT 275
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G+KWS+TKWI V +
Sbjct: 276 LDPTSLHGGCPVIKGDKWSSTKWIRVHEY 304
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 253 bits (645), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 136/274 (49%), Positives = 172/274 (62%), Gaps = 18/274 (6%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ++WKPRA + GFL+ ECDH+I +A L+RS V G S L ++RTSSG FI
Sbjct: 42 VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSP-EGGSMLDEIRTSSGMFIL 100
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-------KVNIV 147
KG DA+I+G+E+++A T LP + ED+QVLRYE GQKY H+D + V
Sbjct: 101 KGHDAVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRAKGV 160
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNA---EEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
GG R AT+LMYLSDV +GGET FP+ +E + P T ECA KG+ VKPR+G
Sbjct: 161 LGGLRTATLLMYLSDVEEGGETAFPHGRWLDEGVQAAPPYT-----ECASKGVVVKPRKG 215
Query: 205 DALLFFSLHTNAI-PDPVSLHSGCPVIEGEKWSATKWIHVDSFDK-IVEEGGDCTDNNAS 262
DA+LFFSL N D SLH+GCPV+ G K+SATKW+HV+ F V++ C D
Sbjct: 216 DAILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWVHVEPFGHTTVQQPSRCEDARVE 275
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C +WAA GEC NP YM GS G CR SCKVC
Sbjct: 276 CPQWAAAGECDSNPVYMKGSEVSVGSCRLSCKVC 309
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 121/216 (56%), Positives = 158/216 (73%), Gaps = 2/216 (0%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ +SW+PRAF+Y FLT EC+HLIN AK +++S+V DN +G+SK S VRTSSG
Sbjct: 63 NKRWVQIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSSVRTSSG 122
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ +G D I+ IE +IA +TF+P ENGE VLRYE GQKY+PH DYF+D N V GG
Sbjct: 123 TFLDRGGDEIVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNTVNGG 182
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+AT+LMYLSDV +GGETVFP A + P N +LS+C KKG+++KP+ GDALLF+
Sbjct: 183 QRIATMLMYLSDVEEGGETVFP-AAKGNISSVPWWN-ELSDCGKKGLSIKPKMGDALLFW 240
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
S+ + DP SLH CPVI+G+KWS TKW+ ++ F
Sbjct: 241 SMKPDGTLDPSSLHGACPVIKGDKWSCTKWMRINEF 276
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 131/267 (49%), Positives = 169/267 (63%), Gaps = 24/267 (8%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PRAFVYEGFLTD ECDH++ L+K L +S V D +G S SD+RTS+GTFI + D I
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGSTTSDIRTSTGTFISRAHDPTI 60
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLS 161
IE++I W+ +P ++GE +QVLRYE+GQ+Y+ H+DYF K + +R+ATVL+YLS
Sbjct: 61 TAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFFHKGG--KRNNRIATVLLYLS 118
Query: 162 DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPV 221
DV +GGETVFPN + P R SEC G +VK R+GDALLF+S+ DP
Sbjct: 119 DVEEGGETVFPNTDVPTDR----DRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDPG 174
Query: 222 SLHSGCPVIEGEKWSATKWIHV-------DSFDKIVEEGG-----DCTDNNASCERWAAL 269
S H+GCPVI+G KW+ATKW+HV D KI EGG C D + +C WA
Sbjct: 175 SSHAGCPVIKGVKWTATKWMHVNAIGKHGDDVHKIFYEGGPQATESCKDTDDACRGWAES 234
Query: 270 GECTKNPEYMVGSAQLPGFCRRSCKVC 296
GEC KNP +M+ S C SC+ C
Sbjct: 235 GECDKNPGFMLKS------CAMSCRAC 255
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 121/212 (57%), Positives = 160/212 (75%), Gaps = 2/212 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAFVY FLT EC++LI++AK + +S+V D+ +G+SK S VRTSSGTF+
Sbjct: 79 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSKDSRVRTSSGTFLA 138
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D I+ IE +IA ++F+P E+GE +QVL YE GQKYEPHYDYF D N GG R+A
Sbjct: 139 RGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQRIA 198
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVLMYL+DV +GGETVFP A + P N +LSEC KKG+++KP+RGDALLF+S+
Sbjct: 199 TVLMYLTDVEEGGETVFP-AAKGNFSSVPWWN-ELSECGKKGLSIKPKRGDALLFWSMKP 256
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A DP SLH GCPVI+G KWS+TKW+ V +
Sbjct: 257 DATLDPSSLHGGCPVIKGNKWSSTKWMRVSEY 288
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 251 bits (641), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 123/209 (58%), Positives = 156/209 (74%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FLT EC++LINLAK +++S V D+ +G SK S VRTSSGTF+ +G+
Sbjct: 79 ISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSETGRSKDSRVRTSSGTFLSRGR 138
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I IE +IA ++F+P E+GE +QVL YE GQKYEPH+DYF+D+ N GG R+AT+L
Sbjct: 139 DKKIRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFNDEFNTKNGGQRVATLL 198
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A + P N +LSEC KKG++VKP GDALLF+S+ +A
Sbjct: 199 MYLSDVEEGGETVFP-AAKGNFSAVPWWN-ELSECGKKGLSVKPNMGDALLFWSMKPDAT 256
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI G KWSATKW+ V+ +
Sbjct: 257 LDPSSLHGGCPVINGNKWSATKWMRVNEY 285
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 116/209 (55%), Positives = 157/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 99 ISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 158
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P ++GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 159 DKVIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRMATLL 218
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + ++LSECAK+G++VKP+ GDALLF+S+ +A
Sbjct: 219 MYLSDVEEGGETIFPDANV--NASSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDAT 276
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI G KWS+TKW+H+ +
Sbjct: 277 LDPLSLHGGCPVIRGNKWSSTKWMHIHEY 305
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 131/287 (45%), Positives = 181/287 (63%), Gaps = 25/287 (8%)
Query: 23 SFSSTAIINPSKVKQISWK-PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK 81
S + ++P++++ IS PRAF+YE FL++ EC+HL+ L+K +L +S V D +G S
Sbjct: 245 SLNGKPALDPNRIRTISLNAPRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQTGGSS 304
Query: 82 LSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS 141
LS+VRTS+GTFI + D IIAG+E++I W+ +P+ + E Q+LRYE GQ+Y+ H+DYF
Sbjct: 305 LSEVRTSTGTFISRKYDDIIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYFF 364
Query: 142 DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
K + +R+ATVL+YLSDV +GGETVFPN + P R SEC G A+K
Sbjct: 365 HKSGMR--NNRIATVLLYLSDVEEGGETVFPNTDVPTSR----NRSMYSECGNGGKALKA 418
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV-------DSFDKIVEEGG 254
R+GDALLF+S+ D S H+GCPVI+GEKW+ATKW+HV D + +GG
Sbjct: 419 RKGDALLFWSMKPGGELDAGSSHAGCPVIKGEKWTATKWMHVNPLAGPNDDAHNVFYDGG 478
Query: 255 -----DCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C+D A C WA GEC KNP +M S C+ SC+VC
Sbjct: 479 PRSTASCSDAQAECRGWAESGECDKNPGFMRES------CKMSCRVC 519
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 250 bits (639), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 122/209 (58%), Positives = 158/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK +++S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 80 ISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFLTRGQ 139
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II GIE +++ +TFLP E+GE +Q+L YE GQKYEPHYDYF D N GG R+ATVL
Sbjct: 140 DKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNTKNGGQRMATVL 199
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A + P N +LS+C K+G++VKP+ GDALLF+S+ +A
Sbjct: 200 MYLSDVEEGGETVFP-AAKGNFSSVPWWN-ELSDCGKEGLSVKPKMGDALLFWSMKPDAS 257
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 122/209 (58%), Positives = 157/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK +++S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 80 ISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFLTRGQ 139
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II GIE +++ +TFLP E+GE +Q+L YE GQKYEPHYDYF D N GG R+ATVL
Sbjct: 140 DKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNTKNGGQRMATVL 199
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A + P N +LS C K+G++VKP+ GDALLF+S+ +A
Sbjct: 200 MYLSDVEEGGETVFP-AAKGNFSSVPWWN-ELSXCGKEGLSVKPKMGDALLFWSMKPDAS 257
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 249 bits (637), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 121/213 (56%), Positives = 159/213 (74%), Gaps = 2/213 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAFVY FLT EC++LI++AK + +S V D+ +G+SK S VRTSSGTF+
Sbjct: 78 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLA 137
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D I+ IE KI+ +TF+P E+GE +QVL YE GQKYEPHYDYF D N GG R+A
Sbjct: 138 RGRDKIVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQRIA 197
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVLMYL+DV +GGETVFP A + P N +L EC KKG+++KP+RGDALLF+S+
Sbjct: 198 TVLMYLTDVEEGGETVFP-AAKGNFSFVPWWN-ELFECGKKGLSIKPKRGDALLFWSMKP 255
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
+A DP SLH GCPVI+G KWS+TKW+ V ++
Sbjct: 256 DASLDPSSLHGGCPVIKGNKWSSTKWMRVSEYN 288
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 119/213 (55%), Positives = 161/213 (75%), Gaps = 4/213 (1%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAF+Y FLT+ EC+HLI +A+ L +S V D+ +G+SK S +RTSSGTF+
Sbjct: 3 VEVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTFLM 62
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D +I IE +IA +TF+P E GE +QVL+Y+ +KYEPHYDYF D N GG R+A
Sbjct: 63 RGQDPVIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQRIA 122
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATN-DDLSECAKKGIAVKPRRGDALLFFSLH 213
TVLMYLS+V +GGETVFP A+ +T + D LSECA+KG++V+PR GDALLF+S+
Sbjct: 123 TVLMYLSNVEEGGETVFPAAQ---VNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSMK 179
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A D SLH GCPVI+G KWSATKW+HV+++
Sbjct: 180 PDATLDSTSLHGGCPVIKGTKWSATKWLHVENY 212
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 116/209 (55%), Positives = 157/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 100 ISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 159
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P ++GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 160 DKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRMATLL 219
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + ++LSECAK+G++VKP+ GDALLF+S+ +A
Sbjct: 220 MYLSDVEEGGETIFPDANV--NVSSLPWYNELSECAKRGLSVKPKMGDALLFWSMKPDAT 277
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI G KWS+TKW+H+ +
Sbjct: 278 LDPLSLHGGCPVIRGNKWSSTKWMHIHEY 306
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 115/209 (55%), Positives = 158/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 99 ISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 158
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P ++GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 159 DKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLL 218
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + ++LS+CAK+G++VKP+ GDALLF+S+ +A
Sbjct: 219 MYLSDVEEGGETIFPDANV--NASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDAT 276
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI+G KWS+TKW+H+ +
Sbjct: 277 LDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 119/209 (56%), Positives = 157/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC+++I+LAK +K+S V D+ +G SK S VRTSSG F+ +G+
Sbjct: 80 VSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSRVRTSSGMFLRRGR 139
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P E+GE +QVL YE GQKY+ HYDYF D+ N GG R+AT+L
Sbjct: 140 DKIIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRIATLL 199
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A + P N +LSEC KKG++VKP+ GDALLF+S+ +A
Sbjct: 200 MYLSDVEEGGETVFP-ATKANFSSVPWWN-ELSECGKKGLSVKPKMGDALLFWSMRPDAT 257
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G KWS+TKW+HV+ +
Sbjct: 258 LDPSSLHGGCPVIKGNKWSSTKWMHVEEY 286
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 115/209 (55%), Positives = 157/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 99 ISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 158
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P ++GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 159 DKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLL 218
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + ++LS+CAK+G++VKP+ GDALLF+S+ A
Sbjct: 219 MYLSDVEEGGETIFPDANV--NASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPGAT 276
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI+G KWS+TKW+H+ +
Sbjct: 277 LDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 121/209 (57%), Positives = 153/209 (73%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAFVY FL+ ECDHLI+LAK +K+S V D+ +G SK S VRTSSG F+ +G+
Sbjct: 156 LSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLRRGQ 215
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P E GE +QVL YE GQKYEPH+DYF D N GG R+AT+L
Sbjct: 216 DKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQRIATLL 275
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV GGETVFP++ +P N +LSECAK G++VKP+ GDALLF+S+ +
Sbjct: 276 MYLSDVEDGGETVFPSSTT-NSSSSPFYN-ELSECAKGGLSVKPKMGDALLFWSMKPDGS 333
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G KWS+TKW+ V +
Sbjct: 334 LDPTSLHGGCPVIKGNKWSSTKWMRVHEY 362
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 131/256 (51%), Positives = 171/256 (66%), Gaps = 17/256 (6%)
Query: 6 LSLNFFFLLSFSLLI-----RKSFSS---TAIINPSKVKQ-------ISWKPRAFVYEGF 50
L+L FF L++ FSS TA ++ K+ ISW+PRAFVY F
Sbjct: 30 LALGFFMLIALRFFSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNF 89
Query: 51 LTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIAT 110
L+ EC +LI+LAK +++S V D+ +GES S VRTSSG F+ +G+D II IE +IA
Sbjct: 90 LSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIAD 149
Query: 111 WTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETV 170
+TF+P E+GE +Q+L YE GQKY+ HYDYF D+ NI +GG R+AT+LMYLSDV +GGETV
Sbjct: 150 FTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETV 209
Query: 171 FPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVI 230
FP A + P N +LSEC K G++VKP+ GDALLF+S+ +A DP SLH CPVI
Sbjct: 210 FP-AAKGNFSSVPWWN-ELSECGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVI 267
Query: 231 EGEKWSATKWIHVDSF 246
G KWS TKW+HVD +
Sbjct: 268 RGNKWSCTKWMHVDKY 283
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 121/209 (57%), Positives = 153/209 (73%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAFVY FL+ ECDHLI+LAK +K+S V D+ +G SK S VRTSSG F+ +G+
Sbjct: 99 LSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLRRGQ 158
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P E GE +QVL YE GQKYEPH+DYF D N GG R+AT+L
Sbjct: 159 DKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQRIATLL 218
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV GGETVFP++ +P N +LSECAK G++VKP+ GDALLF+S+ +
Sbjct: 219 MYLSDVEDGGETVFPSSTT-NSSSSPFYN-ELSECAKGGLSVKPKMGDALLFWSMKPDGS 276
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH GCPVI+G KWS+TKW+ V +
Sbjct: 277 LDPTSLHGGCPVIKGNKWSSTKWMRVHEY 305
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 133/246 (54%), Positives = 171/246 (69%), Gaps = 6/246 (2%)
Query: 5 RLSLNFFFLLSFSLLIRKSFSSTAIINPSKVKQ----ISWKPRAFVYEGFLTDLECDHLI 60
R SLN F S + S + + K +Q ISW+PRAF+Y FLT ECD+LI
Sbjct: 21 RKSLNVHFPNDLSSIAHNSKIHESGDDEGKAEQWVEAISWEPRAFIYHNFLTKAECDYLI 80
Query: 61 NLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGE 120
NLAK +++S V D+ SG+SK S VRTSSGTF+P+G+D II IE +IA ++F+P E+GE
Sbjct: 81 NLAKPHMQKSMVVDSSSGKSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFSFIPSEHGE 140
Query: 121 DIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRR 180
+Q+L YE GQKYEPH+DYF D N GG R+ATVLMYLSDV +GGETVFP+A+
Sbjct: 141 GLQILHYEVGQKYEPHFDYFMDDYNTENGGQRIATVLMYLSDVEEGGETVFPSAKGNI-S 199
Query: 181 RTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKW 240
P N +LSEC K G++VKP+ GDALLF+S+ +A DP SLH GCPVI G KWS+TKW
Sbjct: 200 SVPWWN-ELSECGKGGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIRGNKWSSTKW 258
Query: 241 IHVDSF 246
+ V+ +
Sbjct: 259 MRVNEY 264
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 246 bits (629), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 114/209 (54%), Positives = 158/209 (75%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 99 ISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 158
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
+ +I IE +IA +TF+P ++GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 159 NKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLL 218
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + ++LS+CAK+G++VKP+ GDALLF+S+ +A
Sbjct: 219 MYLSDVEEGGETIFPDANV--NASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDAT 276
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP+SLH GCPVI+G KWS+TKW+H+ +
Sbjct: 277 LDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 123/232 (53%), Positives = 166/232 (71%), Gaps = 11/232 (4%)
Query: 21 RKSFSSTAIINPSK-VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGE 79
R+SF N + ++ ISW+PRAFVY FLT+ EC+HLI+LAK + +S V D +G+
Sbjct: 65 RESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGK 124
Query: 80 SKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY 139
S S VRTSSGTF+ +G D I+ IE++I+ +TF+P ENGE +QVL YE GQ+YEPH+DY
Sbjct: 125 SIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDY 184
Query: 140 FSDKVNIVRGGHRLATVLMYLSDVAKGGETVFP----NAEEPPRRRTPATNDDLSECAKK 195
F D+ N+ +GG R+ATVLMYLSDV +GGETVFP N + P D+LS+C K+
Sbjct: 185 FFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWW------DELSQCGKE 238
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
G++V P++ DALLF+S+ +A DP SLH GCPVI+G KWS+TKW HV ++
Sbjct: 239 GLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEYN 290
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 124/220 (56%), Positives = 156/220 (70%), Gaps = 13/220 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD-----------VR 86
+SW+PRA +Y FL EC++LINLAK + +S V D+ +G+SK S VR
Sbjct: 84 LSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSRVR 143
Query: 87 TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI 146
TSSG F+ +G+D I IE +IA +TF+P E+GE +QVL YE GQKYEPH+DYF D+ N
Sbjct: 144 TSSGMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 203
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
GG R+ATVLMYLSDV KGGETVFP A + P D+LSECAK GI+V+PR GDA
Sbjct: 204 KNGGQRIATVLMYLSDVEKGGETVFP-ASKVNSSSVPWW-DELSECAKAGISVRPRMGDA 261
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
LLF+S+ +A DP SLH+GCPVI+G+KWSATKWIHV +
Sbjct: 262 LLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWIHVGEY 301
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 128/256 (50%), Positives = 172/256 (67%), Gaps = 17/256 (6%)
Query: 6 LSLNFFFLLSFSLLI-----RKSFSS---TAIINPSKVKQ-------ISWKPRAFVYEGF 50
L+L FF L++ L FSS TA ++ K+ ISW+PRAFVY F
Sbjct: 30 LALGFFMLIALRFLSPPETSHHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNF 89
Query: 51 LTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIAT 110
L+ EC +LI+LAK +++S V DN +G++ VRTSSG F+ +G+D I++ IE +IA
Sbjct: 90 LSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIAD 149
Query: 111 WTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETV 170
+TF+P E+GE +Q+L YE GQKY+ HYDYF D+ NI +GG R+AT+LMYLSDV +GGETV
Sbjct: 150 FTFIPIEHGEGLQILHYEVGQKYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETV 209
Query: 171 FPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVI 230
FP A + P N +LS+C K G++VKP+ GDALLF+S+ +A DP SLH CPVI
Sbjct: 210 FP-AAKGNFSSVPWWN-ELSKCGKGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVI 267
Query: 231 EGEKWSATKWIHVDSF 246
G KWS TKW+HVD +
Sbjct: 268 RGNKWSCTKWMHVDKY 283
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 120/214 (56%), Positives = 158/214 (73%), Gaps = 10/214 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FLT+ EC+HLI+LAK + +S V D +G+S S VRTSSGTF+ +G
Sbjct: 83 ISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSRVRTSSGTFLKRGH 142
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I+ IE++I+ +TF+P ENGE +QVL YE GQKYEPH+DYF D+ N+ +GG R+ATVL
Sbjct: 143 DEIVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVRKGGQRIATVL 202
Query: 158 MYLSDVAKGGETVFP----NAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
MYLSDV +GGETVFP N + P D+LS+C K+G++V P++ DALLF+S+
Sbjct: 203 MYLSDVDEGGETVFPAAKGNISDVPWW------DELSQCGKEGLSVLPKKRDALLFWSMK 256
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
+A DP SLH GCPVI+G KWS+TKW HV ++
Sbjct: 257 PDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEYN 290
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 125/280 (44%), Positives = 169/280 (60%), Gaps = 20/280 (7%)
Query: 33 SKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTF 92
S++ +SW PR F+Y+G LT ECD L++ ++S+L+RS V+D +G +SD+RTSSG F
Sbjct: 65 SRMVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVSDATTGAGAVSDIRTSSGMF 124
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHR 152
+G+ ++ IE+++A WT LP ENGE IQVLRYE QKY+PH+DYFS GG+R
Sbjct: 125 YERGETELVKRIENRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFDGADDNGGNR 184
Query: 153 LATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL 212
+ATVLMYL+ +GGETVFP + T S ++G+AVKP +GDA+LF+S+
Sbjct: 185 MATVLMYLATPEEGGETVFPKVVGWVVQLTTTA----SAPCRQGLAVKPAKGDAVLFWSI 240
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG----------------GDC 256
+ DP SLH CPVI+G KWSATKWIHV + E C
Sbjct: 241 RPDGRFDPGSLHGSCPVIKGVKWSATKWIHVGHYAMSGERSETVKRVQYVPPPPPAVPGC 300
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+ + C WA GEC NP YM+G +PG C +C C
Sbjct: 301 ENQHKLCSHWAESGECESNPGYMIGKKGMPGACILACNRC 340
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 118/209 (56%), Positives = 156/209 (74%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC++LI+LAK +K+S V D +G SK S VRTSSG F+ +G+
Sbjct: 113 LSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLGRGQ 172
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +I+ +TF+P ENGE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 173 DKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQRIATLL 232
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+++ +P N +LSECAKKG+AVKP+ GDALLF+S+ +
Sbjct: 233 MYLSDVEEGGETIFPSSKA-NSSSSPFYN-ELSECAKKGLAVKPKMGDALLFWSMRPDGS 290
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH GCPVI+G KWS+TKW+ V +
Sbjct: 291 LDATSLHGGCPVIKGNKWSSTKWMRVHEY 319
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 120/209 (57%), Positives = 153/209 (73%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAFVY FL+ ECDHLI+LAK +K+S V D+ +G SK S VRTSSG F+ +G+
Sbjct: 99 LSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSRVRTSSGMFLRRGQ 158
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P E+GE +QVL YE GQKYEPH+DYF D N GG R+AT+L
Sbjct: 159 DKIIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQRIATLL 218
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV GGETVFP++ +P N +LSECAK G++VKP+ GDALLF+S+ +
Sbjct: 219 MYLSDVEDGGETVFPSSTT-NSSSSPFYN-ELSECAKGGLSVKPKMGDALLFWSMKPDGS 276
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH GCPVI+G KWS+TKW+ V +
Sbjct: 277 MDSTSLHGGCPVIKGNKWSSTKWMRVHEY 305
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/262 (50%), Positives = 159/262 (60%), Gaps = 14/262 (5%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PRA+VY GFLTD ECDH I A +L +S V D +GE S +RTS G F +G+D ++
Sbjct: 83 PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSAIRTSDGMFFDRGEDDVV 142
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGHRLATVLMY 159
+E +I+ WT LP ENGE +QVLRY GQKY+ H D F DK N GG R+ATVLMY
Sbjct: 143 DAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGGQRVATVLMY 202
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV GGETVFP P ++ S CA++G+AVKPRRGDALLF+S+
Sbjct: 203 LNDVDDGGETVFPETTAKPH----VGDERYSACARRGVAVKPRRGDALLFWSMDETFT-- 256
Query: 220 PVSLHSGCPV-IEGEKWSATKWIHVDSFD---KIVEEGGDCTDNNASCERWAALGECTKN 275
SLH GCPV G KWS TKWIH +F K+ G C D +A+C WA GEC KN
Sbjct: 257 -RSLHGGCPVGAGGVKWSMTKWIHKGAFSRGHKMKFPEGVCDDEDANCAGWAKSGECEKN 315
Query: 276 PEYMVGSA-QLPGFCRRSCKVC 296
P YM G + G C SC C
Sbjct: 316 PAYMTGDGRENDGHCAFSCGTC 337
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 133/280 (47%), Positives = 177/280 (63%), Gaps = 23/280 (8%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT-----S 88
+++ +SWKPRA + GFL ECDH+I+LA+++L+ S V G KL VRT S
Sbjct: 14 RIELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSR-DGSGKLDSVRTRQGLSS 72
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD----KV 144
SGTF+ K +D+++AG+ED+I T LP + E +QVL+YE GQKY HYD ++
Sbjct: 73 SGTFLTKRQDSVVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQL 132
Query: 145 NIVR---GGHRLATVLMYLSDVAKGGETVFPNA---EEPPRRRTPATNDDLSECAKKGIA 198
I R GG R AT+LMYLSDV +GGET FP+ +E + + P SEC +G+A
Sbjct: 133 AIRRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEGAQAQPP-----YSECGSRGVA 187
Query: 199 VKPRRGDALLFFSLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVE-EGGDC 256
VKPR+GDA+LF+SL ++ D SLH+GCPV +G K+SAT WIHV+ + G C
Sbjct: 188 VKPRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWIHVEPYSNTGPLHPGFC 247
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
DNNA C WAALGEC +N +M G+ G CR SCKVC
Sbjct: 248 RDNNAKCPEWAALGECERNVVFMRGNGTYRGHCRLSCKVC 287
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 118/209 (56%), Positives = 156/209 (74%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC++LI+LAK +K+S V D +G SK S VRTSSG F+ +G+
Sbjct: 14 LSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLGRGQ 73
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +I+ +TF+P ENGE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 74 DKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQRIATLL 133
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+++ +P N +LSECAKKG+AVKP+ GDALLF+S+ +
Sbjct: 134 MYLSDVEEGGETIFPSSKA-NSSSSPFYN-ELSECAKKGLAVKPKMGDALLFWSMRPDGS 191
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH GCPVI+G KWS+TKW+ V +
Sbjct: 192 LDATSLHGGCPVIKGNKWSSTKWMRVHEY 220
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 243 bits (620), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 119/214 (55%), Positives = 154/214 (71%), Gaps = 6/214 (2%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW+PRA VY FL++ EC+HLINLAK + +S V D +G SK S VRTSSGTF+
Sbjct: 80 VEVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLR 139
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G D ++ IE +I+ +TF+P ENGE +QVL Y+ GQKYEPHYDYF D+ N GG R+A
Sbjct: 140 RGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIA 199
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATN--DDLSECAKKGIAVKPRRGDALLFFSL 212
TVLMYLSDV GGETVFP A R A ++LS+C K+G++V P++ DALLF+++
Sbjct: 200 TVLMYLSDVDDGGETVFPAA----RGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNM 255
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A DP SLH GCPV++G KWS+TKW HV F
Sbjct: 256 RPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 118/209 (56%), Positives = 152/209 (72%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK + +S V D+ +G SK S VRTSSG F+ +G+
Sbjct: 81 ISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSRVRTSSGMFLRRGR 140
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA ++F+P E+GE +QVL YE GQKYE HYDYF D+ N GG R AT+L
Sbjct: 141 DKIIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQRTATLL 200
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A + P+ N +LSECA++G++VKP+ G+ALLF+S +A
Sbjct: 201 MYLSDVEEGGETVFP-AAKANISNVPSWN-ELSECARQGLSVKPKMGNALLFWSTRPDAT 258
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH CPVI G KWSATKW+H+ +
Sbjct: 259 LDPASLHGSCPVIRGNKWSATKWMHLGEY 287
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 119/214 (55%), Positives = 154/214 (71%), Gaps = 6/214 (2%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW+PRA VY FLT+ EC+HLI+LAK + +S V D +G SK S VRTSSGTF+
Sbjct: 80 VEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLR 139
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G D ++ IE +I+ +TF+P ENGE +QVL Y+ GQKYEPHYDYF D+ N GG R+A
Sbjct: 140 RGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIA 199
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATN--DDLSECAKKGIAVKPRRGDALLFFSL 212
TVLMYLSDV GGETVFP A R A ++LS+C K+G++V P++ DALLF+++
Sbjct: 200 TVLMYLSDVDDGGETVFPAA----RGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNM 255
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A DP SLH GCPV++G KWS+TKW HV F
Sbjct: 256 RPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 177/310 (57%), Gaps = 29/310 (9%)
Query: 5 RLSLNFFFLLSFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAK 64
RLSL LL+ + S +P + A ++ FLT+ ECDH++ LAK
Sbjct: 2 RLSLGLTALLTLQASFLCALSQDTAQDPKRPWMQVLDAEARIFINFLTEEECDHIVALAK 61
Query: 65 SQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQV 124
L+RS V D +G S++SD+RTS G F+ +G D +A IE++IA WT LP NGE +QV
Sbjct: 62 PHLERSGVVDTATGGSEISDIRTSKGMFLERGHDDTVAAIEERIARWTLLPVGNGEGLQV 121
Query: 125 LRYEHGQKYEPHYDYFSDKVN-IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTP 183
L Y G+KY+ DYF DKVN GG+R ATVLMYL+ V +GGETVFPN P P
Sbjct: 122 LNYHPGEKYD---DYFFDKVNGESNGGNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGP 178
Query: 184 ATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
+ECA++ +A KP +G A+LF S+ + + SLH+ CPV++GEKWSA KWIHV
Sbjct: 179 T----FTECARRHLAAKPTKGSAVLFHSIKPSGDLERRSLHTACPVVKGEKWSAPKWIHV 234
Query: 244 DSFDKIVEEGGD-----------------CTDNNASCERWAALGECTKNPEYMVGSAQLP 286
+ GG+ C D + +CE+WAA GEC N +M+G+ P
Sbjct: 235 GHYAM----GGEAAVPVPQHPQKVGNLLGCEDADENCEQWAANGECENNKVFMIGTRDRP 290
Query: 287 GFCRRSCKVC 296
G C +SC C
Sbjct: 291 GSCVKSCDAC 300
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 114/216 (52%), Positives = 154/216 (71%), Gaps = 2/216 (0%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+++ ISW+PRAF+Y FLT EC++LIN+A +++S VADN SG+S + DVR S+G F+
Sbjct: 73 RMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSVVHDVRKSTGAFL 132
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+G+D I+ IE +IA TF+P ENGE I V+ YE GQ Y+PHYDYF D NI GG R+
Sbjct: 133 DRGQDEIVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHYDYFIDDFNIENGGQRI 192
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
AT+LMYLS+V +GGET+FP A + P N +LS C K G+++KP+ GDALLF+S+
Sbjct: 193 ATMLMYLSNVEEGGETMFPRA-KANFSSVPWWN-ELSNCGKMGLSIKPKMGDALLFWSMK 250
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI 249
NA D ++LHS CPVI+G KWS TKW+H F +
Sbjct: 251 PNATLDALTLHSACPVIKGNKWSCTKWMHPTEFKMV 286
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 119/214 (55%), Positives = 153/214 (71%), Gaps = 6/214 (2%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW+PRA VY FLT+ EC+HLI+LAK + +S V D +G SK S VRTSSGTF+
Sbjct: 80 VEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSGTFLR 139
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G D ++ IE +I+ +TF+P ENGE +QVL Y+ GQKYEPHYDYF D+ N GG R+A
Sbjct: 140 RGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQRIA 199
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATN--DDLSECAKKGIAVKPRRGDALLFFSL 212
TVLMYLSDV GGETVFP A R A ++LS+C K+G++V P+ DALLF+++
Sbjct: 200 TVLMYLSDVDDGGETVFPAA----RGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNM 255
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A DP SLH GCPV++G KWS+TKW HV F
Sbjct: 256 RPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 121/216 (56%), Positives = 153/216 (70%), Gaps = 2/216 (0%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ ISW+PRA VY FLT EC +LI LAK +++S V D +G+S S VRTSSG
Sbjct: 74 NERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSG 133
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ +G+D I IE +I+ +TF+P E+GE +QVL YE GQKYEPHYDYF D+ N GG
Sbjct: 134 TFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 193
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+ATVLMYLSDV +GGETVFP A + P N +LSEC K G++VKP+ GDALLF+
Sbjct: 194 QRIATVLMYLSDVEEGGETVFP-AAKGNYSAVPWWN-ELSECGKGGLSVKPKMGDALLFW 251
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
S+ +A DP SLH GC VI+G KWS+TKW+ V +
Sbjct: 252 SMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEY 287
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 118/260 (45%), Positives = 162/260 (62%), Gaps = 19/260 (7%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+A+++ FLT+ EC HLI LAK+QL S V + +S S +RTS+G F+ KG+
Sbjct: 235 RPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGKKSTKSGIRTSAGMFLTKGQTPT 294
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV--RGGHRLATVLM 158
+ +E+++A LP+ENGE +Q+LRYEHGQKY+PHYDYF DK+N RGG R+AT+L+
Sbjct: 295 VRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINPSPNRGGQRMATMLI 354
Query: 159 YLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIP 218
YL D +GGET+FPNA++P + S+CAK+G+ VK +RGDA+LF+SL ++
Sbjct: 355 YLKDTEEGGETIFPNAKKPEGFHDGEKDGAFSDCAKRGLPVKSKRGDAVLFWSLTSDYKL 414
Query: 219 DPVSLHSGCPVIEGEKWSATKWIHVDSFD-----------------KIVEEGGDCTDNNA 261
D SLH CPV+ GEKW+A KWI V FD V+ C D
Sbjct: 415 DEGSLHGACPVLRGEKWTAVKWIRVAKFDGRFTGELPMPSLTRGDRAAVDATARCVDEWD 474
Query: 262 SCERWAALGECTKNPEYMVG 281
C WA G C +NPE+M G
Sbjct: 475 ECAEWARKGWCERNPEFMTG 494
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 113/209 (54%), Positives = 154/209 (73%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW PRA + FLTD ECDHLI +A +++S V D+ +G S+ S VRTSSG F+ +G+
Sbjct: 5 ISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFLNRGQ 64
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I+ IEDKIA TF+PK++GE IQVL YE GQKY+ H+D+F D VN GG R+AT+L
Sbjct: 65 DRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQRIATLL 124
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYL+DV +GGETVFP + + + ++ LSEC ++G++V+P+RGDALLF+S+ +A
Sbjct: 125 MYLTDVEEGGETVFPKSAK--NSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 182
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH GCPVI+G+KWSATKW+ V +
Sbjct: 183 LDHSSLHGGCPVIKGDKWSATKWMRVSEY 211
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 113/209 (54%), Positives = 154/209 (73%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW PRA + FLTD ECDHLI +A +++S V D+ +G S+ S VRTSSG F+ +G+
Sbjct: 17 ISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFLNRGQ 76
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I+ IEDKIA TF+PK++GE IQVL YE GQKY+ H+D+F D VN GG R+AT+L
Sbjct: 77 DRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQRIATLL 136
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYL+DV +GGETVFP + + + ++ LSEC ++G++V+P+RGDALLF+S+ +A
Sbjct: 137 MYLTDVEEGGETVFPKSAK--NSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 194
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH GCPVI+G+KWSATKW+ V +
Sbjct: 195 LDHSSLHGGCPVIKGDKWSATKWMRVSEY 223
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 115/209 (55%), Positives = 151/209 (72%), Gaps = 2/209 (0%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC++LINLAK + +S V D+ +G SK S VRTSSG F+ +G+
Sbjct: 80 LSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSRVRTSSGMFLRRGR 139
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA ++F+P E+GE +QVL YE GQKYE H+DYF D+ N GG R AT+L
Sbjct: 140 DRVIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQRTATLL 199
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP A P N +LSECAK+G+++KP+ G+ALLF+S +A
Sbjct: 200 MYLSDVEEGGETVFPAANMNI-SAVPWWN-ELSECAKQGLSLKPKMGNALLFWSTRPDAT 257
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH CPVI G KWSATKW+H+ +
Sbjct: 258 LDPSSLHGSCPVIRGNKWSATKWMHLGEY 286
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 112/209 (53%), Positives = 156/209 (74%), Gaps = 2/209 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW+PR F+Y FLT EC+HLIN+AK +++S V ++ +G S S VRTSSGTF+
Sbjct: 85 VEIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESRVRTSSGTFLA 144
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D I+ IE++IA +TF+P +NGE++QVL Y+ G+KY PH+DYF D +N GG R+A
Sbjct: 145 RGRDKIVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGGDRIA 204
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T+LMYLSDV +GGETVFP+A + P N +LS C KKG+++KP+ +ALLF+S+
Sbjct: 205 TMLMYLSDVEEGGETVFPDA-KGNFSSMPGWN-ELSVCGKKGLSIKPKMRNALLFWSIKP 262
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
+A DP+SLH CPVI+G KWS+TKWI +
Sbjct: 263 DATYDPLSLHGSCPVIKGNKWSSTKWIRI 291
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 114/216 (52%), Positives = 152/216 (70%), Gaps = 2/216 (0%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ ISW+PRAF+Y FLT EC+HLIN+AK + +S V D +G+S S +RTSSG
Sbjct: 62 NKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLNSSIRTSSG 121
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ + D I++ IE +IA +TF+P E+GE VL YE GQKYEPHYDYF D + G
Sbjct: 122 TFLDREGDEIVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYDYFLDTFSTRHAG 181
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+AT+LMYLSDV +GGETVFPNA + P N +LS+C K G+++KP+ G+A+LF+
Sbjct: 182 QRIATMLMYLSDVEEGGETVFPNA-KGNFSSVPWWN-ELSDCGKGGLSIKPKMGNAILFW 239
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
S+ +A DP SLH CPVI+G+KWS KW+H D +
Sbjct: 240 SMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADEY 275
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 236 bits (602), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 126/280 (45%), Positives = 171/280 (61%), Gaps = 25/280 (8%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+A++ FL+ ECDHL+ LAK +L S V +G+S SD+RTS+G F+ KG+D I
Sbjct: 48 QPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGE-AGDSVPSDIRTSAGMFLRKGQDKI 106
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV--RGGHRLATVLM 158
+ IE++IA + P +NGE +Q+LRY+ GQKY+PH+DYF DKVN RGG RLAT+L+
Sbjct: 107 VKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPAPKRGGQRLATMLI 166
Query: 159 YLSDVAKGGETVFPNAEEPPRRRTP------ATNDDLSECAKKGIAVKPRRGDALLFFSL 212
YL D KGGET FPNA+ P A++ + ++CAKKGI VK RGDA+LFFS+
Sbjct: 167 YLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGIPVKSVRGDAILFFSM 226
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEE------------GGDCTDNN 260
+ + D SLH CPVIEG+KW+A KWI V FD +E C D+
Sbjct: 227 TQDGVLDRGSLHGACPVIEGQKWTAVKWIRVGKFDGNYQEEIPMPKLSRRTDEEPCVDDW 286
Query: 261 ASCERWAALGECTKNPEYMVGSAQL----PGFCRRSCKVC 296
C +WA+ G C NPE+M + C +SC +C
Sbjct: 287 DECAKWASQGWCELNPEFMTTADSARDSQSAACAKSCGLC 326
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 118/258 (45%), Positives = 159/258 (61%), Gaps = 17/258 (6%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+A+++ FLT EC+HL+ LAK QL S V + S +S +RTS+G F+ +G+D
Sbjct: 176 EPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSKIRTSAGMFLGRGQDPT 235
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV--RGGHRLATVLM 158
+ IE++IA + LP+ NGE +Q+LRYE+GQKY+PH+DYF D+VN RGG R+AT+L+
Sbjct: 236 VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRRGGQRMATMLI 295
Query: 159 YLSDVAKGGETVFPNAEEPP--RRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
YL D +GGET+FPN P P ++ S+CAKKGI VK RGDA+LF+SL +
Sbjct: 296 YLEDTTEGGETIFPNGVRPEDWDADEPGNHNSWSDCAKKGIPVKSHRGDAVLFWSLKEDY 355
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFD-------------KIVEEGGDCTDNNASC 263
D SLH CPVI GEKW+A KWI V FD + G+C D C
Sbjct: 356 TLDNGSLHGACPVIAGEKWTAVKWIRVAKFDGGFTDPLPMPALARSDRTKGECLDEWDEC 415
Query: 264 ERWAALGECTKNPEYMVG 281
WA G C +NP +M G
Sbjct: 416 GEWAKKGWCDRNPSFMTG 433
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 112/212 (52%), Positives = 152/212 (71%), Gaps = 8/212 (3%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V +SW PRAF+ + FL+D ECD+++ A+ ++ +S+V DN SG+S S++RTS+GT+
Sbjct: 21 EVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWF 80
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGH 151
KG+D++I+ IE ++A T +P EN E +QVL Y GQKYEPHYDYF D VN GG
Sbjct: 81 AKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ T+LMYL+ V +GGETV PNAE+ T D SECAK+G+AVKP +GDAL+F+S
Sbjct: 141 RVVTMLMYLTTVEEGGETVLPNAEQ------KVTGDGWSECAKRGLAVKPIKGDALMFYS 194
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L + DP SLH CP ++G+KWSATKWIHV
Sbjct: 195 LKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 112/212 (52%), Positives = 152/212 (71%), Gaps = 8/212 (3%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V +SW PRAF+ + FL+D ECD+++ A+ ++ +S+V DN SG+S S++RTS+GT+
Sbjct: 13 EVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWF 72
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGH 151
KG+D++I+ IE ++A T +P EN E +QVL Y GQKYEPHYDYF D VN GG
Sbjct: 73 AKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ T+LMYL+ V +GGETV PNAE+ T D SECAK+G+AVKP +GDAL+F+S
Sbjct: 133 RVVTMLMYLTTVEEGGETVLPNAEQ------KVTGDGWSECAKRGLAVKPIKGDALMFYS 186
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L + DP SLH CP ++G+KWSATKWIHV
Sbjct: 187 LKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 218
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 112/212 (52%), Positives = 152/212 (71%), Gaps = 8/212 (3%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V +SW PRAF+ + FL+D ECD+++ A+ ++ +S+V DN SG+S S++RTS+GT+
Sbjct: 12 EVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTSTGTWF 71
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGH 151
KG+D++I+ IE ++A T +P EN E +QVL Y GQKYEPHYDYF D VN GG
Sbjct: 72 AKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ T+LMYL+ V +GGETV PNAE+ T D SECAK+G+AVKP +GDAL+F+S
Sbjct: 132 RVVTMLMYLTTVEEGGETVLPNAEQ------KVTGDGWSECAKRGLAVKPIKGDALMFYS 185
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L + DP SLH CP ++G+KWSATKWIHV
Sbjct: 186 LKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 217
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 106/210 (50%), Positives = 147/210 (70%), Gaps = 4/210 (1%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRA++Y FLT+ E D+L+ K +++S V DN +G+S S VRTSSG F+ +G+
Sbjct: 4 LSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLNRGE 63
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +T +PKENGE +Q+L Y+ ++Y PH+DYF D N GG R+AT+L
Sbjct: 64 DDVIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGGQRIATML 123
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV GGETVFP + + P N S+CA+ G A KP++GDAL F+SL +
Sbjct: 124 MYLSDVEDGGETVFPESSDKPN----VGNTKFSQCAQAGAAAKPKKGDALFFYSLTPDGR 179
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
D SLH+GCPV++G+KWSATKW+ VD F+
Sbjct: 180 MDEKSLHAGCPVMKGDKWSATKWLRVDRFE 209
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 113/212 (53%), Positives = 152/212 (71%), Gaps = 2/212 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW+PRAFVY FL+ EC +LI+LAK +++S V DN +G++ VRTSSG F+
Sbjct: 39 VEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTSSGMFLN 98
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D I++ IE +IA +TF+P E+GE +Q+L YE GQKY+ HYD+F D+ N+ G R+A
Sbjct: 99 RGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEFNLKEIGQRMA 158
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T+LMYLSDV +GGETVFP A + P N +LS+C K G++VKP+ GDALLF+S+
Sbjct: 159 TLLMYLSDVEEGGETVFP-AAKGNFSSVPWWN-ELSKCGKGGLSVKPKMGDALLFWSMKP 216
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ DP SLH CPVI G KWS TKWIHV+
Sbjct: 217 DTTLDPTSLHGACPVIRGNKWSCTKWIHVNQL 248
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 114/216 (52%), Positives = 149/216 (68%), Gaps = 2/216 (0%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ ISW+PRAF+Y FLT EC+HLIN+AK + +SAV D +G S RTSSG
Sbjct: 111 NKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRERTSSG 170
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
F+ +G D I+ IE +IA +TF+P E+GE+ VL YE GQKYEPHYDYF D + G
Sbjct: 171 AFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAG 230
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+AT+LMYLSDV +GGETVFPNA + P N +LS+C K G+++KP+ G+A+LF+
Sbjct: 231 QRIATMLMYLSDVEEGGETVFPNA-KGNFSSVPWWN-ELSDCGKGGLSIKPKMGNAILFW 288
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
S+ +A DP SLH CPVI+G+KW KW+HV F
Sbjct: 289 SMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEF 324
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 122/285 (42%), Positives = 175/285 (61%), Gaps = 21/285 (7%)
Query: 25 SSTAIINPSKVKQISW-KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS 83
S +++P++++ IS PRAF+Y+ F+TD ECD LI+ +KS++ +S V D +G + S
Sbjct: 167 SPEHVLDPNRMQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRMSKSGVVDAETGGTAKS 226
Query: 84 DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK 143
D+RTS+G+F+ G + ++ +E ++AT++ LP ++ E QVLRYE Q+Y HYDYF K
Sbjct: 227 DIRTSTGSFVGIGANDLMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYFFHK 286
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ +R+ T+LMYL + GGETVFPN E P R + SEC +G A R+
Sbjct: 287 GGMA--NNRIVTILMYLHEPEFGGETVFPNTEVPLERAEKGWGKNFSECGNRGRAAVVRK 344
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD-------KIVEEGG-- 254
GDAL+F+S+ DP S H+GCPV+ GEKW+ATKWIHV+ + K+ GG
Sbjct: 345 GDALIFWSMKPGGELDPGSSHAGCPVVRGEKWTATKWIHVNPTNQWNQNNHKVHYAGGPA 404
Query: 255 ---DCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C D NA+C WA GECT NP +MV S C+ SC+ C
Sbjct: 405 NSETCKDTNAACPGWAEGGECTANPGFMVNS------CKVSCRQC 443
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 111/213 (52%), Positives = 155/213 (72%), Gaps = 2/213 (0%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ +SW+PR F+Y FL EC+HLIN+AK +++S V D+ +G+S S RTSSG
Sbjct: 66 NKRWVEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSSARTSSG 125
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TFI +G D I++ IE +IA +TF+P E+GED+ +L YE GQKY+ H DYF D+VN GG
Sbjct: 126 TFIDRGYDKILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHTDYFEDEVNTKHGG 185
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+AT+LMYLSDV +GGETVFP+A+ P N +LS+C KKG+++KP+ G+A+LF+
Sbjct: 186 ERIATMLMYLSDVEEGGETVFPSAKG-NFSSVPWWN-ELSDCGKKGLSIKPKMGNAILFW 243
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
+ +A DP+S+H CPVI+G+KWS TKW+ V
Sbjct: 244 GMKPDATVDPLSVHGACPVIKGDKWSCTKWMRV 276
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 106/148 (71%), Positives = 129/148 (87%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
A++ P +QIS KPR F+Y+ FL+D E +HLI+LA+++LKRSAVADN+SG+S LS+VRT
Sbjct: 44 AVVYPHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRT 103
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSGTF+ KG+D I+ GIEDKIA WTFLPKENGEDIQVLRY+HG+KYEPHYDYF+D VN V
Sbjct: 104 SSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTV 163
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAE 175
RGGHR ATVL+YL+DV +GGETVFP AE
Sbjct: 164 RGGHRYATVLLYLTDVPEGGETVFPLAE 191
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 124/275 (45%), Positives = 162/275 (58%), Gaps = 15/275 (5%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+Q+ PRA+ + FLT E HL+ LA +LKRS V N GE + ++RTS G FI
Sbjct: 1 VQQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGN-DGEGVVDNIRTSYGMFIR 59
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ +D ++A IE +I+ WT LP E+ EDIQVLRY HGQ Y HYD DK N RLA
Sbjct: 60 RLQDPVVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDS-GDKSNEPGPKWRLA 118
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTP-ATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
T LMYLSDV +GGET FP+ P D S+CAK +A KP+ GDA+LF+S +
Sbjct: 119 TFLMYLSDVEEGGETAFPHNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVLFYSFY 178
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF-----DKIVE-------EGGDCTDNNA 261
N DP ++H+GCPVI+G KW+A W+H F +V+ + G CTD +
Sbjct: 179 PNMTMDPAAMHTGCPVIKGVKWAAPVWMHDIPFRPSEISGMVQRIPDNEPDAGTCTDLHP 238
Query: 262 SCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C WAA GEC N +M+G G CR++CK C
Sbjct: 239 RCVEWAAAGECEHNKGFMMGGPDNLGTCRKTCKAC 273
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 122/271 (45%), Positives = 169/271 (62%), Gaps = 15/271 (5%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
++ ISW PRAFVY FLT ECDHL+ + ++ RS V D+ +G+SKL D+RTS G
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRSLVVDSQTGQSKLDDIRTSYGAAFG 552
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN---IVRGGH 151
+G+D +IA IE++IA WT LP E+GE +Q+LRY GQKY+ H+D+F D V+ + G+
Sbjct: 553 RGEDPVIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLVDGN 612
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK-GIAVKPRRGDALLFF 210
R ATVL+YLS+V GGET P A+ P + ++ S CA K G++++PR+GDALLF+
Sbjct: 613 RYATVLLYLSEVEAGGETNLPLAD--PIDMSVQAIENPSPCAAKMGLSIRPRKGDALLFY 670
Query: 211 SLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIH----VDSFDKIVEEGGDCTDNNASCER 265
+ D +LH+ CP ++G KW+ATKWIH + FD + G C D C
Sbjct: 671 DMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSKPYMGRFDPL-RTAGVCRDTAQDCAA 729
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A G CT + + MVG A G CR+SC C
Sbjct: 730 LVAEGRCTSDLDTMVGPA---GKCRKSCGDC 757
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 110/213 (51%), Positives = 153/213 (71%), Gaps = 2/213 (0%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V+ +SW+PRAF+Y FL EC++LIN+AK + +S V D+ +G S S+VRTSSG F+
Sbjct: 90 RVEVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSNVRTSSGWFL 149
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+G+D II IE +IA ++ +P E+GE + VL YE QKY+ HYDYFSD +N+ GG R
Sbjct: 150 NRGQDKIIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQRG 209
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
AT+LMYLSDV KGGETVFP ++ + D+LSEC + G++V+P+ GDALLF+S+
Sbjct: 210 ATMLMYLSDVEKGGETVFPQSK--VNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVK 267
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+A DP SLH CPVI+G KWSATKW+ ++ +
Sbjct: 268 PDASLDPSSLHGSCPVIQGNKWSATKWMRLNKY 300
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 124/274 (45%), Positives = 160/274 (58%), Gaps = 14/274 (5%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+Q+ PRA+++ FLT E H++ LA +LKRS V + GE + ++RTS G FI
Sbjct: 48 VEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGS-KGEGVVDNIRTSFGMFIR 106
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ D IIA IE +I+ WT LP E+ EDIQVLRY HGQ Y HYD + + V RLA
Sbjct: 107 RLSDPIIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASS-DHVGPKWRLA 165
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T LMYLSDV +GGET FP P +SECAK +A KP+ GDA+LF+S
Sbjct: 166 TFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFLP 225
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIH--------VDSFDKIV----EEGGDCTDNNAS 262
N DP ++H+GCPVI+G KW+A W+H V +++ E G C D +
Sbjct: 226 NNTMDPAAMHTGCPVIKGIKWAAPVWMHDIPFRPEEVQGGKQLIMDRDPEAGLCVDGHPR 285
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C WAA GEC KNP YM G G CR+SC+ C
Sbjct: 286 CGEWAAAGECEKNPMYMAGGPNSLGTCRKSCRTC 319
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/272 (41%), Positives = 175/272 (64%), Gaps = 11/272 (4%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV-RTS 88
++PS+V Q+SW+PRAF+Y GFL+D ECDHLI+LA + + A SG L + ++S
Sbjct: 52 VDPSRVIQLSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGNVVLKRLLKSS 111
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
G D + A IE +I+ WTFLPKEN E ++V++Y+ + + Y+YFS+K
Sbjct: 112 EGPLYID--DEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKF 168
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G +ATVL++LS+V +GGE FP +E + + + + LS+C + ++P +G+A+L
Sbjct: 169 GEPLMATVLLHLSNVTRGGELFFPESE---LKNSQSKSGILSDCTESSSGLRPVKGNAIL 225
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK----IVEEGGDCTDNNASCE 264
FF++H NA PD S ++ CPV+EGE W ATK+ H+ + + +GG+CTD + +C
Sbjct: 226 FFNVHPNASPDKSSSYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCP 285
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+WA++GEC +NP YM+GS G CR+SC VC
Sbjct: 286 KWASIGECQRNPIYMIGSPDYYGTCRKSCNVC 317
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 111/212 (52%), Positives = 148/212 (69%), Gaps = 8/212 (3%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V +SW PRAF+ + FL+D ECD+++ A+ + +S+V DN SG+S S++RTS+GT+
Sbjct: 21 EVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSEIRTSTGTWF 80
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGH 151
KG+D++I+ IE ++A T +P EN E +QVL Y GQKYEPHYDYF D VN GG
Sbjct: 81 AKGEDSVISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ T L YL+ V +GGETV PNAE+ T D SECAK+G+AVKP +GDAL F+S
Sbjct: 141 RVVTXLXYLTTVEEGGETVLPNAEQ------KVTGDGWSECAKRGLAVKPIKGDALXFYS 194
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L + DP SLH CP ++G+KWSATKWIHV
Sbjct: 195 LKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 114/264 (43%), Positives = 165/264 (62%), Gaps = 7/264 (2%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
++ ++WKPR F+Y F+T++E HLI LA Q+KRS V G+S + RTS GTF+
Sbjct: 1 IEHVAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVV-GAGGKSVEDNYRTSYGTFLK 59
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ +D I+ IE+++A WT +P + ED Q+LRY GQ+Y+ H D D+ G R+A
Sbjct: 60 RYQDEIVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDE----EAGVRVA 115
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVL+YL++ GGET FP++E + + S+CAK +A P+RGDALLF+S++
Sbjct: 116 TVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWSINP 175
Query: 215 NA-IPDPVSLHSGCPVIEGEKWSATKWIHVDSF-DKIVEEGGDCTDNNASCERWAALGEC 272
+ D + H+GCPV+ G KW+ATKWIH F + + G C D + +C WAA G+C
Sbjct: 176 DGNTEDTHASHTGCPVLSGVKWTATKWIHARPFRPNEMADPGVCYDESPNCPEWAARGDC 235
Query: 273 TKNPEYMVGSAQLPGFCRRSCKVC 296
KN +YMV +A PG CR+SC C
Sbjct: 236 EKNSDYMVVNAVSPGVCRKSCGAC 259
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 120/283 (42%), Positives = 174/283 (61%), Gaps = 28/283 (9%)
Query: 30 INPSKVKQISW-KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+P ++ +S PRAF++ GFL++ ECD L+ A+ + +S V D +G S S++RTS
Sbjct: 153 FDPRNIQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTS 212
Query: 89 SGTFIPK----GKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
+G+F+P G + ++ IE +IA WT +P +GE IQVLRY+ GQ+Y+ H+DYF +
Sbjct: 213 TGSFVPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFFHEG 272
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ +R+ATVLMYLSDV GGETVFP+AE + P + CAK GI V P++G
Sbjct: 273 GM--KNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHH----ACAKNGITVIPKKG 326
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDS---FD---KIVEEG----- 253
DA+LF+++ D S H+GCPV+ GEKW+ATKW+HV S FD +++ EG
Sbjct: 327 DAILFWNMKVGGDLDGGSTHAGCPVVLGEKWTATKWLHVSSSTEFDARQRVLREGRETNF 386
Query: 254 GDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
G C + N C+ WA EC +NP+YM + C SC +C
Sbjct: 387 GGCRNANIQCQVWAEQNECERNPQYMRDT------CHLSCGMC 423
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 112/253 (44%), Positives = 162/253 (64%), Gaps = 10/253 (3%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW PRAF F++ ECD ++ +A+ +++RS V D+++G+SK+ +RTS TF+ +G
Sbjct: 1 VSWYPRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLNRGT 60
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY-----FSDKVNIVRGGHR 152
I+ +E+++A T LP +GED+Q+L+Y GQKY+ H+D S K GGHR
Sbjct: 61 WDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASGKQLAAEGGHR 120
Query: 153 LATVLMYLSDVAKGGETVFPNAE-EPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
+ATVL+YLSDV +GGET FP++E P R A S+CA+ +AVKPR+GD LLF+S
Sbjct: 121 VATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPRKGDGLLFWS 180
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF----DKIVEEGGDCTDNNASCERWA 267
++ DP S+H+GCPVI GEKW+ATKWIH F + C + + C+ WA
Sbjct: 181 VNNENAIDPHSMHAGCPVIRGEKWTATKWIHARPFRWTAPPPPKAPPGCDNKHELCKAWA 240
Query: 268 ALGECTKNPEYMV 280
GEC KNP +M+
Sbjct: 241 NAGECKKNPGFML 253
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 116/273 (42%), Positives = 169/273 (61%), Gaps = 15/273 (5%)
Query: 27 TAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVR 86
T ++PS+V +SW+PR FVY+GFLTD ECDHLI+LA+ + S D+ SG + + +
Sbjct: 50 TNWVDPSRVVTVSWQPRVFVYKGFLTDEECDHLISLAQGTKETSEGKDDDSGRIERNRLF 109
Query: 87 TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRY--EHGQKYEPHYDYFSDKV 144
SS + + D I++ IE++++ WT LPKEN + +QV+ Y E + Y +DYF +K
Sbjct: 110 ASSTSLL-NMDDNILSRIEERVSAWTLLPKENSKPLQVMHYGIEDAKNY---FDYFGNKS 165
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
I+ +AT++ YLS+V +GGE FP +E N S+C K +++P +G
Sbjct: 166 AIISSEPLMATLVFYLSNVTQGGEIFFPKSE--------VKNKIWSDCTKISDSLRPIKG 217
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVE-EGGDCTDNNASC 263
+A+LFF++H N PD S HS CPV+EGE W ATK ++ + + EG +CTD + +C
Sbjct: 218 NAILFFTVHPNTSPDMGSSHSRCPVLEGEMWYATKKFYLRAIKVFSDSEGSECTDEDENC 277
Query: 264 ERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAALGEC KNP YM+GS G CR+SC C
Sbjct: 278 PSWAALGECEKNPVYMIGSPDYFGTCRKSCNAC 310
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 110/188 (58%), Positives = 140/188 (74%), Gaps = 2/188 (1%)
Query: 59 LINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKEN 118
+INLAK + +S+V D+ +G+S S VRTSSG F+ +GKD +I IE +IA + F+P EN
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSRVRTSSGMFLKRGKDKVIQTIEKRIADFAFIPVEN 60
Query: 119 GEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPP 178
GE +QVL YE GQKYEPHYDYF D+ N GG R+ATVLMYLSDV +GGET+FP A +
Sbjct: 61 GEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFP-AAKAN 119
Query: 179 RRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSAT 238
P N DLS CAKKG++VKP+RGDALLF+S+ +A DP SLH GCPVI G KWS+T
Sbjct: 120 FSSVPWYN-DLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSST 178
Query: 239 KWIHVDSF 246
KW+H++ +
Sbjct: 179 KWMHLEEY 186
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 121/265 (45%), Positives = 159/265 (60%), Gaps = 11/265 (4%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+++S +P+AF+Y GFL+ ECDHLI + LKRS V L DVRTS GTF+P
Sbjct: 1 VEKLSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDTGVLDDVRTSFGTFLP 60
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
K D ++ GIE ++ ++ + EN E +Q+L+Y GQ+Y+ H D + GG R+A
Sbjct: 61 KKYDDVLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGLTSP----NGGRRIA 116
Query: 155 TVLMYLSDVAKGGETVFPNAEEPP--RRRTPATNDDLSECA---KKGIAVKPRRGDALLF 209
TVLM+L + KGGET FP + P +R D+LS+CA +G+AVKPRRGDA+LF
Sbjct: 117 TVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDELSDCAWRDGRGLAVKPRRGDAVLF 176
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNN-ASCERWAA 268
FS N D S H+ CP + G KW+ATKWIH FD V C D A+C WA
Sbjct: 177 FSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEKRFDTGVWREPKCVDEEPANCPGWAK 236
Query: 269 LGECTKNPEYMVGSAQLPGFCRRSC 293
GEC NP YM+G + PG C RSC
Sbjct: 237 SGECANNPAYMLG-GETPGKCLRSC 260
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/264 (48%), Positives = 155/264 (58%), Gaps = 14/264 (5%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRS-AVADNLSGESKLSDVRTSSGTFI 93
V ISW+PRAFV FLTD E H+ ++A+ ++RS VADN G S L D RTS GTFI
Sbjct: 1 VSVISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADN--GSSVLDDYRTSYGTFI 58
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+ ++A +ED++A T +P ED+QVLRY +GQ Y H D + RL
Sbjct: 59 NRYATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLEND------SPRL 112
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
ATVL+YLSD GGET FP A P P SEC K +A KPR+GDALLF+S+
Sbjct: 113 ATVLLYLSDPELGGETAFPLAWAHP--DMPKVFGPFSECVKNNVAFKPRKGDALLFWSVK 170
Query: 214 TNA-IPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGEC 272
+ DP+S H GCPVI G KW+AT W+H F EE DCTD + C +W A GEC
Sbjct: 171 PDGKTEDPLSEHEGCPVIRGVKWTATVWVHTKPFRP--EEWDDCTDRHKECPKWKAAGEC 228
Query: 273 TKNPEYMVGSAQLPGFCRRSCKVC 296
KN YM G A G CR SC VC
Sbjct: 229 EKNHGYMQGDANQVGSCRLSCGVC 252
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 115/254 (45%), Positives = 159/254 (62%), Gaps = 10/254 (3%)
Query: 43 RAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIA 102
R F+ E FLTD E DH++ +++ +L+RS V +G S+ S +RTS G F+ +G+D ++
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVAT-NGGSEESQIRTSFGVFLERGEDPVVK 59
Query: 103 GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSD 162
G+E++I+ T +P NGE +QVLRY+ QKY+ H+DYF K I GG+R ATVLMYL D
Sbjct: 60 GVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMYLVD 119
Query: 163 VAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVS 222
+GGETVFPN P N SECA+ +A KP++G A+LF S+ + S
Sbjct: 120 TEEGGETVFPNIAAP-----GGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKS 174
Query: 223 LHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGS 282
LH+ CPVI+G KWSA KWIHV + + G C D++ C WA GEC +N +M+G+
Sbjct: 175 LHTACPVIKGIKWSAAKWIHVKPQN--LPPG--CEDSDEMCPDWAEAGECERNASFMIGT 230
Query: 283 AQLPGFCRRSCKVC 296
PG C SCK C
Sbjct: 231 RARPGKCVASCKRC 244
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 114/272 (41%), Positives = 173/272 (63%), Gaps = 16/272 (5%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV-RTS 88
++PS+V Q+SW+PRAF+Y GFL+D ECDHLI+LA + + A SG L + ++S
Sbjct: 52 VDPSRVIQLSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGNVVLKRLLKSS 111
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
G D + A IE +I+ WTFLPKEN E ++V++Y+ + + Y+YFS+K
Sbjct: 112 EGPLYID--DEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKF 168
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G +ATVL++LS+V +GGE FP +E + + LS+C + ++P +G+A+L
Sbjct: 169 GEPLMATVLLHLSNVTRGGELFFPESE--------SKSGILSDCTESSSGLRPVKGNAIL 220
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK----IVEEGGDCTDNNASCE 264
FF++H NA PD S ++ CPV+EGE W ATK+ H+ + + +GG+CTD + +C
Sbjct: 221 FFNVHPNASPDKSSSYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCP 280
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+WA++GEC +NP YM+GS G CR+SC VC
Sbjct: 281 KWASIGECQRNPIYMIGSPDYYGTCRKSCNVC 312
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 108/187 (57%), Positives = 140/187 (74%), Gaps = 2/187 (1%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ ECD+LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 102 ISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 161
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P E+GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 162 DKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLL 221
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A P N +LSECA+KG+AVKP+ GDALLF+S+ +A
Sbjct: 222 MYLSDVEEGGETIFPDANV-NSSSLPWYN-ELSECARKGLAVKPKMGDALLFWSMKPDAT 279
Query: 218 PDPVSLH 224
DP+SLH
Sbjct: 280 LDPLSLH 286
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 106/187 (56%), Positives = 140/187 (74%), Gaps = 2/187 (1%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ ECD+LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 102 ISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 161
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P E+GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 162 DKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLL 221
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+A + ++LSECA+KG+AVKP+ GDALLF+S+ +A
Sbjct: 222 MYLSDVEEGGETIFPDANV--NSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDAT 279
Query: 218 PDPVSLH 224
DP+SLH
Sbjct: 280 LDPLSLH 286
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/275 (44%), Positives = 160/275 (58%), Gaps = 15/275 (5%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+Q+ PRA+ + FLT E HL+ +A +LKRS V E + D+RTS G FI
Sbjct: 19 VQQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKG-EGVVDDIRTSYGMFIR 77
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ D ++ IE +I+ WT LP E+ EDIQ+LRY HGQ Y HYD + + V RLA
Sbjct: 78 RLSDPVVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASS-DHVGPKWRLA 136
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTP-ATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
T LMYLSDV +GGET FP+ P D S+CAK +A KP+ GDA+LF+S +
Sbjct: 137 TFLMYLSDVEEGGETAFPHNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLFYSFY 196
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF--DKIV----------EEGGDCTDNNA 261
N DP S+H+GCPVI+G KW+A W+H F ++I + G CTD +A
Sbjct: 197 PNNTMDPASMHTGCPVIKGVKWAAPVWMHDIPFRPEEISGMTQHNMDRDPDAGTCTDLHA 256
Query: 262 SCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C WAA GEC N YM G + G CR+SCKVC
Sbjct: 257 RCTEWAAAGECENNKAYMCGGSNNLGACRKSCKVC 291
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 113/271 (41%), Positives = 164/271 (60%), Gaps = 24/271 (8%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFV L++ EC+ ++ +AK +KRS V D+++GE K +RTS TF+ +GK
Sbjct: 83 ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIKTDPIRTSKQTFLARGK 142
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR-----GGHR 152
++ +E++++ +T LP NGED+Q+L Y G+KY H+D + GG R
Sbjct: 143 YPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSADGGQR 202
Query: 153 LATVLMYLSDVAKGGETVFPNAE--EPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
+ATVL+YL D +GGET FP++E EP + SECAK G+A KP+RGD LLFF
Sbjct: 203 VATVLLYLQDTEEGGETAFPDSEWIEP---ESEYAQQKFSECAKNGVAFKPKRGDGLLFF 259
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD------KIVEEGGDCTDNNASCE 264
S+ D S+H+GCPV++G KW+ATKWIH F K +EG C + + C+
Sbjct: 260 SITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPFHYKLPNPKPPKEG--CENTDERCK 317
Query: 265 RWAALGECTKNPEYMVGSAQLPGFCRRSCKV 295
WA GEC +NP +M + C+ +C+V
Sbjct: 318 GWANAGECERNPGFMTKN------CKWACRV 342
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/283 (43%), Positives = 172/283 (60%), Gaps = 23/283 (8%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAK----SQLKRSAVADNL 76
SF+S++ I+ PS+ K+++W PR F+YEGFL+ +ECDHL+ +A+ S L +A A N+
Sbjct: 46 SFASSSHIDFDPSRSKRLAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNI 105
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
+ S +D R + KD +++ IED+I+ W+F+PKE+GE +Q+L+Y Q
Sbjct: 106 TQNS--TDARFKFQ--LADSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS---- 157
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
D+ D GG+RL T+LMYLSDV +GGETVFP +E + T A LSECA G
Sbjct: 158 -DHNKDGTQSSSGGNRLVTILMYLSDVKQGGETVFPRSE---LKDTQAKEGALSECA--G 211
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK---IVEEG 253
AVKP +GDA+L F+L + + D S + C V+EGEKW A K +H+ DK +
Sbjct: 212 YAVKPVKGDAILLFNLRPDGVTDSDSHYEDCSVLEGEKWLAIKHLHISKIDKSRSSLPSE 271
Query: 254 GDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
CTD + C WAA GEC NP +M+GS G CR+SC C
Sbjct: 272 DLCTDEDDKCVSWAAAGECYSNPVFMIGSPDYYGTCRKSCHAC 314
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 219 bits (559), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 106/172 (61%), Positives = 131/172 (76%), Gaps = 9/172 (5%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+VK +SW PR FVY+GFL+D ECDHL+ LAK +++RS VADN SG+S S+VRTSS
Sbjct: 40 FNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSS 99
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE++IA WTFLP+EN E++QVLRYE GQKYEPH+DYF D+VN RG
Sbjct: 100 GMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARG 159
Query: 150 GHRLATVLMYLSDVAKGGETVFPNA---EEPPRRRTPATNDDLSECAKKGIA 198
GHR ATVLMYLS V +GGETVFPNA E P+ T SECA KG+A
Sbjct: 160 GHRYATVLMYLSTVREGGETVFPNAKGWESQPKDAT------FSECAHKGLA 205
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/271 (45%), Positives = 151/271 (55%), Gaps = 17/271 (6%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRS-AVADNLSGESKLSDVRTSSGTFI 93
V ISW+PRAFV FLT+ EC H+ +LA+ ++RS VADN G S L D RTS GTFI
Sbjct: 1 VSVISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADN--GSSVLDDYRTSYGTFI 58
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+ + +IA +ED++A T P ED+QVLRY GQ Y H D + R+
Sbjct: 59 NRYQTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLEND------SPRM 112
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
ATVL+YLS+ GGET FP A S+C K +A KPRRGDALLF+S+
Sbjct: 113 ATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWSVK 172
Query: 214 TNA-IPDPVSLHSGCPVIEGEKWSATKWIHVDSF-------DKIVEEGGDCTDNNASCER 265
+ DP S H GCPVI G KW+AT W+H F G CTD +A C R
Sbjct: 173 PDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQPFRPEDFPPQPRSRLSGLCTDRHAECPR 232
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WA GEC N YM G A G CRR+C VC
Sbjct: 233 WARAGECDNNSNYMKGDANQVGSCRRTCGVC 263
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 106/172 (61%), Positives = 130/172 (75%), Gaps = 9/172 (5%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
N S+VK +SW PR FVY+GFL+D ECDHL+ LAK + +RS VADN SG+S S+VRTSS
Sbjct: 40 FNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSEVRTSS 99
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ K +D +++ IE++IA WTFLP+EN E++QVLRYE GQKYEPH+DYF D+VN RG
Sbjct: 100 GMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARG 159
Query: 150 GHRLATVLMYLSDVAKGGETVFPNA---EEPPRRRTPATNDDLSECAKKGIA 198
GHR ATVLMYLS V +GGETVFPNA E P+ T SECA KG+A
Sbjct: 160 GHRYATVLMYLSTVREGGETVFPNAKGWESQPKDAT------FSECAHKGLA 205
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/177 (58%), Positives = 130/177 (73%), Gaps = 6/177 (3%)
Query: 97 KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATV 156
+D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N GGHR+ATV
Sbjct: 193 QDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATV 252
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
LMYLS+V KGGET+FPNAE + ++ S+CA+ G AVKP +GDALLFFSLH +A
Sbjct: 253 LMYLSNVEKGGETIFPNAEG---KLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 309
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGD---CTDNNASCERWAALG 270
D SLH CPVIEG+KWSATKWIHV SFD V++ G C D+N C +WAA+
Sbjct: 310 TTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNILCPQWAAVA 366
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 112/223 (50%), Positives = 147/223 (65%), Gaps = 24/223 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKS-QLKRSAVADNLSGESKLSDVRTS 88
+ S+ +SW+PRAF+YEGFL+D ECDHLI+LAK ++++S V D SGES S VRTS
Sbjct: 37 FDASRAVDVSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTS 96
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLP-----------------KENGEDIQVLRYEHGQ 131
SG F+ K +D ++A IE++IA WT LP ENGE +Q+LRY G+
Sbjct: 97 SGMFLDKKQDEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGE 156
Query: 132 KYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
KYEPH+DY S + R G R+ATVLMYLS+V K G+++ P A R + ++ S+
Sbjct: 157 KYEPHFDYISGRQGSTREGDRVATVLMYLSNV-KMGDSLLPQA-----RLSQPKDETWSD 210
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEK 234
CA++G AVKP +G A+LFFSLH NA D SLH CPVIEGEK
Sbjct: 211 CAEQGFAVKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEK 253
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 116/279 (41%), Positives = 168/279 (60%), Gaps = 26/279 (9%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
SF+S++ I+ PS+ K+++W PR F+YEGFL+ +ECDHL+ +A+ ++ S + + +G
Sbjct: 46 SFASSSHIDFDPSRSKRLAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVN--AGAR 103
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
++ T D +++ IED+I+ W+F+PKE+GE +Q+L+Y Q D+
Sbjct: 104 NITQNST---------DDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHN 149
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
D GG+RL T+LMYLSDV +GGETVFP +E + T A LSECA G AVK
Sbjct: 150 KDGTQSSSGGNRLVTILMYLSDVKQGGETVFPRSE---LKDTQAKEGALSECA--GYAVK 204
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK---IVEEGGDCT 257
P +GDA+L F+L + + D S + C V+EGEKW A K +H+ DK + CT
Sbjct: 205 PVKGDAILLFNLRPDGVTDSDSHYEDCSVLEGEKWLAIKHLHISKIDKSRSSLPSEDLCT 264
Query: 258 DNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
D + C WAA GEC NP +M+GS G CR+SC C
Sbjct: 265 DEDDKCVSWAAAGECYSNPVFMIGSPDYYGTCRKSCHAC 303
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 217 bits (552), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 106/193 (54%), Positives = 142/193 (73%), Gaps = 2/193 (1%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAF+Y FL+ EC++LI+LAK +K+S V D +G SK S VRTSSG F+ +G+
Sbjct: 113 LSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLGRGQ 172
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +I+ +TF+P ENGE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 173 DKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQRIATLL 232
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGET+FP+++ +P N +LSECAKKG+AVKP+ GDALLF+S+ +
Sbjct: 233 MYLSDVEEGGETIFPSSKA-NSSSSPFYN-ELSECAKKGLAVKPKMGDALLFWSMRPDGS 290
Query: 218 PDPVSLHSGCPVI 230
D SLH P++
Sbjct: 291 LDATSLHGEIPIL 303
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 120/282 (42%), Positives = 162/282 (57%), Gaps = 26/282 (9%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKD 98
S KP+A++ FL+ ECDHL+ LAK +L S V + G S S++RTS+G F+ K +D
Sbjct: 87 SKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGD-GGSSVASEIRTSAGMFLRKSQD 145
Query: 99 AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV--RGGHRLATV 156
+ IE++IA + +P +NGE +Q+LRY+ GQKY+PH+DYF DKVN RGG R+ATV
Sbjct: 146 DTVREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHDKVNPAPKRGGQRVATV 205
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDL------SECAKKGIAVKPRRGDALLFF 210
L+YL D +GGET FPN P ++ ++CAK GI VK RGDA+LFF
Sbjct: 206 LIYLVDTEEGGETTFPNGRLPENFEEDEPDNPFAAHIKHTDCAKNGIPVKSVRGDAILFF 265
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVE------------EGGDCTD 258
S+ + D SLH CPVI G+KW+A KW+ V FD + E C D
Sbjct: 266 SMTKDGELDHGSLHGACPVIAGQKWTAVKWLRVAKFDGGFKDELPMIPLTRRTEREPCVD 325
Query: 259 NNASCERWAALGECTKNPEYM----VGSAQLPGFCRRSCKVC 296
C WA G C +NPE+M + P C +SC +C
Sbjct: 326 EWDDCASWARDGWCERNPEFMKFAGARDSHTPA-CPKSCGLC 366
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 109/265 (41%), Positives = 155/265 (58%), Gaps = 16/265 (6%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+++ +SW PR F+Y FL+D EC H+ A +KRS+V +G S L +RTS GTFI
Sbjct: 1 RIETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVV-GTNGSSVLDTIRTSYGTFI 59
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+ D ++ + ++A WT P EN ED+QVLRY GQKY H D +++ R+
Sbjct: 60 RRRHDPVVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMD------SLIDDSPRM 113
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
ATVL+YL D GGET FP++ + SECA+ +A +P++GDAL+F+S+
Sbjct: 114 ATVLLYLHDTEYGGETAFPDSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFWSIK 173
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHV--DSFDKIVEEGGDCTDNNASCERWAALGE 271
+ DP+SLH+GCPV+ G KW+AT W+H ++D + G CTD + C+ W +GE
Sbjct: 174 PDGTHDPLSLHTGCPVVTGVKWTATSWVHSMPYNYDDYFKPGA-CTDLHDQCKHWERMGE 232
Query: 272 CTKNPEYMVGSAQLPGFCRRSCKVC 296
C KNP YM C RSC C
Sbjct: 233 CKKNPAYM------ESHCGRSCGAC 251
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 114/275 (41%), Positives = 165/275 (60%), Gaps = 32/275 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ +PSK K++SW PR F+YEGFL+D+ECDHL+++ + ++ S + S +++
Sbjct: 54 VFDPSKSKRLSWHPRIFLYEGFLSDMECDHLVSMGRGNMESSLAFTDGDRNSSYNNI--- 110
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQ----KYEPHYDYFSDKV 144
+D +++ IED+I+ W+FLPKENGE IQVL+Y + K EP
Sbjct: 111 --------EDIVVSKIEDRISLWSFLPKENGESIQVLKYGVNRSGSIKEEPKSS------ 156
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
G HRLAT+LMYLSDV +GGETVFP +E + A S+C+ G AV+P +G
Sbjct: 157 ---SGAHRLATILMYLSDVKQGGETVFPRSE---MKDAQAKEGAPSQCS--GYAVRPAKG 208
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD---KIVEEGGDCTDNNA 261
+A+L F+L + D S + CPV+EGEKW A K I++ FD + +CTD +
Sbjct: 209 NAILLFNLRPDGETDKDSQYEECPVLEGEKWLAIKHINLRKFDYPKSSLASEDECTDEDD 268
Query: 262 SCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C WAA GEC +NP +M+GS+ G CR+SC+VC
Sbjct: 269 RCVSWAASGECDRNPVFMIGSSDYYGSCRKSCRVC 303
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 116/270 (42%), Positives = 159/270 (58%), Gaps = 14/270 (5%)
Query: 31 NPSK--VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
N SK ++ +SW PRAF+Y FL+ E HL++L + ++ RS V +G ++SD+RTS
Sbjct: 62 NSSKPWIEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTG--RVSDIRTS 119
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
GTFIPK D ++ IED+ A ++ +P + E +Q+LRY GQKY H D +
Sbjct: 120 FGTFIPKKYDEVLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISE----N 175
Query: 149 GGHRLATVLMYLSDVAKGGET--VFPNAEEPPRRRTPATNDDLSEC---AKKGIAVKPRR 203
GG R+AT+LM+L + +GGET V N + R T D S+C + KG AVKP+
Sbjct: 176 GGKRIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDCGYRSGKGFAVKPKV 235
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASC 263
GDA+LFFS I D S+H+ CP + G KW+AT WIH FD DC D + C
Sbjct: 236 GDAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHERPFDTATWRKPDCKDLHQEC 295
Query: 264 ERWAALGECTKNPEYMVGSAQLPGFCRRSC 293
WA GEC KNP YM+G+ ++ G C RSC
Sbjct: 296 ANWANRGECKKNPIYMLGN-EVVGTCSRSC 324
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 117/280 (41%), Positives = 161/280 (57%), Gaps = 22/280 (7%)
Query: 24 FSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS 83
F S + + +++ ISWKPRA VY FL+D E H+I+LA Q+KRS V N E +
Sbjct: 30 FQSLSQLPTCRIQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGN-KNEGVVD 88
Query: 84 DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK 143
D+RTS GTF+ + +D +I IE+++A W+ +P + ED+QVLRY KY PH D
Sbjct: 89 DIRTSYGTFLRRAQDPVIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHID----- 143
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
G R+ATVLMYL + G + +A E A + S CAK +A KP+R
Sbjct: 144 -----GLERVATVLMYLVGESPGPDLAPVSACE----CMYAEQSNPSACAKGHVAYKPKR 194
Query: 204 GDALLFFSLHTN-AIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI------VEEGGDC 256
GDAL+FF + + D S+H+GCPV+ G KW+A KWIH F ++ + + G C
Sbjct: 195 GDALMFFDVKPDYTTTDGHSMHTGCPVVAGVKWNAVKWIHGTPFRRMRRNKPPLPDPGVC 254
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
TD + C+ WA GEC NP YM+GS G CR +CK C
Sbjct: 255 TDLHEMCDTWARAGECQNNPGYMLGSNTGIGNCRLACKDC 294
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 98/169 (57%), Positives = 131/169 (77%), Gaps = 3/169 (1%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+PS+V Q+SW+PRAF+++GFL+D ECDHLI LAK +L++S VADN SG+S S+VRTSS
Sbjct: 31 FDPSRVVQLSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSS 90
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G F+ + +D ++ IE++I+ WTFLP ENGE IQ+L Y++G+KYEPHYDYF DK N G
Sbjct: 91 GMFLERKQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALG 150
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
GHR+ATVLMYLS+V KGGET+FPNAE + ++ S+CA+ G A
Sbjct: 151 GHRIATVLMYLSNVEKGGETIFPNAE---GKLLQPKDNTWSDCARNGYA 196
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 94/147 (63%), Positives = 121/147 (82%)
Query: 28 AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
+P++V Q+SW+PRAF+Y GFL+ ECDHL+NLAK ++++S VADN SG+S +S VRT
Sbjct: 30 GFYDPARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRT 89
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
SSGTF+ K +D I++GIE ++A WTFLP+EN E IQ+L YE GQKY+ H+DYF DK N+
Sbjct: 90 SSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLK 149
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNA 174
RGGHR+ATVLMYL+DV KGGETVFPNA
Sbjct: 150 RGGHRVATVLMYLTDVKKGGETVFPNA 176
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 112/271 (41%), Positives = 164/271 (60%), Gaps = 24/271 (8%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ PSK K++SW PR F+YEGFL+D+ECDHL++ + + S + S +++
Sbjct: 300 LVYPSKSKRLSWHPRIFLYEGFLSDMECDHLVSTGRGNMDSSLAFTDGDRNSSYNNI--- 356
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
+D +++ IED+I+ W+FLPKENGE+IQVL+Y ++ ++
Sbjct: 357 --------EDIVVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSST 403
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GGH LAT+L+YLSDV +GGETVFP +E + A S+C+ G AV+P +G+ALL
Sbjct: 404 GGHWLATILIYLSDVKQGGETVFPRSE---MKDAQAKEGAPSQCS--GYAVRPAKGNALL 458
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH---VDSFDKIVEEGGDCTDNNASCER 265
F+L + D S + CPV+EGEKW A K IH +DS + +CTD + C
Sbjct: 459 LFNLRPDGEIDKDSQYEECPVLEGEKWLAIKHIHLRKLDSPKSSLASEDECTDEDDRCVS 518
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA GEC +NP +M+GS+ G CR+SC+VC
Sbjct: 519 WAASGECDRNPVFMIGSSDYYGSCRKSCRVC 549
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 104/205 (50%), Positives = 141/205 (68%), Gaps = 2/205 (0%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAF+Y FLT EC++LIN A + +S V DN SGE + RTS+ +
Sbjct: 83 VEVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETSYRTSTEYVVE 142
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+GKD I+ IE +IA TF+P E+GE + V+RY GQ YEPH DYF ++ ++V GG R+A
Sbjct: 143 RGKDKIVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQRIA 202
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T+LMYLS+V GGETVFP A P N +LSEC + G+++KP+ GDALLF+S+
Sbjct: 203 TMLMYLSNVEGGGETVFPIANA-NFSSVPWWN-ELSECGQTGLSIKPKMGDALLFWSMKP 260
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATK 239
+A DP++LH CPVI+G KWS TK
Sbjct: 261 DATLDPLTLHRACPVIKGNKWSCTK 285
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/227 (48%), Positives = 144/227 (63%), Gaps = 10/227 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+S PRA +Y FL+ EC+HLINLAK + RS V D ++GE K S RTSSG F+ +GK
Sbjct: 115 LSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTSSGMFLDRGK 174
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I+ IE +IA T +P ENGE + V+ Y GQK EPHYDY SD V GG R+ATVL
Sbjct: 175 DKIVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGPRVATVL 234
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFP+A+ +S+C+ G++VKP+ GDALLF+S+ +
Sbjct: 235 MYLSDVEEGGETVFPDAQ--------PNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGT 286
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDS--FDKIVEEGGDCTDNNAS 262
D SLH G PVI G KW++TKW+H+ G C + +A+
Sbjct: 287 LDTSSLHGGSPVIRGNKWASTKWLHLRECKLSGTTHAGNTCANKHAA 333
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 124/211 (58%), Gaps = 29/211 (13%)
Query: 50 FLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIA 109
F + EC+HLINLAK + RS V D L+G+ + S RTSSG F+ +GKD I+ IE +IA
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIA 431
Query: 110 TWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGET 169
T +P+ D + + + V GG R+ATVLMYLSDV +GGET
Sbjct: 432 DITSIPRM-ARDFML--------------FTAGGVVTKNGGPRVATVLMYLSDVEEGGET 476
Query: 170 VFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPV 229
VFPNA+ P N +S+ +KG++VKP+ GDALLF S+ + D SLH G PV
Sbjct: 477 VFPNAK-------PNIN-SVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPV 528
Query: 230 IEGEKWSATKWIHVDSFD------KIVEEGG 254
I G KW++TKW+H+ F +V++GG
Sbjct: 529 IRGNKWASTKWLHLTEFKVLGTALPVVDDGG 559
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 109/194 (56%), Positives = 137/194 (70%), Gaps = 2/194 (1%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ ISW+PRA VY FLT EC +LI LAK +++S V D +G+S S VRTSSG
Sbjct: 73 NERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSG 132
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ +G+D I IE +I+ +TF+P E+GE +QVL YE GQKYEPHYDYF D+ N GG
Sbjct: 133 TFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 192
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+ATVLMYLSDV +GGETVFP A + P N +LSEC K G++VKP+ GDALLF+
Sbjct: 193 QRIATVLMYLSDVEEGGETVFP-AAKGNYSAVPWWN-ELSECGKGGLSVKPKMGDALLFW 250
Query: 211 SLHTNAIPDPVSLH 224
S+ +A DP SLH
Sbjct: 251 SMTPDATLDPSSLH 264
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 109/194 (56%), Positives = 137/194 (70%), Gaps = 2/194 (1%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ ISW+PRA VY FLT EC +LI LAK +++S V D +G+S S VRTSSG
Sbjct: 74 NERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSG 133
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ +G+D I IE +I+ +TF+P E+GE +QVL YE GQKYEPHYDYF D+ N GG
Sbjct: 134 TFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 193
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+ATVLMYLSDV +GGETVFP A + P N +LSEC K G++VKP+ GDALLF+
Sbjct: 194 QRIATVLMYLSDVEEGGETVFP-AAKGNYSAVPWWN-ELSECGKGGLSVKPKMGDALLFW 251
Query: 211 SLHTNAIPDPVSLH 224
S+ +A DP SLH
Sbjct: 252 SMTPDATLDPSSLH 265
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 99/180 (55%), Positives = 136/180 (75%), Gaps = 2/180 (1%)
Query: 67 LKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLR 126
+ +S V D+ +G+SK S VRTSSG F+ +G+D +I IE +IA +TF+P ++GE +QVL
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLH 60
Query: 127 YEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATN 186
YE GQKYEPH+DYF D+ N GG R+AT+LMYLSDV +GGET+FP+A P N
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNA-SSLPWYN 119
Query: 187 DDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+LS+CAK+G++VKP+ GDALLF+S+ +A DP+SLH GCPVI+G KWS+TKW+H+ +
Sbjct: 120 -ELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 178
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 100/180 (55%), Positives = 135/180 (75%), Gaps = 2/180 (1%)
Query: 67 LKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLR 126
+ +S V D+ +G+SK S VRTSSG F+ +G+D +I IE +IA +TF+P ++GE +QVL
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLH 60
Query: 127 YEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATN 186
YE GQKYEPH+DYF D+ N GG R+AT+LMYLSDV +GGET+FP+A P N
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDA-NVNVSSLPWYN 119
Query: 187 DDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+LSECAK+G++VKP+ GDALLF+S+ +A DP+SLH GCPVI G KWS+TKW+H+ +
Sbjct: 120 -ELSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 178
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 115/264 (43%), Positives = 160/264 (60%), Gaps = 23/264 (8%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ ISW PRAF +TD ECD ++ LA+++++RS V D+ +GESK+ +RTS F+
Sbjct: 1 VEPISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTVVDSTTGESKVDPIRTSEQCFLN 60
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQ-----VLRYEHGQKYEPHYDY-----FSDKV 144
+G I++ IE ++ +T LP NGED+Q VL+Y +GQKY+ H+D S K
Sbjct: 61 RGHFPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELDTASGKQ 120
Query: 145 NIVRGGHRLATVLMYLSDVAK--GGETVFPNAE--EPPRRRTPATNDDLSECAKKGIAVK 200
GGHR+ATVL+YLSDV GGET FP++E +P R SECA+ +AVK
Sbjct: 121 LAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADR----GSGWSECAEDHVAVK 176
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV----DSFDKIVEEGGDC 256
P++GD LLF+S+ + D S+H+GCPV+ G+ W+ATKWIH F C
Sbjct: 177 PKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWIHARPFRHQFPPPPAAPPGC 235
Query: 257 TDNNASCERWAALGECTKNPEYMV 280
D A C+ WA GEC KNP +M+
Sbjct: 236 ADTVAMCKSWANSGECKKNPGFML 259
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 145/216 (67%), Gaps = 14/216 (6%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQ---LKRSAVADNLSGESKLSDVRTSSGT 91
++QISW+PRAFVY FLT EC HL+NLAK+ LKR+ VAD +G + SG
Sbjct: 2 IEQISWEPRAFVYHNFLTPEECAHLVNLAKATDGGLKRATVADARTGGTF-----PGSGA 56
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-KVNIVRGG 150
F+ + D I+ IE++I+ + +P ++GE +++LRY G+KY+PH+DYF D N+ G
Sbjct: 57 FLLRNHDPIVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFYG 116
Query: 151 HRLATVLMYLSDVAKGGETVFPNAE---EPPRR--RTPATNDDLSECAKKGIAVKPRRGD 205
R+ATVLMYLSDV GGETVFP EP R +++ D S+CAK + VKPRRGD
Sbjct: 117 QRVATVLMYLSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRRGD 176
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
ALLF + H N DP SLH+GCPV+ GEKW+ATKW+
Sbjct: 177 ALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWM 212
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 115/267 (43%), Positives = 155/267 (58%), Gaps = 32/267 (11%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
KP+A+++ FL+ ECDHL+ LAK++L S V G S S +RTS+G F+ K D
Sbjct: 285 KPKAYLFRNFLSAEECDHLMKLAKAELAPSTVV-GAGGTSVPSTIRTSAGMFLRKAADKT 343
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV--RGGHRLATVLM 158
+ IE +IA + P+ NGE +Q+LRY+ GQKY+PH+DYF D VN RGG R+AT+L+
Sbjct: 344 LENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKRGGQRMATMLI 403
Query: 159 YLSDVAKGGETVFPNAEEPPRRRTPATNDDL---------SECAKKGIAVKPRRGDALLF 209
YL + +GGET+FP R T A DL SEC K G+ VK +GDALLF
Sbjct: 404 YLENTKEGGETIFP-------RGTRAETFDLTEEGNPHEWSECTKHGLPVKSVKGDALLF 456
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI-------------VEEGGDC 256
+SL + D SLH CPV++G+KW+A KWI V FD + E+ G C
Sbjct: 457 WSLTDDYKLDMGSLHGACPVVKGQKWTAVKWIRVAKFDGMFTSPLPMPALSRRTEQHGKC 516
Query: 257 TDNNASCERWAALGECTKNPEYMVGSA 283
D C +WA G C KN ++MV +
Sbjct: 517 VDEWDECAKWAKDGWCEKNKDFMVSNG 543
>gi|356559784|ref|XP_003548177.1| PREDICTED: uncharacterized protein LOC100795761 [Glycine max]
Length = 264
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 114/284 (40%), Positives = 171/284 (60%), Gaps = 26/284 (9%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
++ +S + INPS+V QISW+PR F+Y+GFL+D ECD+L++LA AV + SG
Sbjct: 1 MLERSIHFSNRINPSRVVQISWQPRVFLYKGFLSDKECDYLVSLAY------AVKEKSSG 54
Query: 79 ESKLSD-VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHY 137
LS+ V TS +D I+A IE++++ W FLPKE + +QV+ Y Q +
Sbjct: 55 NGGLSEGVETSLDM-----EDDILARIEERLSVWAFLPKEYSKPLQVMHYGPEQNGR-NL 108
Query: 138 DYFSDKVNIVRGGHRLATVLMYLS-DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
DYF++K + G +AT+++YLS DV +GG+ +FP + P ++ S
Sbjct: 109 DYFTNKTQLELSGPLMATIILYLSNDVTQGGQILFPES-------VPGSSSWSSCSNSSN 161
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK----IVEE 252
I ++P +G+A+LFFSLH +A PD S H+ CPV+EG+ WSA K+ + + +
Sbjct: 162 I-LQPVKGNAILFFSLHPSASPDKSSFHARCPVLEGDMWSAIKYFYAKPISRGKVSATLD 220
Query: 253 GGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
GG+CTD + SC WAA+GEC +NP +M+GS G CR+SC C
Sbjct: 221 GGECTDEDDSCPAWAAVGECQRNPVFMIGSPDYYGTCRKSCNAC 264
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 112/280 (40%), Positives = 158/280 (56%), Gaps = 30/280 (10%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+++ +SW PR F+Y FLTD EC H+ A +KRS+V +G S ++RTS GTFI
Sbjct: 1 RIEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQ-NGSSVTDNIRTSYGTFI 59
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQ------------VLRYEHGQKYEPHYDYFS 141
+ D +I I ++A WT P EN ED+Q VLRY GQKY H D
Sbjct: 60 RRRHDPVIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMD--- 116
Query: 142 DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
+++ R+ATVL+YL D +GGET FP++ SECA+ +A +P
Sbjct: 117 ---SLIDDSPRMATVLLYLHDTEEGGETAFPDSSSWLTPDLATRMGPFSECAQGHVAFRP 173
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVD--SFDKIVE---EGGDC 256
++GDAL+F+S+ + DP+S+H+GCPV++G KW+AT W+H ++D+ + E G C
Sbjct: 174 KKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWVHSMPYAYDRYISHDGEPGAC 233
Query: 257 TDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
TD + C WAA GEC +NP YM C SCK C
Sbjct: 234 TDLHDMCTVWAAAGECDRNPVYM------STHCGPSCKTC 267
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 106/274 (38%), Positives = 156/274 (56%), Gaps = 17/274 (6%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
++ ++WKPR F+Y F+TD+E H+I LA Q+KRS V G+S RT +
Sbjct: 1 IEHVAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVV-GAGGQSVEDSYRTLYTAGVR 59
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ +D ++ IE+++A WT + + ED+Q+LRY GQ+Y+ H D D G R+A
Sbjct: 60 RYQDDVVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDD----EAGVRVA 115
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVL+YL++ GGET FP+++ + + S CAK +A P+RGDALLF+S+
Sbjct: 116 TVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWSIGP 175
Query: 215 NAIP-DPVSLHSGCPVIEGEKWSATKWIHVDSF-----------DKIVEEGGDCTDNNAS 262
+ D + H+GCPV+ G KW+ATKWIH F V + G C D +
Sbjct: 176 DGTTEDYHASHTGCPVLSGVKWTATKWIHAKPFRPQEMAAGRPHQPYVRDPGVCYDESPR 235
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C WAA G+C KN +YM+ +A PG CR++C C
Sbjct: 236 CAEWAARGDCEKNRDYMIVNAVSPGVCRKACGAC 269
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 98/180 (54%), Positives = 134/180 (74%), Gaps = 2/180 (1%)
Query: 67 LKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLR 126
+ +S V D+ +G+SK S VRTSSG F+ +G+D +I IE +I +TF+P ++GE +QVL
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRITDYTFIPVDHGEGLQVLH 60
Query: 127 YEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATN 186
YE GQKYEPH+DYF D+ N GG R+AT+LM+LSDV +GGET+FP+A P N
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDA-NVNDSSLPWYN 119
Query: 187 DDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+LSECAK+G++VKP+ GDALLF+S+ +A DP+SLH GCPVI G KWS+TKW+H+ +
Sbjct: 120 -ELSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 178
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 110/250 (44%), Positives = 150/250 (60%), Gaps = 14/250 (5%)
Query: 55 ECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFL 114
E DH++ +++ +L+RS V S+ S++RTS G F+ +G+D I+ +E++IA WT +
Sbjct: 9 EADHIVKVSERRLERSGVVGGDG-GSETSNIRTSYGVFLDRGEDEIVKRVEERIAAWTLM 67
Query: 115 PKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNA 174
P NGE +QVLRY+ QKY+ H+DYF K I GG+R ATVLMYL D +GGETVFPN
Sbjct: 68 PVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGITNGGNRYATVLMYLVDTEEGGETVFPNV 127
Query: 175 EEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEK 234
P N SECA+ +A KP++G A+LF S+ + SLH+ CPVI G K
Sbjct: 128 AAP-----GGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSLHTACPVIRGIK 182
Query: 235 WSATKWIH----VDSFDKIVEEGGD----CTDNNASCERWAALGECTKNPEYMVGSAQLP 286
WSA KWIH ++ + + D C D++ C WA GEC +N +MVGS P
Sbjct: 183 WSAAKWIHHAETIEQHPQPKVKPQDLPPGCEDSDEMCPEWADAGECERNASFMVGSRARP 242
Query: 287 GFCRRSCKVC 296
G C SCK C
Sbjct: 243 GKCVASCKRC 252
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/301 (41%), Positives = 167/301 (55%), Gaps = 27/301 (8%)
Query: 13 LLSFSLLIRKSFSSTAIINPSKV----KQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
L SLL+ + SS I+ + + +SW PRAFV F + E DH+I LA+ QL+
Sbjct: 6 LALISLLLAVAASSAGAIDTAAAHPWFEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLR 65
Query: 69 RSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYE 128
RS V + GES + + RTS G FI + D +++ +E ++ATWT + EDIQVLRY
Sbjct: 66 RSTVVGS-RGESVVDNYRTSYGMFIRRHHDEVVSTLEKRVATWTKYNVTHQEDIQVLRYG 124
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAE--EPPRRRTPATN 186
Q+Y+ H+D D R ATVL+YLSDV GGET FPN+E +P P
Sbjct: 125 TTQEYKAHFDSLDDD------SPRTATVLIYLSDVESGGETTFPNSEWIDP---ALPKAL 175
Query: 187 DDLSECAKKGIAVKPRRGDALLFFSLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIHVDS 245
SECA+ +A+KP+RGDA++F SL+ + D +LH+ CPVI G K+ A WIH
Sbjct: 176 GPFSECAQGHVAMKPKRGDAIVFHSLNPDGRSHDQHALHTACPVIVGVKYVAIFWIHTKP 235
Query: 246 FDKIVEEGG----------DCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKV 295
F +G DC D + C WAA GEC +NP +M G+A G CR SC
Sbjct: 236 FRPEQLKGPLAPEPPMVPEDCVDADPGCPGWAASGECDRNPGFMRGAATTLGTCRASCGD 295
Query: 296 C 296
C
Sbjct: 296 C 296
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 207 bits (526), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 105/209 (50%), Positives = 139/209 (66%), Gaps = 7/209 (3%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+S +PRA +Y FL+ EC+HLINLAK ++RS V D ++G+ L+ VRTSSGTF+ +GK
Sbjct: 88 LSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGILNSVRTSSGTFLERGK 147
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D I+ +E +IA T +P ENGE +Q++ YE GQK+EPHYDY + GG R+ATVL
Sbjct: 148 DKIVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRITNNGGPRVATVL 207
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYLSDV +GGETVFPNA+ P N KG+ VKP+ GDALLF+S+ +
Sbjct: 208 MYLSDVEEGGETVFPNAK-------PNFNSVSKYHPGKGLVVKPKMGDALLFWSVKPDGS 260
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH G PVI G KW++ K +H+ F
Sbjct: 261 LDTASLHGGSPVIRGSKWASNKLLHLTEF 289
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 207 bits (526), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 115/271 (42%), Positives = 162/271 (59%), Gaps = 25/271 (9%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ SK ++SW PR F+YEGFL+D+ECDHLI++A + + S V +G + S
Sbjct: 54 FDSSKSMRLSWHPRVFLYEGFLSDMECDHLISMAHGKKQSSLVVGGSAGNN-------SQ 106
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G I +D I++ IED+I+ W+FLPK+ GE +Q+L+YE + DY + + G
Sbjct: 107 GASI---EDTIVSTIEDRISVWSFLPKDFGESMQILKYEVNKS-----DYNNYESQSSSG 158
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
RL TVLMYLSDV +GGET FP +E + A SECA G AV+P RG+A+L
Sbjct: 159 HDRLVTVLMYLSDVKRGGETAFPRSELKGTKVELAAP---SECA--GYAVQPVRGNAILL 213
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD----KIVEEGGDCTDNNASCER 265
F+L + + D S + C V+EGE+W A K IH+ D +V E +CTD + C
Sbjct: 214 FNLKPDGVIDKDSQYEMCSVLEGEEWLAIKHIHLRKIDTPKSSLVSED-ECTDEDDRCVS 272
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA GEC +NP +M+G+ G CR+SC+VC
Sbjct: 273 WAAGGECDRNPIFMIGTPDYYGSCRKSCRVC 303
>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 109/270 (40%), Positives = 159/270 (58%), Gaps = 29/270 (10%)
Query: 27 TAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVR 86
+ ++P +V Q+SW+PR F+Y GFL++ E DHLI+L K + + G+++L
Sbjct: 50 SKFVDPRRVLQLSWQPRVFLYRGFLSEEESDHLISLRKDT--SEVTSGDADGKTQL---- 103
Query: 87 TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI 146
D ++AGIE+KI+ WTFLP+ENG I+V Y +K DYF ++ +
Sbjct: 104 -----------DPVVAGIEEKISAWTFLPRENGGSIKVRSY-TSEKSGKKLDYFGEEPSS 151
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
V LATV++YLS+ +GGE +FPN+E P++ C++ G ++P +G+A
Sbjct: 152 VLRESLLATVVLYLSNTTQGGELLFPNSEVKPKK----------SCSEDGNILRPVKGNA 201
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERW 266
+LFFS NA D S H CPV++GE ATK I+ + EE G+C+D + +CERW
Sbjct: 202 VLFFSRLLNASLDETSTHLICPVVKGELLVATKLIYAKKQAR-NEENGECSDEDENCERW 260
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A LGEC KNP YM+GS G CR+SC C
Sbjct: 261 ANLGECKKNPVYMIGSPDYYGTCRKSCNAC 290
>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 291
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 109/270 (40%), Positives = 160/270 (59%), Gaps = 29/270 (10%)
Query: 27 TAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVR 86
+ ++P++V Q+SW PR F+Y GFL++ ECDHLI+L K + +V + G+++L
Sbjct: 51 SKFVDPTRVLQLSWLPRVFLYRGFLSEEECDHLISLRKETTEVYSV--DADGKTQL---- 104
Query: 87 TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI 146
D ++AGIE+K++ WTFLP ENG I+V Y +K DYF ++ +
Sbjct: 105 -----------DPVVAGIEEKVSAWTFLPGENGGSIKVRSYTS-EKSGKKLDYFGEEPSS 152
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
V LATV++YLS+ +GGE +FPN+E P+ + C + G ++P +G+A
Sbjct: 153 VLHESLLATVVLYLSNTTQGGELLFPNSEMKPK----------NSCLEGGNILRPVKGNA 202
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERW 266
+LFF+ NA D S H CPV++GE ATK I+ +I EE G+C+D + +C RW
Sbjct: 203 ILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYAKKQARI-EESGECSDEDENCGRW 261
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
A LGEC KNP YM+GS G CR+SC C
Sbjct: 262 AKLGECKKNPVYMIGSPDYYGTCRKSCNAC 291
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/295 (40%), Positives = 162/295 (54%), Gaps = 44/295 (14%)
Query: 13 LLSFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAV 72
+L+FS L +T + ++V+ +SWKPRAFV+ F+T+ E DH++ LAK +KRS V
Sbjct: 16 VLTFSCL------ATPTTSTNRVEPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTV 69
Query: 73 ADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQK 132
G S +RTS GTF+ + +D I+ +E ++ATWT L + ED+Q+LRY GQK
Sbjct: 70 V-GAGGASVEDQIRTSYGTFLKRLQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQK 128
Query: 133 YEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAK--GGETVFPNAEEPPRRRTPATNDDLS 190
Y HYD + R+ TVL+YLSDV GGET FP RR+
Sbjct: 129 YGAHYDSLDND------SPRVCTVLLYLSDVPADGGGETAFPGV----RRQ--------- 169
Query: 191 ECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF---- 246
A+ P++GDALLF+SL + D SLH+GCP+I G KW+ATKWIH F
Sbjct: 170 -------ALYPKKGDALLFYSLKPDGTSDAYSLHTGCPIISGVKWTATKWIHTLPFRPHL 222
Query: 247 -----DKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
+ + +C D A C+ WA GEC N ++M G A G CR SC C
Sbjct: 223 LGKEQAEAIVYPEECKDAQADCKAWADAGECENNEQFMRGDAFTLGNCRASCGDC 277
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 92/153 (60%), Positives = 122/153 (79%)
Query: 17 SLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNL 76
S++ K+ +S+ +P++V Q+SW PR F+YEGFL+D ECDH I LAK +L++S VADN
Sbjct: 38 SVIKMKTSASSFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADND 97
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
SGES S+VRTSSG F+ K +D I++ +E K+A WTFLP+ENGE +Q+L YE+GQKYEPH
Sbjct: 98 SGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPH 157
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGET 169
+DYF D+ N+ GGHR+ATVLMYLS+V KGGET
Sbjct: 158 FDYFHDQANLELGGHRIATVLMYLSNVEKGGET 190
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 109/247 (44%), Positives = 150/247 (60%), Gaps = 22/247 (8%)
Query: 9 NFFFLLSFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLK 68
N F SL + T + K + ISW+PR + FL+ ECDHLINLA+ +L
Sbjct: 52 NQTFGSGLSLWANDEDARTLRVGLVKQEVISWQPRIILLHNFLSADECDHLINLARPRLV 111
Query: 69 RSAVADNLSGESKLSDVRTSSGTFIPKGKDA---IIAGIEDKIATWTFLPKENGEDIQVL 125
+S V D +G+ S VRTS+G F+ G D I IE +IA ++ +P +NGE +QVL
Sbjct: 112 KSTVVDATTGKGIESKVRTSTGMFL-NGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVL 170
Query: 126 RYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPAT 185
RYE Q Y+ H+DYFSD+ N+ RGG R+AT+LMYL++ +GGET+FP A +
Sbjct: 171 RYESDQYYKAHHDYFSDEFNLKRGGQRVATMLMYLTEGVEGGETIFPQAGDK-------- 222
Query: 186 NDDLSECA-----KKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKW 240
EC+ K G+ VKP+RGDA+LF+S+ + DP SLH GC V+ GEKWS+TKW
Sbjct: 223 -----ECSCGGEMKIGVCVKPKRGDAVLFWSIKLDGQVDPTSLHGGCKVLSGEKWSSTKW 277
Query: 241 IHVDSFD 247
+ +FD
Sbjct: 278 MRQRAFD 284
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 101/205 (49%), Positives = 131/205 (63%), Gaps = 5/205 (2%)
Query: 43 RAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIA 102
R +++ GFLT ECD++ A+ +L+RS V D SG S +SD+RTS G F +G+DAI+
Sbjct: 44 RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSDIRTSDGMFFERGEDAILE 103
Query: 103 GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSD 162
+E ++A WT P GE +QVLRY QKY+ H +YF K GG+R ATVL YL+D
Sbjct: 104 AVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLTYLTD 163
Query: 163 VAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVS 222
+GGETVFP P N SECAK +AVKPR+GDA+LF S+ TN + S
Sbjct: 164 TEEGGETVFPKIPAP-----GGVNVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLEERS 218
Query: 223 LHSGCPVIEGEKWSATKWIHVDSFD 247
LH CPVI+GEK+S TKWIH +D
Sbjct: 219 LHGACPVIKGEKFSMTKWIHAGHYD 243
>gi|356530852|ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 [Glycine max]
Length = 302
Score = 204 bits (518), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 117/312 (37%), Positives = 178/312 (57%), Gaps = 47/312 (15%)
Query: 10 FFFLLSFSL-----------------LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLT 52
FFFL++ SL ++ S + INPS+V QISW+PR F+Y+GFL+
Sbjct: 13 FFFLIATSLTESSRKELRSKQETALQMLEHSIHYSNRINPSRVVQISWQPRVFLYKGFLS 72
Query: 53 DLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWT 112
D ECD+L++LA AV + SG S+ TF+ +D I+A IE++++ W
Sbjct: 73 DKECDYLVSLAY------AVKEKSSGNGGFSE---GVETFL-DIEDDILARIEERLSLWA 122
Query: 113 FLPKENGEDIQVLRYEHGQKYEPH---YDYFSDKVNIVRGGHRLATVLMYLSDVA-KGGE 168
FLPKE + +QV+ Y EP+ DYF++K + G +AT+++YLS+ A +GG+
Sbjct: 123 FLPKEYSKPLQVMHYGP----EPNGRNLDYFTNKTQLELSGPLMATIVLYLSNAATQGGQ 178
Query: 169 TVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCP 228
+FP E PR + ++ + ++P +G+A+LFFSLH +A PD S H+ CP
Sbjct: 179 ILFP--ESVPRSSSWSSC------SNSSNILQPVKGNAILFFSLHPSASPDKNSFHARCP 230
Query: 229 VIEGEKWSATKWIHVDSFDK----IVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQ 284
V+EG WSA K+ + + +GG+CTD + +C WAA+GEC +NP +M+GS
Sbjct: 231 VLEGNMWSAIKYFYAKPISSGEVSAISDGGECTDEDDNCPAWAAMGECQRNPVFMIGSPD 290
Query: 285 LPGFCRRSCKVC 296
G CR+SC C
Sbjct: 291 YYGTCRKSCNAC 302
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 104/213 (48%), Positives = 139/213 (65%), Gaps = 16/213 (7%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ISW PR V+ FL+ ECD+L+ +A+ +L+ S V D +G+ SDVRTSSG F+
Sbjct: 56 KPEVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFV 115
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K ++ IE +I+ ++ +PKENGE IQVLRYE Q Y PH+DYFSD N+ RGG
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK---KGIAVKPRRGDALL 208
R+AT+LMYL+D GGET FP A D C KG+ VKP +GDA+L
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQA-----------GDGECSCGGNVVKGLCVKPNKGDAVL 224
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
F+S+ + DP S+HSGCPV++GEKWSATKW+
Sbjct: 225 FWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWM 257
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 105/221 (47%), Positives = 140/221 (63%), Gaps = 9/221 (4%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
+++IS PRA+VY FLT E + I A+ ++RS V + G SK SD RTSSG ++
Sbjct: 78 IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEVVNEADGTSKTSDERTSSGGWVS 137
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
++A IE ++A WT LP+ GE QV+RYE GQ+Y H DYF D+VN+ GG R A
Sbjct: 138 GEDSEVMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQRAA 197
Query: 155 TVLMYLSDVAKGGETVFPNAE----EPPRRRTPATNDDLSECAKKG----IAVKPRRGDA 206
TVLMYLSDV +GGETVFP P ++ T + E A +G +AVKPRRGDA
Sbjct: 198 TVLMYLSDVEEGGETVFPRGTPLGGAAP-EKSGVTQGNACERALRGDPNVLAVKPRRGDA 256
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
LLFF++H N D + H+GCPV+ G KW+AT+W HV + +
Sbjct: 257 LLFFNVHLNGEVDERARHAGCPVVRGTKWTATRWQHVGALN 297
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 105/264 (39%), Positives = 152/264 (57%), Gaps = 11/264 (4%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW PR F+ + FL+D EC+HLI L + +L+RS V N +S RTS GTF+
Sbjct: 36 VETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVV-NSDESGAVSTARTSFGTFVT 94
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ + +ED++A ++ +P E+ E +Q+LRY GQ+Y H+D + GG R+A
Sbjct: 95 RRLTETLQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISE----NGGKRIA 150
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTP--ATNDDLSECA---KKGIAVKPRRGDALLF 209
TVLM+L + GGET FP P + A D LSEC G +V P++G+A+LF
Sbjct: 151 TVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDKLSECGWNDGNGFSVIPKKGEAVLF 210
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAAL 269
FS H N DP + H+ CP + G K++ATKWIH + F+ + CTD C WA
Sbjct: 211 FSFHINGTNDPFANHASCPTLGGTKYTATKWIHENPFETGTAKTPTCTDETELCPVWAQG 270
Query: 270 GECTKNPEYMVGSAQLPGFCRRSC 293
EC +NP +M+G + G C +SC
Sbjct: 271 HECERNPVFMMGEESV-GACSKSC 293
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 103/215 (47%), Positives = 141/215 (65%), Gaps = 20/215 (9%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ISW PR ++ FL+ ECD+L+ +A+ +L+ S V D +G+ SDVRTSSG F+
Sbjct: 56 KPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKSDVRTSSGMFV 115
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K +I IE +I+ ++ +PKENGE IQVLRYE Q Y PH+DYFSD N+ RGG
Sbjct: 116 NSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA-----KKGIAVKPRRGDA 206
R+AT+LMYL+D +GGET F A + EC+ KG+ VKP +GDA
Sbjct: 176 RVATMLMYLTDGVEGGETHFLQAGD-------------GECSCGGNVVKGLCVKPNKGDA 222
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+LF+S+ + DP S+HSGCPV++GEKWSATKW+
Sbjct: 223 VLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWM 257
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 106/261 (40%), Positives = 151/261 (57%), Gaps = 19/261 (7%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW PRAF+Y+ FLT+ EC+HLI L + +L+RS V + E + RTS GTFI +
Sbjct: 94 LSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGDVHSARTSFGTFITRRL 153
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
++ +ED++A ++ +P + E +Q+LRYE GQ+Y G R+ATVL
Sbjct: 154 TPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYG-------------NGEKRIATVL 200
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTP--ATNDDLSECA---KKGIAVKPRRGDALLFFSL 212
M+L + GGET FP+A P R+ + LS+C +G +V PR+GDA+LFFS
Sbjct: 201 MFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEGRGFSVIPRKGDAILFFSH 260
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGEC 272
H N D + H+ CP + G K++ATKWIH FD E C D C++WA GEC
Sbjct: 261 HINGTSDDAASHASCPTLRGIKYTATKWIHEKEFDTTTFETPMCEDKEDMCDQWANSGEC 320
Query: 273 TKNPEYMVGSAQLPGFCRRSC 293
KNP +M+G + G C +SC
Sbjct: 321 EKNPVFMMG-IETVGSCSKSC 340
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 107/274 (39%), Positives = 162/274 (59%), Gaps = 18/274 (6%)
Query: 27 TAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVR 86
T I+ +V Q+SW+PR F+Y+GFLTD ECD LI+LA + S G+ ++++
Sbjct: 50 TNRISLLQVVQLSWRPRVFLYKGFLTDEECDRLISLAHGAKEISKG----KGDGSRNNIQ 105
Query: 87 TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI 146
+S D ++A IE++I+ WTF+PKEN + +QV+ Y + E H+DYF +K +
Sbjct: 106 LASSESRSHIYDDLLARIEERISAWTFIPKENSKPLQVMHYGIEEARE-HFDYFDNKT-L 163
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+ +AT+++YLS+V +GGE +FP +E + S+C K ++P +G+A
Sbjct: 164 ISNVSLMATLVLYLSNVTRGGEILFPKSE--------LKDKVWSDCTKDSSILRPVKGNA 215
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVE----EGGDCTDNNAS 262
+L F+ H NA D S H CPV+EGE W ATK V + ++ +G DCTD + +
Sbjct: 216 VLIFNAHLNASADSRSTHGRCPVLEGEMWCATKQFLVRATNEEKSLPDSDGSDCTDEDDN 275
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
C +WAALGEC +NP +M GS G CR+SC C
Sbjct: 276 CPKWAALGECQRNPIFMTGSPDYYGTCRKSCNAC 309
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 93/168 (55%), Positives = 125/168 (74%), Gaps = 3/168 (1%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
+P+ V Q+S +PRAF+Y GFL+D ECDHL++LAK +++S VADN SG+S S RTSSG
Sbjct: 31 DPASVTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSG 90
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ K +D I++ IE ++A WTFLP+EN E +QVLRYE GQKY+ H+DYF D+ N+ GG
Sbjct: 91 TFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGG 150
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
R+ATVLMYL+DV KGGE VFP+AE + T S+C++ G+A
Sbjct: 151 QRVATVLMYLTDVKKGGEAVFPDAEGSHLQYKDET---WSDCSRSGLA 195
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 143/229 (62%), Gaps = 19/229 (8%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
+F ++ P + ISW PR V+ FL+ ECD+L ++A+ +L+ S V D +G+
Sbjct: 52 AFLRLGLVKP---EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVK 108
Query: 83 SDVRTSSGTFIP--KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
S+VRTSSG F+ + K +I IE +I+ ++ +P+ENGE IQVLRYE Q Y PH+DYF
Sbjct: 109 SNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYF 168
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK---KGI 197
SD NI RGG R+AT+LMYL+D +GGET FP A D C KG+
Sbjct: 169 SDTFNIKRGGQRVATMLMYLTDGVEGGETHFPQA-----------GDGECSCGGKMVKGL 217
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
VKP +GDA+LF+S+ + D S+H GCPV+EGEKWSATKW+ F
Sbjct: 218 CVKPNKGDAVLFWSMGLDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 143/229 (62%), Gaps = 19/229 (8%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
+F ++ P + ISW PR V+ FL+ ECD+L ++A+ +L+ S V D +G+
Sbjct: 52 AFLRLGLVKP---EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVK 108
Query: 83 SDVRTSSGTFIP--KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
S+VRTSSG F+ + K +I IE +I+ ++ +P+ENGE IQVLRYE Q Y PH+DYF
Sbjct: 109 SNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYF 168
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK---KGI 197
SD NI RGG R+AT+LMYL+D +GGET FP A D C KG+
Sbjct: 169 SDTFNIKRGGQRVATMLMYLTDGVEGGETHFPQA-----------GDGECSCGGKMVKGL 217
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
VKP +GDA+LF+S+ + D S+H GCPV+EGEKWSATKW+ F
Sbjct: 218 CVKPNKGDAVLFWSMGLDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 133/201 (66%), Gaps = 9/201 (4%)
Query: 45 FVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGI 104
++ +D ECDHLI LA +L+RS+V D +G K S RTS G F+ + D I++GI
Sbjct: 1 LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGI 60
Query: 105 EDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVA 164
ED+I++ TF+PKE GE +QV+RY+ GQK+EPH DY+ N GGHR+ T+L+YL++V
Sbjct: 61 EDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLTNVE 120
Query: 165 KGGETVFPNAEEPPRRRTPATND---DLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPV 221
GGETVFP A ND + SEC KKGI ++PRRGD LLF+ + DP
Sbjct: 121 NGGETVFPRA------LANVINDYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPF 174
Query: 222 SLHSGCPVIEGEKWSATKWIH 242
S H GCPV++GEKW ATK++H
Sbjct: 175 SFHGGCPVVKGEKWLATKFLH 195
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 109/240 (45%), Positives = 148/240 (61%), Gaps = 42/240 (17%)
Query: 21 RKSFSSTAIINPSK-VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGE 79
R+SF N + ++ ISW+PRAFVY FLT+ EC+HLI+LAK + +S V D +G+
Sbjct: 65 RESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGK 124
Query: 80 S-------------------------------KLSDVRTSSGTFIPKGKDAIIAGIEDKI 108
S L VRTSSGTF+ +G D I+ IE++I
Sbjct: 125 SIDSRFCTLTSVVVFTFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDEIVEEIENRI 184
Query: 109 ATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGE 168
+ +TF+P ENGE +QVL YE GQ+YEPH+DYF D+ N+ +GG R+ATVLMYLSDV +GGE
Sbjct: 185 SDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGE 244
Query: 169 TVFP----NAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLH 224
TVFP N + P D+LS+C K+G++V P++ DALLF+S+ +A DP SLH
Sbjct: 245 TVFPAAKGNVSDVPWW------DELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLH 298
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 129/204 (63%), Gaps = 5/204 (2%)
Query: 43 RAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIA 102
R ++++GFLT ECD++ A+ +L+RS V D SG S +SD+RTS G F +G+DAII
Sbjct: 44 RIYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDGMFFERGEDAIIE 103
Query: 103 GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSD 162
+E ++A WT P GE +QVLRY QKY+ H+DYF K GG+R ATVL+YL++
Sbjct: 104 AVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRWATVLLYLTE 163
Query: 163 VAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVS 222
+GGETVFP P N SECAK +AVKP +GDALLF S+ + S
Sbjct: 164 TEEGGETVFPKIPAP-----NGINVGFSECAKYNLAVKPHKGDALLFHSMKPTGELEERS 218
Query: 223 LHSGCPVIEGEKWSATKWIHVDSF 246
+H CPVI GEK+S TKWIH +
Sbjct: 219 MHGACPVIRGEKFSMTKWIHAGHY 242
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 137/218 (62%), Gaps = 11/218 (5%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
+++IS PRA+V+ FLTD ECD +I A ++ S V D+ SGE++ D R+S G ++
Sbjct: 68 IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEARPDDARSSIGGWVS 127
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
D +I IE + +TW LP GE +QVLRYE GQKY+ H D+F D+ N+ GG R+A
Sbjct: 128 GDDDEVIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVKNGGQRVA 187
Query: 155 TVLMYLSDVAKGGETVFP-----NAEEPPRRRTPATNDDLSECAKKG----IAVKPRRGD 205
T+LMYLSDV +GGETVFP +P ++ T D+ E A + +AVKPRRGD
Sbjct: 188 TILMYLSDVEEGGETVFPLGTPLGGRDP--EKSGVTGDNACELASQNDPRVLAVKPRRGD 245
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
ALLFF+ H + D + H+GCPV G KW+ T+W V
Sbjct: 246 ALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHRV 283
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 141/225 (62%), Gaps = 19/225 (8%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL---SDVR 86
I K + ++W PR + FL+ ECD+LI +A +L +S V D +G+++ S VR
Sbjct: 55 IGLVKPEVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVR 114
Query: 87 TSSGTFIPK--GKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
TS+G F+ + +I IE +IA ++ +P ENGE +QVLRYE Q Y+PH+DYFSD+
Sbjct: 115 TSTGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQF 174
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA---KKGIAVKP 201
N+ RGG R+ATVLMYLSDV +GGET+F P+ D EC +KG+ VKP
Sbjct: 175 NLKRGGQRVATVLMYLSDVEEGGETIF-----------PSVGDGECECGGELRKGLCVKP 223
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
R+GDA+LF+S + D SLH GC V+ GEKWSATKW+ F
Sbjct: 224 RKGDAILFWSAALDGNVDSNSLHGGCSVLRGEKWSATKWLRQSRF 268
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 118/297 (39%), Positives = 163/297 (54%), Gaps = 38/297 (12%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA--DNLSGES--KLSDV 85
+ ++V+++S PRA+++ FLT EC HLI ++ LKRS V D L GE+ + SD
Sbjct: 78 VGDARVEKLSDSPRAYLFREFLTKEECAHLIEISTPHLKRSTVVGDDALLGEADGRRSDY 137
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQ---VLRYEHGQKYEPHYDYFSD 142
RTS+G F+PK D ++ +E ++ ++ LP EN E +Q +LRYE GQ+Y H D F+
Sbjct: 138 RTSTGAFLPKLYDDVVTRVERRVEAFSRLPFENQEQLQARSLLRYELGQEYRDHVDGFAT 197
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAE----EPPRRRTPATNDDLSECA----- 193
+ GG R+ATVLM+L++ +GGET FPN E R +LS+CA
Sbjct: 198 E----NGGKRVATVLMFLAEPEEGGETAFPNGEPSEAVAARVAAQRARGELSDCAWRGGG 253
Query: 194 ----------KKGIAVKPRRGDALLFFSLHTNAIPDP-------VSLHSGCPVIEGEKWS 236
+G AVKPR GDA+LFFS + S H+ CP G KW+
Sbjct: 254 GGTAGGGRGNLRGFAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTHASCPTTRGVKWT 313
Query: 237 ATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSC 293
ATKWIH +F E +C D + C WA GEC KNP +M+G A PG C +SC
Sbjct: 314 ATKWIHERAFATGTWETPECVDRDDGCAGWARGGECAKNPGFMLGEAT-PGSCLKSC 369
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 102/215 (47%), Positives = 137/215 (63%), Gaps = 20/215 (9%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ISW PR V+ FL+ ECD+L +A+ +L+ S V D +G+ SDVRTSSG F+
Sbjct: 59 KPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRTSSGMFV 118
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K +I IE +I+ ++ +P ENGE IQVLRYE Q Y PH+DYFSD N+ RGG
Sbjct: 119 NSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSDTFNLKRGGQ 178
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA-----KKGIAVKPRRGDA 206
R+AT+LMYL+D +GGET FP A + EC +G+ VKP +GDA
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGD-------------GECICGGRLVRGLCVKPNKGDA 225
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+LF+S+ + D SLHSGC V++GEKWSATKW+
Sbjct: 226 VLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKWM 260
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 140/221 (63%), Gaps = 19/221 (8%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL---SDVR 86
I K + ++W PR + FL+ ECD+LI +A +L +S V D +G+++ S VR
Sbjct: 54 IGLVKPEVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVR 113
Query: 87 TSSGTFIPK--GKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
TS+G F+ + +I IE +IA ++ +P ENGE +QVLRYE Q Y+PH+DYFSD+
Sbjct: 114 TSTGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQF 173
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA---KKGIAVKP 201
N+ RGG R+ATVLMYLSDV +GGET+F P+ D EC +KG+ VKP
Sbjct: 174 NLKRGGQRVATVLMYLSDVEEGGETIF-----------PSVGDGECECGGELRKGLCVKP 222
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R+GDA+LF+S + D SLH GC V+ GEKWSATKW+
Sbjct: 223 RKGDAILFWSAALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 137/215 (63%), Gaps = 20/215 (9%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ISW PR V+ FL+ ECD L +A+ +L+ S V D +G+ SDVRTSSG F+
Sbjct: 59 KPEVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSDVRTSSGMFV 118
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K +I IE +I+ ++ +P ENGE IQVLRYE Q Y PH+DYFSD N+ RGG
Sbjct: 119 NSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQ 178
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA-----KKGIAVKPRRGDA 206
R+AT+LMYL+D +GGET FP A + EC+ +G+ VKP +GDA
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGD-------------GECSCGGRIVRGLCVKPNKGDA 225
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+LF+S+ + D S+HSGC V++GEKWSATKW+
Sbjct: 226 VLFWSMGLDGNTDSNSIHSGCAVLKGEKWSATKWM 260
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 137/218 (62%), Gaps = 16/218 (7%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I K + +SW PR V FL+ EC++L +A+ +L+ S V D +G+ SDVRTSS
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSS 131
Query: 90 GTFIP--KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
G F+ + + II IE +IA ++ +P ENGE IQVLRYE Q Y+PH+DYF+D N+
Sbjct: 132 GMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFNLK 191
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA---KKGIAVKPRRG 204
RGG R+AT+LMYL+D +GGET FP A D C KGI+VKP +G
Sbjct: 192 RGGQRVATMLMYLTDDVEGGETYFPLA-----------GDGDCTCGGKIMKGISVKPTKG 240
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
DA+LF+S+ + DP S+H GC V+ GEKWSATKW+
Sbjct: 241 DAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278
>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
Length = 311
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/271 (38%), Positives = 159/271 (58%), Gaps = 14/271 (5%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+PS+V Q+SW+PR F+Y+GFL+D ECDHLI+LA + + SG + +++ SS
Sbjct: 51 IDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSS 110
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G + D I+A IE+++A WT LPK++ Q+++Y G++ + Y Y + +
Sbjct: 111 GVIL-NTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSS 168
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
+ATV++YLSD A GGE +FP ++ + S KK ++P +G+A+LF
Sbjct: 169 EPLMATVVLYLSDSASGGEILFPESK--------VKSKFWSGRRKKNNFLRPVKGNAILF 220
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV----DSFDKIVEEGGDCTDNNASCER 265
FS+H NA PD S H P+ +GE W ATK++++ + I + C D + SC +
Sbjct: 221 FSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQ 280
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA+GEC +N +MVGS G CR+SC C
Sbjct: 281 WAAIGECERNAVFMVGSPDYYGTCRKSCNAC 311
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 137/218 (62%), Gaps = 16/218 (7%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I K + +SW PR V FL+ EC++L +A+ +L+ S V D +G+ SDVRTSS
Sbjct: 72 IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSS 131
Query: 90 GTFIPKGKDA--IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
G F+ + + II IE +IA ++ +P ENGE IQVLRYE Q Y+PH+DYF+D N+
Sbjct: 132 GMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFNLK 191
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA---KKGIAVKPRRG 204
RGG R+AT+LMYL+D +GGET FP A D C KGI+VKP +G
Sbjct: 192 RGGQRVATMLMYLTDDVEGGETYFPLA-----------GDGDCTCGGKIMKGISVKPTKG 240
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
DA+LF+S+ + DP S+H GC V+ GEKWSATKW+
Sbjct: 241 DAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/212 (47%), Positives = 137/212 (64%), Gaps = 14/212 (6%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + +SW PR + FL+ ECD+L +A +LK S V D +G+ SDVRTSSG F+
Sbjct: 74 KPEVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFL 133
Query: 94 P--KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K +I IE +I+ ++ +P ENGE +QVLRYE Q Y PH+DYFSD N+ RGG
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK--KGIAVKPRRGDALLF 209
R+AT+LMYL D +GGET FP+ A +D+ S K KG+ VKP +G+A+LF
Sbjct: 194 RIATMLMYLGDNVEGGETHFPS----------AGSDECSCGGKLTKGLCVKPVKGNAVLF 243
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+S+ + DP S+H GCPV+ GEKWSATKW+
Sbjct: 244 WSMGLDGQSDPDSVHGGCPVLAGEKWSATKWM 275
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 141/253 (55%), Gaps = 33/253 (13%)
Query: 45 FVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGI 104
VY FL+D EC H+I+LA +Q+KRS V + + D+RTS GTF+ + D +IA I
Sbjct: 1 MVYHNFLSDRECRHIIDLAHAQMKRSTVVGS-KNAGVVDDIRTSYGTFLRRVPDPVIAAI 59
Query: 105 EDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVA 164
E ++A W+ LP + ED+QVLRY KY PH D G R+ATVL+YL
Sbjct: 60 EHRLALWSHLPASHQEDMQVLRYGPTNKYGPHID----------GLERVATVLIYLGQAE 109
Query: 165 KGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF-SLHTNAIPDPVSL 223
+ +LS+CA+ +A KP+RGDAL+FF ++ D S+
Sbjct: 110 RA---------------------NLSQCARGRVAYKPKRGDALMFFDTMPDYKQTDVHSM 148
Query: 224 HSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSA 283
H+GCPV+EG KW+A KW+H + + + + G C + + CE WA GEC NP +M+G+
Sbjct: 149 HTGCPVVEGVKWNAVKWLHGTPYGRPLPDPGICANLHEMCETWALQGECKNNPGFMIGAG 208
Query: 284 QLPGFCRRSCKVC 296
G CR +C C
Sbjct: 209 ASMGSCRLACNDC 221
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 91/155 (58%), Positives = 115/155 (74%), Gaps = 2/155 (1%)
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
F+ +GKD II IE +IA +TF+P ENGE +QVL Y G+KYEPHYDYF D+ N GG
Sbjct: 2 FLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 61
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R+ATVLMYLSDV +GGETVFP A + P N DLSECA+KG+++KP+ GDALLF+S
Sbjct: 62 RVATVLMYLSDVEEGGETVFP-AAKANFSSVPWWN-DLSECARKGLSLKPKMGDALLFWS 119
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ +A D SLH GCPVI G KWS+TKW+H++ +
Sbjct: 120 MRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEY 154
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 132/213 (61%), Gaps = 16/213 (7%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW PR V FL+ ECD+L +A ++L+ S V D +G+ SD RTSSG F+ +
Sbjct: 85 VSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSHHE 144
Query: 98 D--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLAT 155
++ IE +I+ ++ +P ENGE IQVLRYE Q Y+PH+DYFSD N+ RGG R+AT
Sbjct: 145 KNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRIAT 204
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK---GIAVKPRRGDALLFFSL 212
+LMYLS+ +GGET FP A C K G++VKP +GDA+LF+S+
Sbjct: 205 MLMYLSENIEGGETYFPKA-----------GSGECSCGGKTVPGLSVKPAKGDAVLFWSM 253
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDS 245
+ DP S+H GC V+ GEKWSATKW+ S
Sbjct: 254 GLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKS 286
>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
[Cucumis sativus]
Length = 311
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 102/271 (37%), Positives = 158/271 (58%), Gaps = 14/271 (5%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+PS+V Q+SW+PR F+Y+GFL+D ECDHLI+LA + + SG + +++ SS
Sbjct: 51 IDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSS 110
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
G + D I+A IE+++A WT LPK++ Q+++Y G++ + Y Y + +
Sbjct: 111 GVIL-NTTDDIVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSS 168
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
+ATV++YLSD A GGE +FP ++ + S KK ++P +G+A+L
Sbjct: 169 EPLMATVVLYLSDSASGGEILFPESK--------VKSKFWSGRRKKNNFLRPVKGNAILX 220
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV----DSFDKIVEEGGDCTDNNASCER 265
FS+H NA PD S H P+ +GE W ATK++++ + I + C D + SC +
Sbjct: 221 FSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCFDEDKSCPQ 280
Query: 266 WAALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
WAA+GEC +N +MVGS G CR+SC C
Sbjct: 281 WAAIGECERNAVFMVGSPDYYGTCRKSCNAC 311
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 99/257 (38%), Positives = 155/257 (60%), Gaps = 12/257 (4%)
Query: 33 SKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTF 92
++V+ +SW PRAF LT+ +C+ ++ +++++RS V D+++GESK+ +RTS TF
Sbjct: 1 TRVEPLSWYPRAFALRDALTEAQCEAVLRATRARVRRSTVVDSVTGESKVDPIRTSKQTF 60
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR---- 148
+ + ++ ++ I D ++ T LP + ED+QVL Y G+KY+ H D ++ R
Sbjct: 61 LNRDEE-VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGRELSK 119
Query: 149 -GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
GG R+ATVL+YL + GGET FP++E + T+ S+CA+ +A+KPRRGD L
Sbjct: 120 DGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTS--WSKCAEHRVAMKPRRGDGL 177
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF--DKIVEEGGD--CTDNNASC 263
+F+S+ N D +LH GCPV+ G KW+AT W+H + + K E C D + C
Sbjct: 178 IFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWVHAEPYRWQKPPEASATPGCEDAHDQC 237
Query: 264 ERWAALGECTKNPEYMV 280
WA GEC KNP +M+
Sbjct: 238 RGWANTGECDKNPGFML 254
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 98/209 (46%), Positives = 130/209 (62%), Gaps = 16/209 (7%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW PR V FL+ ECD+L LAK +L+ S V D +G+ S VRTSSG F+ +
Sbjct: 84 ISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESKVRTSSGMFLSSEE 143
Query: 98 DA--IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLAT 155
++ IE +I+ ++ +P ENGE IQVLRYE Q Y+PH+DYFSD N+ RGG R+AT
Sbjct: 144 KTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRVAT 203
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK---GIAVKPRRGDALLFFSL 212
+LMYLSD +GGET FP A C K G++VKP +G+A+LF+S+
Sbjct: 204 MLMYLSDNVEGGETYFPMA-----------GSGKCSCGGKVVDGLSVKPIKGNAVLFWSM 252
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ DP S+H GC V+ G KWSATKW+
Sbjct: 253 GLDGQSDPSSIHGGCEVLSGVKWSATKWM 281
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 103/270 (38%), Positives = 143/270 (52%), Gaps = 60/270 (22%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+++ ISW PRAF+Y FL++ ECDHL ++ ++ RS V D+ +G+SKL D+RTS G
Sbjct: 7 RIETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRSLVVDSKTGQSKLDDIRTSYGAAF 66
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN---IVRGG 150
+G+D +IA +E++IA WT LP E GE +Q+LRY GQKY+ H+D+F D V+ + G
Sbjct: 67 GRGEDPVIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAYLHEG 126
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
+R ATVL+YLS V GGET P
Sbjct: 127 NRYATVLLYLSGVEGGGETNLP-------------------------------------- 148
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH----VDSFDKIVEEGGDCTDNNASCERW 266
+ DP+ + +G KW+ATKWIH + +D + G C D +C
Sbjct: 149 ------LADPIDKEA-----QGMKWTATKWIHNKPYMGKYDPL-RTAGRCADTGGNCAAR 196
Query: 267 AALGECTKNPEYMVGSAQLPGFCRRSCKVC 296
AA GECT N + MVG A G CR+SC C
Sbjct: 197 AAAGECTSNMDKMVGPA---GECRKSCNDC 223
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 98/215 (45%), Positives = 135/215 (62%), Gaps = 20/215 (9%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ++W PR + FL+ ECD+L +A +L S V D +G+ SDVRTSSG F+
Sbjct: 80 KPEVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSDVRTSSGMFL 139
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K ++ IE +I+ ++ +P ENGE +QVLRYE Q Y+PH+DYFSD N+ RGG
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA-----KKGIAVKPRRGDA 206
R+AT+LMYLSD +GGET FP A EC+ KG++VKP +G+A
Sbjct: 200 RIATMLMYLSDNIEGGETYFPLAGS-------------GECSCGGKLVKGLSVKPIKGNA 246
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+LF+S+ + DP S+H GC VI GEKWSATKW+
Sbjct: 247 VLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWM 281
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 93/212 (43%), Positives = 132/212 (62%), Gaps = 4/212 (1%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V++ISW PRAF+Y G L+ ECD++IN A+ + ++ V D + + + +R + +I
Sbjct: 53 VERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNKLRNNKEAYID 112
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
D +I IE +IA +TFLP +GE +++Y GQ Y PH D+ D + G R+A
Sbjct: 113 GSADDVIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPRLGNERIA 172
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T+++YLSDV +GGETVFPN+ P A S+CA++GIAVKP +GDALL ++L
Sbjct: 173 TMIIYLSDVVEGGETVFPNSTMQPHVGDAA----YSKCAQQGIAVKPVKGDALLLYNLLE 228
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
N D SLH GCPVI G KW+ATK I V+
Sbjct: 229 NGRNDGESLHQGCPVIRGVKWTATKRILVNQL 260
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 141/220 (64%), Gaps = 15/220 (6%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAV-ADNLSGESKLSDVRTSSGTFIPKGKDAI 100
PRAF+Y FL++ EC LINLAK +++RS V A N + E +S RTSSG F+ KG++ +
Sbjct: 83 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQL 142
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY-FSDKVNIVRGGHRLATVLMY 159
+ IE +IA +TF+P ENGE + +L YE GQK+EPH+DY D + G R AT++MY
Sbjct: 143 VRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLVMY 202
Query: 160 LSDVAKGGETVFPNAEE---PPRR---RTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
LS V +GG TVFP A++ RR + P D G++VKP+ GDALLF+S+
Sbjct: 203 LSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKD------NGLSVKPKMGDALLFWSVK 256
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG 253
+ DP SLH+ PV++G+KW K +HV + D + +EG
Sbjct: 257 PDGTLDPTSLHASSPVVKGDKWVGVKLMHVKAKD-LTQEG 295
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 12/79 (15%)
Query: 162 DVAKGGETVFPNAEE-----PPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++ +GGETVFP A + P ++ P D G+++KP+ GDAL F+S+ +
Sbjct: 11 NIEEGGETVFPAANQCVSSVPWWKKLPTHGKD-------GLSIKPKMGDALFFWSMKPDG 63
Query: 217 IPDPVSLHSGCPVIEGEKW 235
D SLH PVI G++W
Sbjct: 64 TLDYTSLHGSYPVIRGDEW 82
>gi|7269410|emb|CAB81370.1| hypothetical protein [Arabidopsis thaliana]
Length = 315
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 160/309 (51%), Gaps = 68/309 (22%)
Query: 27 TAIINPSKVKQISWKPR-------------------AFVYEGFLTDLECDHLINLAKSQL 67
+ ++P++V Q+SW PR F+Y GFL++ ECDHLI+L K
Sbjct: 36 SKFVDPTRVLQLSWLPRNVCFTASNFRGLKQFGMYRVFLYRGFLSEEECDHLISLGKETT 95
Query: 68 KRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLP------------ 115
+ +V + G+++L D ++AGIE+K++ WTFLP
Sbjct: 96 EVYSV--DADGKTQL---------------DPVVAGIEEKVSAWTFLPGGLFSCGQTAGL 138
Query: 116 --------KENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGG 167
ENG I+V Y +K DYF ++ + V LATV++YLS+ +GG
Sbjct: 139 CFSLDAHFSENGGSIKVRSYTS-EKSGKKLDYFGEEPSSVLHESLLATVVLYLSNTTQGG 197
Query: 168 ETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGC 227
E +FPN+E P+ + C + G ++P +G+A+LFF+ NA D S H C
Sbjct: 198 ELLFPNSEVKPK----------NSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRC 247
Query: 228 PVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPG 287
PV++GE ATK I+ +I EE G+C+D + +C RWA LGEC KNP YM+GS G
Sbjct: 248 PVVKGELLVATKLIYAKKQARI-EESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYYG 306
Query: 288 FCRRSCKVC 296
CR+SC C
Sbjct: 307 TCRKSCNAC 315
>gi|2980790|emb|CAA18166.1| hypothetical protein [Arabidopsis thaliana]
Length = 316
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 160/309 (51%), Gaps = 68/309 (22%)
Query: 27 TAIINPSKVKQISWKPR-------------------AFVYEGFLTDLECDHLINLAKSQL 67
+ ++P++V Q+SW PR F+Y GFL++ ECDHLI+L K
Sbjct: 37 SKFVDPTRVLQLSWLPRNVCFTASNFRGLKQFGMYRVFLYRGFLSEEECDHLISLRKETT 96
Query: 68 KRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLP------------ 115
+ +V + G+++L D ++AGIE+K++ WTFLP
Sbjct: 97 EVYSV--DADGKTQL---------------DPVVAGIEEKVSAWTFLPGGLFSCGQTAGL 139
Query: 116 --------KENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGG 167
ENG I+V Y +K DYF ++ + V LATV++YLS+ +GG
Sbjct: 140 CFSLDAHFSENGGSIKVRSYTS-EKSGKKLDYFGEEPSSVLHESLLATVVLYLSNTTQGG 198
Query: 168 ETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGC 227
E +FPN+E P+ + C + G ++P +G+A+LFF+ NA D S H C
Sbjct: 199 ELLFPNSEMKPK----------NSCLEGGNILRPVKGNAILFFTRLLNASLDGKSTHLRC 248
Query: 228 PVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLPG 287
PV++GE ATK I+ +I EE G+C+D + +C RWA LGEC KNP YM+GS G
Sbjct: 249 PVVKGELLVATKLIYAKKQARI-EESGECSDEDENCGRWAKLGECKKNPVYMIGSPDYYG 307
Query: 288 FCRRSCKVC 296
CR+SC C
Sbjct: 308 TCRKSCNAC 316
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/192 (48%), Positives = 128/192 (66%), Gaps = 16/192 (8%)
Query: 52 TDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATW 111
T EC+HLIN+AK + +S V D+ +G+S + RTSSGTFI +G D I+ IE +IA +
Sbjct: 14 TKEECEHLINIAKPSMHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADF 72
Query: 112 TFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVF 171
TF+P ENGE + +L YE GQKYEPH D+F+D++N GG + GGETVF
Sbjct: 73 TFIPVENGESVNILHYEVGQKYEPHPDFFTDEINTKNGGEQ-------------GGETVF 119
Query: 172 PNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIE 231
P AE P N +LS+C KKG+++KP+ GDALLF+S+ + DP+S+H CPVI+
Sbjct: 120 PFAEG-NFSSVPWWN-ELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIK 177
Query: 232 GEKWSATKWIHV 243
G+KWS TKW+ V
Sbjct: 178 GDKWSCTKWMRV 189
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/214 (45%), Positives = 137/214 (64%), Gaps = 14/214 (6%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAV-ADNLSGESKLSDVRTSSGTFIPKGKDAI 100
PRAF+Y FL++ EC LINLAK +++RS V A N + E +S RTSSG F+ KG++ +
Sbjct: 74 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQL 133
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY-FSDKVNIVRGGHRLATVLMY 159
+ IE +IA +TF+P ENGE + +L YE GQK+EPH+DY D + G R AT++MY
Sbjct: 134 VRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLVMY 193
Query: 160 LSDVAKGGETVFPNAEE---PPRR---RTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
LS V +GG TVFP A++ RR + P D G++VKP+ GDALLF+S+
Sbjct: 194 LSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKD------NGLSVKPKMGDALLFWSVK 247
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
+ DP SLH+ PV++G+KW K +HV + D
Sbjct: 248 PDGTLDPTSLHASSPVVKGDKWVGVKLMHVKAKD 281
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 12/69 (17%)
Query: 162 DVAKGGETVFPNAEE-----PPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++ +GGETVFP A + P ++ P D G+++KP+ GDAL F+S+ +
Sbjct: 11 NIEEGGETVFPAANKCVSSVPWWKKLPTHGKD-------GLSIKPKMGDALFFWSMKPDG 63
Query: 217 IPDPVSLHS 225
D SLH+
Sbjct: 64 TLDYTSLHA 72
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/273 (36%), Positives = 160/273 (58%), Gaps = 24/273 (8%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW PRAF L + E ++ LA++++ RS V D+ SG+S ++ +RTS TF+
Sbjct: 9 VEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNPIRTSKQTFLS 68
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-----VRG 149
+ D ++ + +++++ T LP + ED+QVL Y G+KY+ H D + G
Sbjct: 69 R-NDPVVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGEEGTKSGDQLSKNG 127
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAE--EPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G R+AT+L+YL + +GGET FP++E +P R +T + S+CA + +A+KP RGD L
Sbjct: 128 GKRVATILLYLEEPEEGGETAFPDSEWIDPERAKT----ETWSKCAHRRVAMKPTRGDGL 183
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI-----VEEGGDCTDNNAS 262
+F+S+ + D +LH GCP G KW+AT W+H D ++ I V G C D +
Sbjct: 184 MFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWVHADPYNWIKPPDPVPTIG-CEDKSDR 242
Query: 263 CERWAALGECTKNPEYMVGSAQLPGFCRRSCKV 295
C WA +GEC KNP +M+ + C+ SC+V
Sbjct: 243 CRGWANIGECDKNPSFMLEN------CKWSCRV 269
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 180 bits (456), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 95/217 (43%), Positives = 132/217 (60%), Gaps = 16/217 (7%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I K + ++W PR + FL+ ECD+L +A+ L+ S V D +G+ SDVRTSS
Sbjct: 76 IGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQSDVRTSS 135
Query: 90 GTFIPKGKDA--IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
G F+ I+ IE +I+ ++ +P ENGE IQVLRY+ Q Y+PH+DYFSD N+
Sbjct: 136 GMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDYFSDSFNLK 195
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK---GIAVKPRRG 204
RGG R+AT+L+YLSD +GGET FP A C K G++V P +G
Sbjct: 196 RGGQRVATMLIYLSDNVEGGETYFPMA-----------GSGFCRCGGKSVRGLSVAPVKG 244
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+A+LF+S+ + DP S+H GC V+ GEKWSATKW+
Sbjct: 245 NAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWM 281
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/250 (39%), Positives = 144/250 (57%), Gaps = 32/250 (12%)
Query: 54 LECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTF 113
+ECDHL+++ + ++ S + S +++ +D +++ IED+I+ W+F
Sbjct: 1 MECDHLVSMGRGNMESSLAFTDGDRNSSYNNI-----------EDIVVSKIEDRISLWSF 49
Query: 114 LPKENGEDIQVLRYEHGQ----KYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGET 169
LPKENGE IQVL+Y + K EP G HRLAT+LMYLSDV +GGET
Sbjct: 50 LPKENGESIQVLKYGVNRSGSIKEEPKSS---------SGAHRLATILMYLSDVKQGGET 100
Query: 170 VFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPV 229
VFP +E + A S+C+ G AV+P +G+A+L F+L + D S + CPV
Sbjct: 101 VFPRSE---MKDAQAKEGAPSQCS--GYAVRPAKGNAILLFNLRPDGETDKDSQYEECPV 155
Query: 230 IEGEKWSATKWIHVDSFD---KIVEEGGDCTDNNASCERWAALGECTKNPEYMVGSAQLP 286
+EGEKW A K I++ FD + +CTD + C WAA GEC +NP +M+GS+
Sbjct: 156 LEGEKWLAIKHINLRKFDYPKSSLASEDECTDEDDRCVSWAASGECDRNPVFMIGSSDYY 215
Query: 287 GFCRRSCKVC 296
G CR+SC+VC
Sbjct: 216 GSCRKSCRVC 225
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/228 (44%), Positives = 137/228 (60%), Gaps = 18/228 (7%)
Query: 20 IRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGE 79
++KS +S + V+ +SW PRAFVY FLT ECDHLI LA +L+RS V S
Sbjct: 44 VQKSATSPGPGSGPWVETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDS-- 101
Query: 80 SKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY 139
+ D+RTS I G+ +I++ IE++IA WT VLRY +GQKY+ H+D+
Sbjct: 102 DLIDDIRTSFSASIMYGETSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDW 150
Query: 140 FSD-KVNIVRGGHRLATVLMYLSDV--AKGGETVFPNAEEPPRRRTPATNDDLSECAKK- 195
F D +V G +R+ATVLMYLSDV A GGET P AE + S+CA +
Sbjct: 151 FDDNEVAKAGGSNRMATVLMYLSDVDPAAGGETALPLAEPLDPHKQSVDGQGYSQCAARM 210
Query: 196 GIAVKPRRGDALLFFSLH-TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
GI+++PR+GD LLF+ + IPD +LH+ CP G KW+ATKWIH
Sbjct: 211 GISIRPRKGDVLLFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIH 258
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 132/215 (61%), Gaps = 20/215 (9%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ++W PR + FL+ ECD+L LA +L S V D +G+ SDVRTSSG F+
Sbjct: 80 KPEVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFL 139
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K ++ IE +I+ ++ +P ENGE +QVLRYE Q Y+P +DYF D N+ RGG
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGGQ 199
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA-----KKGIAVKPRRGDA 206
+AT+LMYLSD +GGET FP A EC+ KG++VKP +G+A
Sbjct: 200 GIATMLMYLSDNIEGGETYFPLAGS-------------GECSCGGKLVKGLSVKPIKGNA 246
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+LF+S+ + DP S+H GC VI GEKWSATKW+
Sbjct: 247 VLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWL 281
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/194 (47%), Positives = 119/194 (61%), Gaps = 2/194 (1%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+Q+ PRA+++ FLT E H++ LA +LKRS V N GE + ++RTS G FI
Sbjct: 54 VEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGN-DGEGVVDEIRTSYGMFIR 112
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ D +I IE +I+ WT LP E+ EDIQVLRY HGQ Y HYD DK N RLA
Sbjct: 113 RLADPVITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDS-GDKSNEPGPKWRLA 171
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T LMYLSDV +GGET FP P +SECAK +A KP+ GDA+LF+S +
Sbjct: 172 TFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFYP 231
Query: 215 NAIPDPVSLHSGCP 228
N DP ++H+GCP
Sbjct: 232 NLTMDPAAMHTGCP 245
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/214 (44%), Positives = 135/214 (63%), Gaps = 12/214 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS--DVRTSSGTFIPK 95
+SWKPRA +Y F + +C+ +I LA+++L S +A GES+ + ++RTSSGTF+
Sbjct: 78 LSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALR-KGESEATTKEIRTSSGTFLRA 136
Query: 96 GKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A +E+K+A T +P++NGE VLRY GQKY+ HYD F + R+
Sbjct: 137 SEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEYGPQPSQRM 196
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP + T + +C G+ VKPR+GDALLF+S+H
Sbjct: 197 ASFLLYLSDVEEGGETMFPFEN----FQNMNTGYNYKDCI--GLKVKPRQGDALLFYSMH 250
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI-HVDSF 246
N D +LH CPVI+GEKW ATKWI + D F
Sbjct: 251 PNGTFDKTALHGSCPVIKGEKWVATKWIRNTDKF 284
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 132/213 (61%), Gaps = 7/213 (3%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
++ ISW PRAF+Y GFL+ ECDHLI LA +L+RS V N S E + +RTS I
Sbjct: 38 IETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDE--VDPIRTSYSASIG 95
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVRGGHRL 153
+ ++A IE +IA WT LP+ + E ++VLRY +GQKY+ H+D+F + GG+R+
Sbjct: 96 YNETDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWFDETETGGTGGGNRM 155
Query: 154 ATVLMYLSDV--AKGGETVFPNAEEPPRRRTPATNDDLSECAKK-GIAVKPRRGDALLFF 210
AT LMYLSD+ A GGET P A+ SECA K GI+V+P++GD LLF+
Sbjct: 156 ATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGRGYSECASKMGISVRPKKGDVLLFW 215
Query: 211 SLHTNAI-PDPVSLHSGCPVIEGEKWSATKWIH 242
+ PD +LH+ CP G KW+ATKWIH
Sbjct: 216 DMEPGGREPDRHALHASCPTFSGTKWTATKWIH 248
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 122/208 (58%), Gaps = 9/208 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA-DNLSGESKLSDVRTSSGTFIPKG 96
+SWKPRA + F T +C +IN+AK L S +A E +RTSSG F+
Sbjct: 83 LSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLALRKGETEENTKGIRTSSGMFLSAS 142
Query: 97 KD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+D ++ IE+KIA T LP+ NGE +LRYE GQKY HYD F+ + R+A
Sbjct: 143 EDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKYNSHYDAFNPAEYGPQKSQRVA 202
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
+ L+YLSDV +GGET+FP + + D +C G+ V+PRRGD LLF+SL
Sbjct: 203 SFLLYLSDVEEGGETMFPFENDLDVDESY----DFEKCI--GLQVRPRRGDGLLFYSLFP 256
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N DP SLH CPVI+GEKW ATKWI
Sbjct: 257 NNTIDPTSLHGSCPVIKGEKWVATKWIR 284
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/214 (44%), Positives = 134/214 (62%), Gaps = 12/214 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS--DVRTSSGTFIPK 95
+SWKPRA +Y F + +C+ ++ LA+++L SA+A GES+ S D+RTSSGTF+
Sbjct: 93 LSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALALR-KGESEDSTKDIRTSSGTFLRA 151
Query: 96 GKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D + +E+K+A T +P+ENGE VL+Y GQKY+ HYD F + R+
Sbjct: 152 DEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYDVFDPAEYGPQPSQRM 211
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP + D +C G+ VKPR+GDALLF+S+H
Sbjct: 212 ASFLLYLSDVEEGGETMFPFE----NFQNMNIGFDYKKCI--GMKVKPRQGDALLFYSMH 265
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI-HVDSF 246
N D +LH CPVI+GEKW ATKWI + D F
Sbjct: 266 PNGTFDKSALHGSCPVIKGEKWVATKWIRNTDKF 299
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/195 (48%), Positives = 123/195 (63%), Gaps = 32/195 (16%)
Query: 38 ISWKPRAFVYEGFL--------TDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+ +PRAFVY FL T+ ECDHLI+LAK + RS V + L+G + S RTSS
Sbjct: 91 ITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKVRNALTGLGEESSSRTSS 150
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
GTFI G D I+ IE +I+ +TF+P+ENGE +QV+ YE GQK+EPH+D G
Sbjct: 151 GTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFD----------G 200
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
R+ATVLMYLSDV KGGETVFP A+ +KKG++V+P++GDALLF
Sbjct: 201 FQRIATVLMYLSDVDKGGETVFPEAKGIK--------------SKKGVSVRPKKGDALLF 246
Query: 210 FSLHTNAIPDPVSLH 224
+S+ + DP S H
Sbjct: 247 WSMRPDGSRDPSSKH 261
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/166 (55%), Positives = 114/166 (68%), Gaps = 4/166 (2%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
N V+ ISW+PRA VY FL EC +LI LAK +++S V D +G+S S VRTSSG
Sbjct: 74 NERWVEIISWEPRASVYHNFLE--ECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSG 131
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGG 150
TF+ +G+D I IE +I+ +TF+P E+GE +QVL YE GQKYEPHYDYF D+ N GG
Sbjct: 132 TFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGG 191
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
R+ATVLMYLSDV +GGETVFP A + P N +LSEC K G
Sbjct: 192 QRIATVLMYLSDVEEGGETVFP-AAKGNYSAVPWWN-ELSECGKGG 235
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 124/220 (56%), Gaps = 6/220 (2%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA-DNLSGESKLSDVRTS 88
+ P ++ ISW PR +Y GF+ C H + +AK++L S +A G + +VRTS
Sbjct: 106 VQPQQL--ISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQETENVRTS 163
Query: 89 SGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI 146
GTF+ + D +IA +E+K A T LP +GE VLRY+ GQ Y+ HYD F +
Sbjct: 164 QGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIFEPESYG 223
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+ R+AT+L YL+DV +GGET+FP T + C G KPR GDA
Sbjct: 224 PQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCT-TGFKYKPRMGDA 282
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
L+F+S+H N D +LH GCPV+ GEKW ATKWI F
Sbjct: 283 LMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIRDKCF 322
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/195 (47%), Positives = 124/195 (63%), Gaps = 32/195 (16%)
Query: 38 ISWKPRAFVYEGFL--------TDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+ +PRAFVY FL T+ EC+HLI+LAK + RS V + ++G + S RTSS
Sbjct: 58 IAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKVRNAITGLGEESSSRTSS 117
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
GTF+ KG D I+ IE +I+ +TF+P+ENGE +QV+ YE GQK+EPH+D G
Sbjct: 118 GTFLRKGHDKIVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFD----------G 167
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
R+ATVLMYLSDV KGGETVFP A+ +KKG++V+P++GDALLF
Sbjct: 168 FQRIATVLMYLSDVDKGGETVFPEAKGIK--------------SKKGVSVRPKKGDALLF 213
Query: 210 FSLHTNAIPDPVSLH 224
+S+ + DP S H
Sbjct: 214 WSMRPDGSQDPSSKH 228
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/210 (44%), Positives = 130/210 (61%), Gaps = 15/210 (7%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGTFIPK 95
+SW PRA ++ F + +C+ +I+LAK++L S++A GE+ + DVRTS G F+
Sbjct: 85 LSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALR-KGETATETQDVRTSHGCFLSS 143
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A +E+K+A T +PK +GE VLRYE GQKY HYD F+ + R+
Sbjct: 144 RQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAEYGPQKSQRM 203
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
A+ L+YLSDV +GGET+FP N E N D EC G+ VKP++GDALLF+S
Sbjct: 204 ASFLLYLSDVEEGGETMFPFENYEHMNE------NYDYKECI--GLKVKPKQGDALLFYS 255
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ N D +LH CPVI+GEKW ATKWI
Sbjct: 256 MFPNGTFDKTALHGSCPVIKGEKWVATKWI 285
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/210 (44%), Positives = 130/210 (61%), Gaps = 15/210 (7%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGTFIPK 95
+SW PRA ++ F + +C+ +I+LAK++L S++A GE+ + DVRTS G F+
Sbjct: 24 LSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALR-KGETATETQDVRTSHGCFLSS 82
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A +E+K+A T +PK +GE VLRYE GQKY HYD F+ + R+
Sbjct: 83 RQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAEYGPQKSQRM 142
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
A+ L+YLSDV +GGET+FP N E N D EC G+ VKP++GDALLF+S
Sbjct: 143 ASFLLYLSDVEEGGETMFPFENYEHMNE------NYDYKECI--GLKVKPKQGDALLFYS 194
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ N D +LH CPVI+GEKW ATKWI
Sbjct: 195 MFPNGTFDKTALHGSCPVIKGEKWVATKWI 224
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/210 (44%), Positives = 127/210 (60%), Gaps = 15/210 (7%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C++++ AK++L+ S +A GES+ + +RTSSGTF+
Sbjct: 142 LSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALR-KGESEETTKGIRTSSGTFLSA 200
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A IE KIA T +P+ +GE VLRYE GQKY HYD F + R+
Sbjct: 201 EEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQRV 260
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
A+ L+YL+DV +GGET+FP N + D +C G+ VKPR+GD LLF+S
Sbjct: 261 ASFLLYLTDVEEGGETMFPYENGD------NMNIGYDYEQCI--GLKVKPRKGDGLLFYS 312
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
L N DP SLH CPV+ GEKW ATKWI
Sbjct: 313 LMVNGTIDPTSLHGSCPVVRGEKWVATKWI 342
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 128/213 (60%), Gaps = 19/213 (8%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C++++ AK +L S +A GE++ S +RTSSGTF+
Sbjct: 103 LSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALR-KGETEESTKGIRTSSGTFLSS 161
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A +E KIA T +P+ +GE +LRYE GQ+Y HYD F + R+
Sbjct: 162 DEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRV 221
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKK--GIAVKPRRGDALLF 209
A+ L+YL+DV +GGET+FP N E N D+ +K G+ VKPR+GD LLF
Sbjct: 222 ASFLLYLTDVEEGGETMFPYENGE----------NMDIGYDYEKCIGLKVKPRKGDGLLF 271
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+SL N DP SLH CPVI+GEKW ATKWI
Sbjct: 272 YSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIR 304
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 128/213 (60%), Gaps = 19/213 (8%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C++++ AK +L S +A GE++ S +RTSSGTF+
Sbjct: 73 LSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALR-KGETEESTKGIRTSSGTFLSS 131
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A +E KIA T +P+ +GE +LRYE GQ+Y HYD F + R+
Sbjct: 132 DEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQRV 191
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKK--GIAVKPRRGDALLF 209
A+ L+YL+DV +GGET+FP N E N D+ +K G+ VKPR+GD LLF
Sbjct: 192 ASFLLYLTDVEEGGETMFPYENGE----------NMDIGYDYEKCIGLKVKPRKGDGLLF 241
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+SL N DP SLH CPVI+GEKW ATKWI
Sbjct: 242 YSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIR 274
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 127/208 (61%), Gaps = 11/208 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW PRA + F + +C +I +AK+ ++ S++A +GE++ + +RTSSGTFI
Sbjct: 79 LSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLALR-TGETEETTKGIRTSSGTFISA 137
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D I+ IE+KIA T +PK +GE VLRYE GQ+Y+ HYD F + R
Sbjct: 138 SEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQRA 197
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGETVFP + + D S+C G+ VKPRRGD LLF+SL
Sbjct: 198 ASFLLYLSDVEEGGETVFPYENG----QNMDASYDFSKCI--GLKVKPRRGDGLLFYSLF 251
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
N D SLH CPVI GEKW ATKWI
Sbjct: 252 PNGTIDLTSLHGSCPVIRGEKWVATKWI 279
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 126/208 (60%), Gaps = 12/208 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA Y GF+T +C H+IN+AK L+ S +A GE+ + +RTSSG F+
Sbjct: 92 LSWRPRALYYPGFITAEQCQHIINMAKPSLQPSTLALR-KGETAETTKGIRTSSGMFVFS 150
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D ++ IE+KIA T +P +GE VLRYE GQKY+ HYD F+ + R+
Sbjct: 151 SEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQKYDAHYDAFNPAEYGPQTSQRV 210
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
AT L+YLS+ +GGET FP + D +C G+ VKP +GDA+LF+S+
Sbjct: 211 ATFLLYLSNFEEGGETTFPIEND-----ENFEGYDAQKC--NGLRVKPHQGDAILFYSIF 263
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
N DP SLH+ C VI+GEKW ATKWI
Sbjct: 264 PNNTIDPASLHASCHVIKGEKWVATKWI 291
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 121/208 (58%), Gaps = 11/208 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK--LSDVRTSSGTFIPK 95
+SW+PRA + F T +C +IN+AKS L S VA + GE + +RTSSG FI
Sbjct: 78 LSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRV-GEIRGNTEGIRTSSGVFISA 136
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D + IE KIA +P+ +GE VLRYE GQ+Y HYD F + HR+
Sbjct: 137 SEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSHYDAFDPAEYGPQKSHRI 196
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
AT L+YLSDV +GGET+FP + + D C G+ VKP +GD LLF+S+
Sbjct: 197 ATFLVYLSDVEEGGETMFPFENGLNMDK----DYDFQRCI--GLKVKPHQGDGLLFYSMF 250
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
N DP SLH CPVI+GEKW ATKWI
Sbjct: 251 PNGTIDPTSLHGSCPVIKGEKWVATKWI 278
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 96/208 (46%), Positives = 126/208 (60%), Gaps = 11/208 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SWKPRA + F T +C ++I +AK +LK S +A GE+ S RTSSGTF+
Sbjct: 79 LSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALR-KGETAESTKGTRTSSGTFLSA 137
Query: 96 GKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D + IE KIA T +P+ +GE +LRYE GQKY+ HYD F+ + R+
Sbjct: 138 SEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQMSQRV 197
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV KGGET+FP ++ D +CA G+ VKPR+GD +LF+SL
Sbjct: 198 ASFLLYLSDVEKGGETMFPFE----NGVKISSVYDYKKCA--GLKVKPRQGDGILFYSLL 251
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
N D SLH CPVIEGEKW ATKWI
Sbjct: 252 PNGTIDQTSLHGSCPVIEGEKWVATKWI 279
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 95/214 (44%), Positives = 125/214 (58%), Gaps = 11/214 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS--DVRTSSGTFIPK 95
+SWKPRA + F T +C +I +AKS L+ S +A GE+ S RTSSGTFI
Sbjct: 77 LSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALR-QGETDESTKGTRTSSGTFISA 135
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D I+ +E KIA T +P+ +GE +LRYE GQ+Y HYD F+ + R+
Sbjct: 136 SEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQRV 195
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP + T D +C G+ VKP+RGD LLF+S+
Sbjct: 196 ASFLLYLSDVEEGGETMFPFEHD----LNIGTGYDYKKCI--GLKVKPQRGDGLLFYSVF 249
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
N D SLH CPVI GEKW ATKWI + D
Sbjct: 250 PNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQD 283
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 95/214 (44%), Positives = 125/214 (58%), Gaps = 11/214 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS--DVRTSSGTFIPK 95
+SWKPRA + F T +C +I +AKS L+ S +A GE+ S RTSSGTFI
Sbjct: 69 LSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALR-QGETDESTKGTRTSSGTFISA 127
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D I+ +E KIA T +P+ +GE +LRYE GQ+Y HYD F+ + R+
Sbjct: 128 SEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQRV 187
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP + T D +C G+ VKP+RGD LLF+S+
Sbjct: 188 ASFLLYLSDVEEGGETMFPFEHD----LNIGTGYDYKKCI--GLKVKPQRGDGLLFYSVF 241
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
N D SLH CPVI GEKW ATKWI + D
Sbjct: 242 PNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQD 275
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 123/210 (58%), Gaps = 13/210 (6%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA-DNLSGESKLSDVRTSSGTFIPKG 96
+SW+PRA + F T +C++++ AK++L+ S +A E +RTSSGTF+
Sbjct: 90 LSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEETTKGIRTSSGTFLSAD 149
Query: 97 KDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+D +A +E KIA T +P+ +GE VLRYE GQKY HYD F + R+A
Sbjct: 150 EDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQRVA 209
Query: 155 TVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL 212
+ L+YL+DV +GGET+FP N E D +C G+ VKPR+GD LLF+SL
Sbjct: 210 SFLLYLTDVEEGGETMFPYENGE------NMDIGYDYEQCI--GLKVKPRKGDGLLFYSL 261
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N D SLH CPVI+GEKW ATKWI
Sbjct: 262 MVNGTIDLTSLHGSCPVIKGEKWVATKWIR 291
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 127/213 (59%), Gaps = 19/213 (8%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C++++ AK +LK S +A GE+ S +RTSSGTF+
Sbjct: 89 LSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALR-KGETAESTKGIRTSSGTFLSA 147
Query: 96 GKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A IE KIA T LP+ +GE VLRY GQ+Y HYD F + R+
Sbjct: 148 NEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQRV 207
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKK--GIAVKPRRGDALLF 209
A+ L+YL+DV +GGET+FP N+E N D+ +K G+ VKPR+GD LLF
Sbjct: 208 ASFLLYLTDVEEGGETMFPYENSE----------NMDIGYDYEKCIGLKVKPRKGDGLLF 257
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+SL N D SLH CPVI+GEKW ATKWI
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 122/208 (58%), Gaps = 9/208 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA-DNLSGESKLSDVRTSSGTFIPKG 96
+SW+PRA + F T +C+++I++AK LK S +A E +RTSSG F+
Sbjct: 86 LSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQGETEENTKGIRTSSGVFVSAS 145
Query: 97 KD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
D +A IE+KIA T +P+ +GE +LRYE Q+Y HYD F+ + R+A
Sbjct: 146 GDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYNSHYDAFNPAEYGPQKSQRMA 205
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
+ L+YL+DV +GGET+FP N +C G+ VKPR+GD LLF+SL T
Sbjct: 206 SFLLYLTDVEEGGETMFPFENG----LNMDGNYGYEDCI--GLKVKPRQGDGLLFYSLLT 259
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N DP SLH CPVI+GEKW ATKWI
Sbjct: 260 NGTIDPTSLHGSCPVIKGEKWVATKWIR 287
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 127/213 (59%), Gaps = 19/213 (8%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C++++ AK +LK S +A GE+ S +RTSSGTF+
Sbjct: 89 LSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALR-KGETAESTKGIRTSSGTFLSA 147
Query: 96 GKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A IE KIA T LP+ +GE VLRY GQ+Y HYD F + R+
Sbjct: 148 NEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQRV 207
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKK--GIAVKPRRGDALLF 209
A+ L+YL+DV +GGET+FP N+E N D+ +K G+ VKPR+GD LLF
Sbjct: 208 ASFLLYLTDVEEGGETMFPYENSE----------NMDIGYDYEKCIGLKVKPRKGDGLLF 257
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+SL N D SLH CPVI+GEKW ATKWI
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 121/209 (57%), Gaps = 11/209 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGTFIPK 95
+SW+PRA + F T +C +I AK LK SA+A GE+ RTSSGTFI
Sbjct: 81 LSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALR-KGETAENTKGTRTSSGTFISA 139
Query: 96 GKDAIIA--GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D+ A +E KIA T +P+ +GE +LRYE GQKY+ HYD F+ + R+
Sbjct: 140 SEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQRI 199
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP T D +C G+ VKPR+GD LLF+S+
Sbjct: 200 ASFLLYLSDVEEGGETMFPFENGS----NMGTGYDYKQCI--GLKVKPRKGDGLLFYSVF 253
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N D SLH CPV +GEKW ATKWI
Sbjct: 254 PNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 125/208 (60%), Gaps = 11/208 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS--DVRTSSGTFIPK 95
+SWKPRA + F T +C+ +I + +S+LK S +A GE+ S D RTSSG+F+
Sbjct: 85 LSWKPRALYFPKFATPEQCESIIKMVESKLKPSTLALR-KGETAESTKDTRTSSGSFVSG 143
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D + IE KIA T +P+ +GE +LRYE GQKY+ HYD F+ + R
Sbjct: 144 SEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDEYGQQSSQRT 203
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLS+V +GGET+FP E P D +C G+ VKPR+GD LLF+SL
Sbjct: 204 ASFLLYLSNVEEGGETMFPF--ENGSAVIPGF--DYKQCV--GLKVKPRQGDGLLFYSLF 257
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
N DP SLH CPVI+G KW ATKWI
Sbjct: 258 PNGTIDPTSLHGSCPVIKGVKWVATKWI 285
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 122/211 (57%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
KPRA ++ L+ ECD LI L+K++L RS V D+ +G +KL + RTSSGTF +G
Sbjct: 99 KPRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFHRGTTPF 158
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
IA I+ ++A +P+ +GE +Q+L Y+ G +Y PHYDYF ++ RGG R AT
Sbjct: 159 IAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARGGQRTAT 218
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DV GGET+FP + G+++ P +G A+ F +
Sbjct: 219 LIIYLNDVDGGGETIFP---------------------RNGLSIVPAKGSAIYFSYTNAE 257
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D +S H G PVIEGEKW ATKW+ + +
Sbjct: 258 NQLDSLSFHGGSPVIEGEKWIATKWVRQNEY 288
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 95/232 (40%), Positives = 136/232 (58%), Gaps = 14/232 (6%)
Query: 16 FSLLIRKSFSSTAIIN-PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
++L+ F +I + P +V +SWKPRA + F T +C++++++AK+ LK S++A
Sbjct: 67 YNLMTAGEFGDDSITSIPFQV--LSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLAL 124
Query: 75 NLSGES--KLSDVRTSSGTFIPKGKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHG 130
GE+ +RTSSG F+ +D + IE+KIA T +P+ +GE +LRYE G
Sbjct: 125 R-KGETTENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVG 183
Query: 131 QKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLS 190
Q+Y HYD F+ + R+A+ L+YL+DV +GGET+FP T D
Sbjct: 184 QRYNSHYDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYED--- 240
Query: 191 ECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
C G+ VKPR+GD LLF+SL N D SLH CPVI+GEKW ATKWI
Sbjct: 241 -CV--GLRVKPRQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIR 289
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 136/232 (58%), Gaps = 14/232 (6%)
Query: 16 FSLLIRKSFSSTAIIN-PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
++L+ F +I + P +V +SWKPRA + F T +C++++++AK+ LK S++A
Sbjct: 67 YNLMTAGEFGDDSITSIPFQV--LSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLAL 124
Query: 75 NLSGES--KLSDVRTSSGTFIPKGKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHG 130
GE+ +RTSSG F+ +D + IE+KIA T +P+ +GE +LRYE G
Sbjct: 125 R-KGETTENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVG 183
Query: 131 QKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLS 190
Q+Y HYD F+ + R+A+ L+YL+DV +GGET+FP T D
Sbjct: 184 QRYYSHYDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYED--- 240
Query: 191 ECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ G+ VKPR+GD LLF+SL N D SLH CPVI+GEKW ATKWI
Sbjct: 241 ---RVGLRVKPRQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIR 289
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 90/213 (42%), Positives = 122/213 (57%), Gaps = 9/213 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA-DNLSGESKLSDVRTSSGTFIPKG 96
+SW+PRA + F T +C+ +I++AK LK S +A E +RTSSG F+
Sbjct: 86 LSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQGETEDNTKGIRTSSGVFVSAS 145
Query: 97 KDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+D + IE+KIA T +P+ +GE +LRYE Q+Y HYD F+ + R+A
Sbjct: 146 EDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNPAEYGPQKSQRMA 205
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
+ L+YL+DV +GGET+FP N +C G+ VKPR+GD LLF+SL T
Sbjct: 206 SFLLYLTDVEEGGETMFPFE----NGLNMDGNYGYEDCI--GLKVKPRQGDGLLFYSLLT 259
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
N DP SLH CPVI+GEKW ATKWI D
Sbjct: 260 NGTIDPTSLHGSCPVIKGEKWVATKWIRDQELD 292
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 75/129 (58%), Positives = 100/129 (77%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAFVY FLT EC++LI++AK + +S+V D+ +G+SK S VRTSSGTF+
Sbjct: 79 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSETGKSKDSRVRTSSGTFLA 138
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D I+ IE +IA ++F+P E+GE +QVL YE GQKYEPHYDYF D N GG R+A
Sbjct: 139 RGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQRIA 198
Query: 155 TVLMYLSDV 163
TVLMYL+DV
Sbjct: 199 TVLMYLTDV 207
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 122/209 (58%), Gaps = 11/209 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C ++NLAK +L+ S +A GE+ S VRTSSG F
Sbjct: 84 LSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALR-KGETAESTKGVRTSSGVFFSA 142
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D + IE+KIA T +P+ +GE +LRYE GQKY HYD F + R+
Sbjct: 143 SEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQKSQRV 202
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YL+DV +GGET+FP T + C G+ VKPR+GD LLF+S+
Sbjct: 203 ASFLLYLTDVEEGGETMFPFENGLNMDGT----YNFQTCI--GLKVKPRQGDGLLFYSVF 256
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N DP SLH CPVI+G+KW ATKWI
Sbjct: 257 PNGTIDPTSLHGSCPVIKGQKWVATKWIR 285
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 90/213 (42%), Positives = 121/213 (56%), Gaps = 9/213 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA-DNLSGESKLSDVRTSSGTFIPKG 96
+SW+PRA + F T +C+ +I++AK LK S +A E +RTSSG F+
Sbjct: 86 LSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQGETEDNTKGIRTSSGVFVSAS 145
Query: 97 KDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+D + IE+KIA T +P+ +GE +LRYE Q+Y HYD F+ + R+A
Sbjct: 146 EDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNPAEYGPQKSQRMA 205
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
+ L+YL+DV +GGET+FP N C G+ VKPR+GD LLF+SL T
Sbjct: 206 SFLLYLTDVEEGGETMFPFENG----LNMDGNYGYEGCI--GLKVKPRQGDGLLFYSLLT 259
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
N DP SLH CPVI+GEKW ATKWI D
Sbjct: 260 NGTIDPTSLHGSCPVIKGEKWVATKWIRDQELD 292
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 126/213 (59%), Gaps = 19/213 (8%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C++++ AK +LK S +A GE+ S +RTSSGTF+
Sbjct: 89 LSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALR-KGETAESTKGIRTSSGTFLSA 147
Query: 96 GKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D +A IE KIA T +P+ +GE VLRY GQ+Y HYD F + R+
Sbjct: 148 NEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPVQYGPQKSQRV 207
Query: 154 ATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKK--GIAVKPRRGDALLF 209
A+ L+YL++V +GGET+FP N E N D+ +K G+ VKPR+GD LLF
Sbjct: 208 ASFLLYLTNVEEGGETMFPYENGE----------NMDIGYDYEKCIGLKVKPRKGDGLLF 257
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+SL N D SLH CPVI+GEKW ATKWI
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 120/209 (57%), Gaps = 11/209 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGTFIPK 95
+SW+PRA + F T +C +I AK LK SA+A GE+ RTSSGTFI
Sbjct: 81 LSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALR-KGETAENTKGTRTSSGTFISA 139
Query: 96 GKDAIIA--GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+++ A +E KIA T +P+ +GE +LRYE GQKY+ HYD F+ + R+
Sbjct: 140 SEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQRI 199
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP D +C G+ VKPR+GD LLF+S+
Sbjct: 200 ASFLLYLSDVEEGGETMFPFENGS----NMGIGYDYKQCI--GLKVKPRKGDGLLFYSVF 253
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N D SLH CPV +GEKW ATKWI
Sbjct: 254 PNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/126 (59%), Positives = 97/126 (76%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V+ +SW+PRAFVY FLT EC++LI++AK + +S V D+ +G+SK S VRTSSGTF+
Sbjct: 78 VEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFLA 137
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+G+D I+ IE KIA +TF+P E+GE +QVL YE GQKYEPHYDYF D+ N GG R+A
Sbjct: 138 RGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIA 197
Query: 155 TVLMYL 160
TVLMYL
Sbjct: 198 TVLMYL 203
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 120/209 (57%), Gaps = 11/209 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGTFIPK 95
+SW+PRA + F T +C +I AK LK SA+A GE+ RTSSGTFI
Sbjct: 81 LSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALR-KGETAENTKGTRTSSGTFISA 139
Query: 96 GKDAIIA--GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+++ A +E KIA T +P+ +GE +LRYE GQKY+ HYD F+ + R+
Sbjct: 140 SEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQRI 199
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP D +C G+ VKPR+GD LLF+S+
Sbjct: 200 ASFLLYLSDVEEGGETMFPFENGS----NMGIGYDYKQCI--GLKVKPRKGDGLLFYSVF 253
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N D SLH CPV +GEKW ATKWI
Sbjct: 254 PNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 92/207 (44%), Positives = 119/207 (57%), Gaps = 10/207 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSA-VADNLSGESKLSDVRTSSGTFIPKG 96
+SWKPRA + F T +C +I +AKS L S V E +RTSSGTFI
Sbjct: 76 LSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRKGETEESTKGIRTSSGTFISAS 135
Query: 97 KD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+D I+ IE KIA T +P+ +GE +LRYE GQ+Y HYD S ++ R+A
Sbjct: 136 EDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHYDAISPAEYGLQTSQRIA 195
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
+ L+YLSDV +GGET+FP + + +C G+ VKPRRGD LLF+S+
Sbjct: 196 SFLLYLSDVEEGGETMFPFEHD-----LNINTFNSRKCI--GLKVKPRRGDGLLFYSVFP 248
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWI 241
N D S+H CPVIEGEKW ATKWI
Sbjct: 249 NGTIDWTSMHGSCPVIEGEKWVATKWI 275
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 82/157 (52%), Positives = 108/157 (68%), Gaps = 12/157 (7%)
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR-GG 150
F+ +G+D I+ IE +IA +T +P ENGE +QVL Y GQK+EPH+DY +D ++ + GG
Sbjct: 2 FLKRGQDTIVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDY-TDGTSVTKIGG 60
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R AT LMYLSDV +GGETVFPN AT + AK GI+VKP+ GDALLF+
Sbjct: 61 PRKATFLMYLSDVEEGGETVFPN----------ATAKGSAPSAKSGISVKPKMGDALLFW 110
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
S+ + DP SLH PVI+G+KWSATKWIHV+ ++
Sbjct: 111 SMKPDGSLDPKSLHGASPVIKGDKWSATKWIHVNKYN 147
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 89/208 (42%), Positives = 125/208 (60%), Gaps = 13/208 (6%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW PRA + F + +C+ +I +A+ LK S +A GE++ S +RTSSG F+
Sbjct: 92 LSWYPRALYFPNFASAEQCESIIEMARGGLKSSTLALR-KGETEESTKGIRTSSGVFMSA 150
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D I+ IE+KIA T +P+ +GE +LRYE GQKY HYD F + R+
Sbjct: 151 SEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQKYNSHYDAFDEAEYGPLQSQRV 210
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YL+DV +GGET+FP R + ++ +C G+ V+PR+GDALLF+SL
Sbjct: 211 ASFLLYLTDVPEGGETMFPYENGFNR------DGNVEDCI--GLRVRPRKGDALLFYSLL 262
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
N D S H CPVI+GEKW ATKWI
Sbjct: 263 PNGTIDQTSAHGSCPVIKGEKWVATKWI 290
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 90/209 (43%), Positives = 121/209 (57%), Gaps = 11/209 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW+PRA + F T +C ++NLAK +L+ S +A GE+ S VRTSSG F
Sbjct: 84 LSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALR-KGETAESTKGVRTSSGVFFSA 142
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D + IE+K A T +P+ +GE +LRYE GQKY HYD F + R+
Sbjct: 143 SEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQKSQRV 202
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YL+DV +GGET+FP T + C G+ VKPR+GD LLF+S+
Sbjct: 203 ASFLLYLTDVEEGGETMFPFENGLNMDGT----YNFQTCI--GLKVKPRQGDGLLFYSVF 256
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
N DP SLH CPVI+G+KW ATKWI
Sbjct: 257 PNGTIDPTSLHGSCPVIKGQKWVATKWIR 285
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 124/216 (57%), Gaps = 15/216 (6%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD--VRTSSGTFIPK 95
+SW P A + F T +C+ +I AK LK S + + GE+ S +RTSSG FI
Sbjct: 95 LSWNPHALYFPNFATAEQCESIIETAKEGLKPSTLVLRV-GETDESTTGIRTSSGVFISA 153
Query: 96 GKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+D ++ IE+KIA T +P+ +GE VLRY+ GQKY HYD + + R+
Sbjct: 154 FEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQKYSSHYDALHPDIYGPQKSQRM 213
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK--GIAVKPRRGDALLFFS 211
A+ L+YLSDV +GGET+FP N D S +K G+ VKPR+GD LLF+S
Sbjct: 214 ASFLLYLSDVPEGGETMFPFEN--------GLNMDGSYYYEKCIGLKVKPRKGDGLLFYS 265
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
L N DP+SLH CPVI+GEKW ATKWI D
Sbjct: 266 LFPNGTIDPMSLHGSCPVIKGEKWVATKWIRDQVLD 301
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 122/212 (57%), Gaps = 32/212 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L+D ECD +I A+ +++RS DN SG ++D RTS+G F +G++ +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENELI 171
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
+ +E +IA P ENGE +QVL Y G +Y+PHYDYF+ + RGG R+ T+
Sbjct: 172 SLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL++ A+GG T FP+ G+ V PRRG+A +FFS +
Sbjct: 232 VMYLNEPARGGATTFPDV---------------------GLQVVPRRGNA-VFFSYNR-- 267
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
PDP +LH G PV+EGEKW ATKW+ F
Sbjct: 268 -PDPATKTLHGGAPVLEGEKWIATKWLREREF 298
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 74/123 (60%), Positives = 92/123 (74%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW+PRAFVY FL+ ECDHLI+LAK +K+S V D+ +G SK S VRTSSG F+ +G+
Sbjct: 99 LSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLRRGQ 158
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D II IE +IA +TF+P E GE +QVL YE GQKYEPH+DYF D N GG R+AT+L
Sbjct: 159 DKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQRIATLL 218
Query: 158 MYL 160
MYL
Sbjct: 219 MYL 221
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/212 (40%), Positives = 122/212 (57%), Gaps = 32/212 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L+D ECD +I A+ +++RS DN SG ++D RTS+G F +G++ +I
Sbjct: 119 PRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 178
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
+ +E +IA P ENGE +QVL Y G +Y+PHYDYF+ + RGG R+ T+
Sbjct: 179 SLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 238
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL++ A+GG T FP+ G+ + PRRG+A +FFS +
Sbjct: 239 VMYLNEPARGGATTFPDV---------------------GLQIVPRRGNA-VFFSYNR-- 274
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
PDP +LH G PV+EGEKW ATKW+ F
Sbjct: 275 -PDPATKTLHGGAPVLEGEKWIATKWLREREF 305
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/232 (39%), Positives = 126/232 (54%), Gaps = 9/232 (3%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSA-VADNLS 77
L++ S I + +SW PRA + F++ +C+ +I +A+ LK S V
Sbjct: 73 LLKAGDSGDDYITLIPFQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRKGE 132
Query: 78 GESKLSDVRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEP 135
E +RTS G F+ +D I+ IE+KIA T +P+ +GE +LRYE GQKY P
Sbjct: 133 TEESTKGIRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSP 192
Query: 136 HYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK 195
HYD F + R A+ L+YL+DV +GGET+FP R + D +C
Sbjct: 193 HYDAFDEAEFGPLQSQRAASFLLYLTDVPEGGETLFPYENGFNR----DGSYDFEDCI-- 246
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
G+ V+PR+GD LLF+SL N D S+H CPVI+GEKW ATKWI D
Sbjct: 247 GLRVRPRKGDGLLFYSLLPNGTIDQTSVHGSCPVIKGEKWVATKWIRDQVLD 298
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 129/215 (60%), Gaps = 15/215 (6%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS--DVRTSS 89
PS++ +SW+PRA + F + C +I +AK +L+ S +A GE+ S D RTSS
Sbjct: 75 PSQI--LSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALR-KGETAESTKDTRTSS 131
Query: 90 GTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-DKVNI 146
GTFI +D I+ +E KIA T +P+ +GE +L+YE GQKY+ HYD F+ D+
Sbjct: 132 GTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSHYDAFNPDEYGS 191
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
V R+A+ L+YLS+V GGET+FP R D +C G+ VKPR+GD
Sbjct: 192 VES-QRIASFLLYLSNVEAGGETMFPYEGGLNIDR----GYDYQKCI--GLKVKPRQGDG 244
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
LLF+SL N D SLH CPVI+GEKW ATKWI
Sbjct: 245 LLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWI 279
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 120/212 (56%), Gaps = 32/212 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L+D ECD +I A +++RS DN SG ++D RTS+G F +G++ +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
+E +IA P ENGE +QVL Y G +Y+PHYDYF+ + RGG R+ T+
Sbjct: 172 CRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL++ A+GG T FP+ G+ V PRRG+A +FFS +
Sbjct: 232 VMYLNEPARGGATTFPDV---------------------GLQVVPRRGNA-VFFSYNR-- 267
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
PDP +LH G PV+EGEKW ATKW+ F
Sbjct: 268 -PDPATKTLHGGAPVLEGEKWIATKWLREREF 298
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 88/213 (41%), Positives = 127/213 (59%), Gaps = 8/213 (3%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGT 91
K + +S PR+ +Y F +D +CD ++ A+S+L +S +A GE+ ++RTSSGT
Sbjct: 127 KFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALK-RGETLETTKNIRTSSGT 185
Query: 92 FIPKG--KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
F+ + + +E+K+A T +P +GE +LRYE GQKY+ HYD F +
Sbjct: 186 FLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDMFDPSQYGPQR 245
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
R+A+ L+YL+ +GGETVFP + R D + C + G+ VKPR+GDALLF
Sbjct: 246 SQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGI--DYTSC-EAGLKVKPRKGDALLF 302
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+S+H N D SLH GCPVI G K+ ATKWIH
Sbjct: 303 WSVHPNNTFDRSSLHGGCPVISGTKFVATKWIH 335
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 80/164 (48%), Positives = 105/164 (64%), Gaps = 18/164 (10%)
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
S+VRTSSG F+ + IE +I+ ++ +P ENGE +QVLRYE Q Y PH+DYFSD
Sbjct: 9 SNVRTSSGMFLSSEERKSPMAIEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA-----KKGI 197
N+ RGG R+AT+LMYLSD +GGET FP A EC+ KG+
Sbjct: 69 TFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGS-------------GECSCGGKIVKGL 115
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+VKP +GDA+LF+S+ + DP S+H GC V+ GEKWSATKW+
Sbjct: 116 SVKPIKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWM 159
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 85/186 (45%), Positives = 116/186 (62%), Gaps = 20/186 (10%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ISW PR V+ FL+ ECD+L +A+ +L+ S V D +G+ SDVRTSSG F+
Sbjct: 59 KPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRTSSGMFV 118
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K +I IE +I+ ++ +P ENGE IQVLRYE Q Y PH+DYFSD N+ RGG
Sbjct: 119 NSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSDTFNLKRGGQ 178
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA-----KKGIAVKPRRGDA 206
R+AT+LMYL+D +GGET FP A + EC +G+ VKP +GDA
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGD-------------GECICGGRLVRGLCVKPNKGDA 225
Query: 207 LLFFSL 212
+LF+S+
Sbjct: 226 VLFWSM 231
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 115/211 (54%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P A + + FLT ECD LI LA+ +L RS V D ++G + R+S GTF + +
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAAGHRSSDGTFFRLAETPL 160
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
+A +E +IA T L ENGE +Q+LRY+ G + PH DY +++ +I R G R+ T
Sbjct: 161 VARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQRVGT 220
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+LMYL+DV GGETVFP G +V PRRG AL F +
Sbjct: 221 LLMYLNDVEGGGETVFPQV---------------------GCSVVPRRGQALYFEYCNRA 259
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ DP SLH+ P+ GEKW ATKWI F
Sbjct: 260 GVCDPASLHASTPLRSGEKWVATKWIRARRF 290
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 122/212 (57%), Gaps = 32/212 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L++ ECD +I A+ +++RS DN SG ++D RTS+G F +G++ +I
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAARPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
+ +E +IA P ENGE +QVL Y G +Y+PHYDYF+ + RGG R+ T+
Sbjct: 172 SRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL++ A+GG T FP+ G+ V PRRG+A +FFS +
Sbjct: 232 VMYLNEPARGGATTFPDV---------------------GLQVVPRRGNA-VFFSYNR-- 267
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
P+P +LH G PV+EGEKW ATKW+ F
Sbjct: 268 -PEPATKTLHGGAPVLEGEKWIATKWLREREF 298
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 127/222 (57%), Gaps = 12/222 (5%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS- 83
S +N + +SW+PRA + F + C +I +AK +L+ S +A GE+ S
Sbjct: 65 SGEPFLNSIPFQILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALR-KGETAEST 123
Query: 84 -DVRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
D RTSSGTFI +D I+ +E KIA T +P+ +GE +L+YE QKY+ HYD F
Sbjct: 124 KDTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHYDAF 183
Query: 141 S-DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAV 199
+ D+ V R+A+ L+YLS+V GGET+FP D +C G+ V
Sbjct: 184 NPDEYGTVE-SQRIASFLLYLSNVEAGGETMFPYEG---GLNIDKGYYDYKKCI--GLKV 237
Query: 200 KPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
KPR+GD LLF+SL N D SLH CPVI+GEKW ATKWI
Sbjct: 238 KPRQGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWI 279
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/213 (38%), Positives = 118/213 (55%), Gaps = 26/213 (12%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKD 98
S +P + L D ECD LI + + ++RS+V D SG+ + R S G F+ D
Sbjct: 91 SEQPVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTD 150
Query: 99 AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-----KVNIVRGGHRL 153
A++ I+ +IA P ENGED+ +LRY G +Y PHYDYF + K ++ RGG R+
Sbjct: 151 ALVETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRI 210
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
ATV++YL++V +GG+T FP+ G+A+ PRRG AL F ++
Sbjct: 211 ATVILYLNEVEQGGDTTFPDI---------------------GLAIHPRRGSALYFEYVN 249
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP +LH+G PV +GEKW ATKWI F
Sbjct: 250 ELGQSDPKTLHAGTPVEKGEKWIATKWIRRGRF 282
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 80/168 (47%), Positives = 106/168 (63%), Gaps = 16/168 (9%)
Query: 83 SDVRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
SD RTSSG F+ + ++ IE +I+ ++ +P ENGE IQVLRYE Q Y+PH+DYF
Sbjct: 4 SDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYF 63
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK---GI 197
SD N+ RGG R+AT+LMYLS+ +GGET FP A C K G+
Sbjct: 64 SDTFNLKRGGQRIATMLMYLSENIEGGETYFPKA-----------GSGECSCGGKTVPGL 112
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDS 245
+VKP +GDA+LF+S+ + DP S+H GC V+ GEKWSATKW+ S
Sbjct: 113 SVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKS 160
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 118/211 (55%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P A + + FL+ EC+ LI LA+ +L RS V D ++G + ++ R+S G F G+ +
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPL 160
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
IA +E +IA T LP ENGE +Q+L YE G + PH DY +++ +I R G R+ T
Sbjct: 161 IARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQRVGT 220
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+LMYL+DV GGET+FP + G +V PRRG AL F +
Sbjct: 221 LLMYLNDVEGGGETMFP---------------------QTGWSVVPRRGQALYFEYGNRF 259
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ DP SLH+ P+ GEKW ATKWI F
Sbjct: 260 GLADPSSLHTSTPLRAGEKWVATKWIRTRRF 290
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 119/211 (56%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P A + + FL+ EC+ LI+LA+ +L RS V D ++G + ++ R+S G F G+ +
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPL 160
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
IA +E +IA T LP ENGE +Q+L YE G + PH DY +++ +I R G R+ T
Sbjct: 161 IARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQRVGT 220
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+LMYL+DV GGET+FP + G +V PRRG AL F +
Sbjct: 221 LLMYLNDVEGGGETMFP---------------------QTGWSVVPRRGQALYFEYGNRF 259
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ DP SLH+ P+ GEKW ATKWI F
Sbjct: 260 GLADPSSLHTSTPLRVGEKWVATKWIRTRRF 290
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 70/123 (56%), Positives = 93/123 (75%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
ISW+PRAFVY FL+ EC++LI LAK + +S V D+ +G+SK S VRTSSG F+ +G+
Sbjct: 100 ISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGR 159
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I IE +IA +TF+P ++GE +QVL YE GQKYEPH+DYF D+ N GG R+AT+L
Sbjct: 160 DKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRMATLL 219
Query: 158 MYL 160
MYL
Sbjct: 220 MYL 222
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 117/211 (55%), Gaps = 28/211 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ GFL+ ECD L+ LA+ +L RS DN +G S++++ RTS G F +G+ +I
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFFMRGEGELI 159
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
+ IE +IA P ENGE +QVL Y G +Y+PHYDYF + RGG R+ T+
Sbjct: 160 SRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILKRGGQRVGTL 219
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+ + V P +G+A +FFS + A
Sbjct: 220 VMYLNTPERGGGTTFPDV---------------------NLEVAPIKGNA-VFFS-YERA 256
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
P SLH G PV+ GEKW ATKW+ FD
Sbjct: 257 HPSTRSLHGGAPVLAGEKWVATKWLRQARFD 287
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/227 (38%), Positives = 127/227 (55%), Gaps = 11/227 (4%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L+ S I + +SW PRA + F + +CD +I +AK++L S + G
Sbjct: 67 LLHAGKSGDNFITSIPFQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLMLR-EG 125
Query: 79 ESK--LSDVRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYE 134
E++ +RTSSG FI +D ++ I++KIA +PK +G +LRY+ GQKY
Sbjct: 126 ETEEGTKGIRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYN 185
Query: 135 PHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK 194
HYD F+ + R+A+ L+YL+DV +GGET+FP ++ + +C
Sbjct: 186 SHYDAFNPAEYGPQESQRVASFLLYLTDVPEGGETMFPFE----NGSNMDSSYNFEDCI- 240
Query: 195 KGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
G+ +KP +GD LLF+SL N DP SLH CPVI+GEKW ATKWI
Sbjct: 241 -GLKIKPLKGDGLLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWI 286
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/211 (38%), Positives = 114/211 (54%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P A + +GFL+ EC LI LA+ +L RS V D ++G + ++ R+S G F G+ +
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSSDGMFFRLGETPL 160
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I+ IE +IA T P ENGE +Q+L YE G + PH DY ++ +I R G R+ T
Sbjct: 161 ISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQRVGT 220
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+LMYL+DV GGET+FP G +V PRRG A F + +
Sbjct: 221 LLMYLNDVESGGETLFPQV---------------------GCSVVPRRGQAFYFEYGNGS 259
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP SLH+ P+ G+KW ATKWI F
Sbjct: 260 GRSDPASLHASSPIGSGDKWVATKWIRTRRF 290
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/210 (40%), Positives = 117/210 (55%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ G L+D ECD L+ LA+ +L RS DN +G S+++ RTS G F +G+ +I
Sbjct: 92 PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGSEVNAARTSDGMFFERGEKPLI 151
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKVNIV-RGGHRLATV 156
IE +IA P E GE +QVLRY G +Y+PH+D+F NI+ RGG R+ TV
Sbjct: 152 ERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTV 211
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ A GG T FP G+ V+P +G+A +FFS +
Sbjct: 212 VMYLNTPAGGGATTFPEV---------------------GLEVQPVKGNA-VFFS-YERP 248
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ +LH G PV++GEKW ATKW+ F
Sbjct: 249 LASTRTLHGGAPVLDGEKWVATKWMREGVF 278
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 89/212 (41%), Positives = 121/212 (57%), Gaps = 12/212 (5%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
++ +S PRA++ FL+ +CDH+I +A+ +L S +A K D ++ P
Sbjct: 131 MQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLA------FKAGDTAENTRDEDP 184
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
D ++A IEDK+A T +P +GE VLRYE Q Y+ HYD FS++ + R+A
Sbjct: 185 ---DGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSHYDSFSEEEYGPQFSQRIA 241
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TVL+YL+DV +GGETVF + R D C GI VKPR+GDALLFFS+
Sbjct: 242 TVLLYLADVEEGGETVFLLEGKGGLARL--ERIDYKAC-DTGIKVKPRQGDALLFFSVSV 298
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
N D SLH GCPV+ G KW+ TKWI F
Sbjct: 299 NGTLDKHSLHGGCPVVAGTKWAMTKWIRNRCF 330
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 114/211 (54%), Gaps = 28/211 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ G L+D ECD ++ LA ++L RS D +G S+++ RTS G F +G+ +
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMFFTRGEHPVC 161
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
A E +IA P ENGE +QVL Y G +Y+PHYDYF + RGG R+AT+
Sbjct: 162 ARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRVATL 221
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+ YL+ +GG T FP+ G+ V P +G A +FFS +
Sbjct: 222 VTYLNTPTRGGGTTFPDI---------------------GLEVTPLKGHA-VFFS-YDRP 258
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
P SLH G PV+EG+KW ATKW+ V FD
Sbjct: 259 HPSTRSLHGGAPVLEGDKWVATKWLRVGRFD 289
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 80/206 (38%), Positives = 116/206 (56%), Gaps = 26/206 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V++ L+ ECD LI A+ +LKRS + SG + +RTS G + + +DA
Sbjct: 115 RPQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQRCEDAF 174
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +I+ P E+GE +Q+L Y G +Y PH+DYF ++ RGG R+AT
Sbjct: 175 IERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGGQRVAT 234
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YLSDVA GGETVFPNA G+AV R+G A+ F L+ +
Sbjct: 235 LIVYLSDVAGGGETVFPNA---------------------GLAVMARQGGAIYFRYLNGH 273
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
DP++LH G PV GEKW TKW+
Sbjct: 274 RQLDPLTLHGGAPVTNGEKWIMTKWM 299
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 115/210 (54%), Gaps = 6/210 (2%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+SW PR F F T +C+ +I++AK +LK S +A ++ + S + +
Sbjct: 71 LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSLHQHTDEDE 130
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
++A IE+KIA T PK+ E +LRY+ GQKY+ HYD F R+ T L
Sbjct: 131 SGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQRVVTFL 190
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
++LS V +GGET+FP R D +C G+ VKPR+GDA+ F++L N
Sbjct: 191 LFLSSVEEGGETMFPFENG----RNMNGRYDYEKCV--GLKVKPRQGDAIFFYNLFPNGT 244
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
D SLH CPVI+GEKW ATKWI ++D
Sbjct: 245 IDQTSLHGSCPVIKGEKWVATKWIRDQTYD 274
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/211 (37%), Positives = 115/211 (54%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + L+ ECD LI + + +++RS+V D SG L D R S G F+ D +
Sbjct: 90 EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGSTDPL 149
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-----KVNIVRGGHRLAT 155
+A I+ +IA P ENGED+ +LRY G +Y PH+DYF + K ++ RGG R+AT
Sbjct: 150 VATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRIAT 209
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+ V +GG+T FP+ G+ + PRRG AL F ++
Sbjct: 210 LILYLNQVEEGGDTTFPDI---------------------GLTIHPRRGAALYFEYVNAL 248
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP +LH+G PV GEKW ATKW+ F
Sbjct: 249 GQTDPRTLHAGMPVERGEKWIATKWMRRGRF 279
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/227 (39%), Positives = 120/227 (52%), Gaps = 18/227 (7%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
S S I P ++ +SW PR V+ GF+ +H++ LA + S +A + +
Sbjct: 7 SGSDVTYIIPFQI--LSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQVES 64
Query: 83 SD-VRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY 139
S RTS+GTF+ G D ++ +E +IA T LP +NGE VL YEH Q Y+ H D
Sbjct: 65 SQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSHMDS 124
Query: 140 FSDKVNIVRGGHRLATVLMYLSDVAKGGETVFP-----NAEEPPRRRTPATNDDLSECAK 194
F K + R+ATVL+YLS+V +GGETVF A+ P + D C
Sbjct: 125 FDPKDFGPQPSQRIATVLLYLSEVLEGGETVFKKEGVDGADRPIQ--------DWRNCDD 176
Query: 195 KGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
PR GDA+LF+ N DP SLH GCPV +GEKW ATKWI
Sbjct: 177 GSFKYAPRMGDAVLFWGTRPNGEIDPHSLHGGCPVKKGEKWVATKWI 223
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 72/141 (51%), Positives = 96/141 (68%), Gaps = 2/141 (1%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI--PK 95
ISW PR V+ FL+ ECD+L+ +A+ +L+ S V D +G+ SDVRTSSG F+ +
Sbjct: 60 ISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNSEE 119
Query: 96 GKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLAT 155
K ++ IE +I+ ++ +PKENGE IQVLRYE Q Y PH+DYFSD N+ RGG R+AT
Sbjct: 120 RKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQRVAT 179
Query: 156 VLMYLSDVAKGGETVFPNAEE 176
+LMYL+D GGET FP E
Sbjct: 180 MLMYLTDGVVGGETHFPQEME 200
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/218 (38%), Positives = 116/218 (53%), Gaps = 28/218 (12%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V Q + PR V+ L+D EC+ LI LAK +L RS +G ++++ RTSSG F
Sbjct: 92 VLQAMYNPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQ 151
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRG 149
+G++ ++A IE +IA P ENGE +QVL Y G +Y+PHYDYF + RG
Sbjct: 152 RGENELVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRG 211
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
G R+ T++MYL + KGG T FP+ + V P+RG +F
Sbjct: 212 GQRVGTLVMYLGEPEKGGGTTFPDVH---------------------LEVAPKRGHG-VF 249
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
FS + P +LH G PV+ GEKW ATKW+ F+
Sbjct: 250 FS-YERPHPSTRTLHGGAPVLAGEKWIATKWLRERRFE 286
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 80/206 (38%), Positives = 114/206 (55%), Gaps = 26/206 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L ECD +I + +L++S + +G ++ RTS GT+ G+DA+
Sbjct: 95 RPQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQNGEDAL 154
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLAT 155
I IE ++A P ENGE +QVLRY G +Y HYDYF ++ GG R+AT
Sbjct: 155 IRRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTGGQRVAT 214
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DV GGETVFP A GI+V PRRGDA+ F ++
Sbjct: 215 LIVYLNDVPSGGETVFPEA---------------------GISVVPRRGDAVYFRYMNRL 253
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
DP +LH+G PV +GEKW TKW+
Sbjct: 254 RQLDPATLHAGAPVRDGEKWIMTKWV 279
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 116/210 (55%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ G L+D ECD +++LA+ +L RS N SG S+++ RTS G F +G+ +
Sbjct: 108 PRVMVFGGLLSDEECDAMVDLARPRLARSETVHNGSGGSEVNAARTSDGMFFDRGEFPLC 167
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ENGE +QVLRY G +Y+ H+DYF + RGG R+ TV
Sbjct: 168 RTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILKRGGQRVGTV 227
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+ G+ V P +G+A +FFS + A
Sbjct: 228 VMYLNHPIRGGGTAFPDV---------------------GLEVAPFKGNA-VFFS-YDRA 264
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH+G PV+EGEKW ATKW+ F
Sbjct: 265 HPMTRTLHAGTPVLEGEKWVATKWVREGEF 294
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 98/276 (35%), Positives = 138/276 (50%), Gaps = 44/276 (15%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL--SDVRTSSGTF 92
++ +S PR F+ LT+ ECDHL++LA Q SA G +KL S RT+ +
Sbjct: 75 IETVSVDPRIFIVHNLLTEEECDHLVSLA-LQKGLSASLITPYGTNKLVESTTRTNKQAW 133
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV----NIVR 148
+ +D ++ +EDKIA T E GE++QVL Y Q++ H+DYF N +
Sbjct: 134 LDFQQDDVVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPENYEK 193
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GG+RL TV++YL +GGET F A + + +GDA++
Sbjct: 194 GGNRLITVIVYLQAAEEGGETHFGAAN---------------------LKLTAAKGDAVM 232
Query: 209 FFSL-HTNAIPDPV-----SLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNAS 262
F++L H DP +LH+G P I+GEKW ATKWIH + E G C D +
Sbjct: 233 FYNLKHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHERGYQS--ETSGGCFDKHPK 290
Query: 263 CERWAAL--GECTKNPEYMVGSAQLPGFCRRSCKVC 296
C WA EC NP +M + CRRSCK+C
Sbjct: 291 CTYWAGKTPTECKLNPVWMSKN------CRRSCKIC 320
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/220 (39%), Positives = 118/220 (53%), Gaps = 28/220 (12%)
Query: 34 KVKQIS--WKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
KV+ +S +P A FL+ EC+ LI LA+ +L RSAV D ++G ++ R+S G
Sbjct: 92 KVRVLSRLQRPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATHRSSHGM 151
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNI 146
F G+ +IA IE +IA T P ENGE +Q+L YE G + PH DY +++ +I
Sbjct: 152 FFRLGETPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESI 211
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
R G R+ T+LMYL DV GGETVFP G ++ P+RG A
Sbjct: 212 ARSGQRMGTLLMYLKDVEGGGETVFPQV---------------------GWSIVPQRGHA 250
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
L F + + DP SLH+ P+ G+KW ATKWI F
Sbjct: 251 LYFEYGNRYGMCDPSSLHASTPLRTGDKWVATKWIRTRRF 290
>gi|297600382|ref|NP_001049073.2| Os03g0166200 [Oryza sativa Japonica Group]
gi|255674232|dbj|BAF10987.2| Os03g0166200, partial [Oryza sativa Japonica Group]
Length = 135
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 69/121 (57%), Positives = 86/121 (71%), Gaps = 1/121 (0%)
Query: 177 PPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWS 236
P R + ++ S+CA++G AVKP +G A+LFFSL+ NA DP SLH CPVI+GEKWS
Sbjct: 13 PQARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATFDPGSLHGSCPVIQGEKWS 72
Query: 237 ATKWIHVDSFDKIVEEGGD-CTDNNASCERWAALGECTKNPEYMVGSAQLPGFCRRSCKV 295
ATKWIHV S+D+ D C D +A C WAA GEC KNP YMVG+++ PGFCR+SC V
Sbjct: 73 ATKWIHVRSYDENGRRSSDKCEDQHALCSSWAAAGECAKNPGYMVGTSESPGFCRKSCNV 132
Query: 296 C 296
C
Sbjct: 133 C 133
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ +I
Sbjct: 97 PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 217 VIYLNSVPAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 256 TLDDKTLHAGLPVERGEKWIATKWL 280
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 83/213 (38%), Positives = 115/213 (53%), Gaps = 32/213 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L ECD LI LA+ Q+KRS V D +G+ + RTS G F +G + +
Sbjct: 96 PRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDTGQDQQHQARTSEGMFFGRGANPLC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
A +E +IA P ENGE +QVLRY G +YEPHYDYF +V + RGG R+A++
Sbjct: 156 ARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYFDPARPGAEVALRRGGQRVASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ +GG T FP+A + V P +G+A+ F +
Sbjct: 216 VIYLNTPTQGGATTFPDAH---------------------LEVAPIKGNAVYF----SYD 250
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWIHVDSFD 247
P P+ +LH G PV+EGEKW ATKW+ D
Sbjct: 251 RPHPMTGTLHGGAPVVEGEKWVATKWLRERRHD 283
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ +I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 217 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 256 TLDDNTLHAGLPVERGEKWIATKWL 280
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ +I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 217 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 256 TLDDNTLHAGLPVERGEKWIATKWL 280
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ +I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 217 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 256 TLDDNTLHAGLPVERGEKWIATKWL 280
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 86/219 (39%), Positives = 122/219 (55%), Gaps = 21/219 (9%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA--DNLSGESKLSDVRTSSGTF 92
++ +S P EGFL+D EC ++ A+ ++ S V D G SD RTS F
Sbjct: 7 LETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPA-SDFRTSQSAF 65
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS------DK--V 144
I DAI+ I+ + A+ +P+ + ED+QVLRY+ +KY+ H DYF DK +
Sbjct: 66 IRAHDDAILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTKDKRTL 125
Query: 145 NIVRGGH--RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
++R GH R+ATV YLSDV KGGETVFP R A + +C K G+ VKP
Sbjct: 126 ALIRNGHRNRMATVFWYLSDVEKGGETVFP-------RFNGAQETSMKDC-KTGLKVKPE 177
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G ++F+S+ + D SLH CPV +G KW+A KW+
Sbjct: 178 KGKVIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 86/205 (41%), Positives = 112/205 (54%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V G L+D ECD L+ L++ +L+RS D +G S++ RTS GTF +G +
Sbjct: 102 PRVVVLGGLLSDEECDALVELSRPRLRRSTTVDAQTGGSQVHADRTSRGTFFERGAHPVC 161
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-----KVNIVRGGHRLATV 156
A IE +IA P ENGE +QVL Y G ++ PHYDYF +V + +GG R+ATV
Sbjct: 162 ATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLLRQGGQRVATV 221
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ A+GG T FP+A L A KG AV FFS +
Sbjct: 222 VMYLNTPARGGATTFPDAH-------------LEVAAVKGNAV---------FFS-YDRP 258
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P +LH G PV EGEKW ATKW+
Sbjct: 259 HPMTRTLHGGAPVTEGEKWIATKWL 283
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ ++
Sbjct: 97 PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLV 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 217 VIYLNSVPAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 256 TLDDNTLHAGLPVERGEKWIATKWL 280
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 115/210 (54%), Gaps = 26/210 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V + L D ECD LI + +L+RS D ++G ++ R+S GTF P D I
Sbjct: 97 PTIAVLDQVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADDFI 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A ++ +IA P ENGE +QVL Y G +Y+PH+DYFS + +V GG R++T+
Sbjct: 157 ARLDRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
L+YL+DVA+GG TVFP G+ V PR+G A+ F + +
Sbjct: 217 LIYLNDVAQGGATVFPTL---------------------GLRVLPRKGMAVYFEYSNRDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP++LH G PV +GEKW TKW+ S+
Sbjct: 256 QVDPLTLHGGEPVEKGEKWIITKWMRQRSY 285
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/148 (47%), Positives = 97/148 (65%), Gaps = 14/148 (9%)
Query: 97 KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATV 156
K ++ IE +I+ ++ +P ENGE +QVLRYE Q Y+PH+DYF+D N+ RGG R+AT+
Sbjct: 9 KYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRGGQRIATM 68
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK---GIAVKPRRGDALLFFSLH 213
LMYLSD +GGET FPN C K G++VKP +G+A+LF+S+
Sbjct: 69 LMYLSDNVEGGETYFPN-----------IGSGQCSCGGKTVEGLSVKPTKGNAVLFWSMG 117
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ DP+S+H GC V+ GEKWSATKW+
Sbjct: 118 LDGQSDPLSVHGGCEVLAGEKWSATKWM 145
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ ++
Sbjct: 91 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 150
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 151 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEARQLEVGGQRVATL 210
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 211 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 249
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
+ D +LH+G PV GEKW ATKW+
Sbjct: 250 MLDDNTLHAGLPVERGEKWIATKWL 274
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/224 (39%), Positives = 116/224 (51%), Gaps = 10/224 (4%)
Query: 22 KSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES- 80
+S S + P ++ +SW PR V+ GF+ +++I LA + S +A GE+
Sbjct: 6 ESGSDNVYVIPFQI--LSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYR-PGETV 62
Query: 81 -KLSDVRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHY 137
RTS+GTF+ D ++ +E +IA T LP ENGE VL YE Q Y+ HY
Sbjct: 63 DPSQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSHY 122
Query: 138 DYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGI 197
D F K + R+ATVL+YLS+V +GGETVF R D C
Sbjct: 123 DTFDPKEFGPQPSQRIATVLLYLSEVLEGGETVFKREGVDGENRVIG---DWRNCDDGSF 179
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
PR GDA+LF+ N DP +LH GCPV GEKW ATKWI
Sbjct: 180 KYMPRMGDAVLFWGTKPNGDIDPHALHGGCPVKRGEKWVATKWI 223
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 112/205 (54%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ ++
Sbjct: 100 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 159
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 160 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 219
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 220 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 258
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 259 TLDDNTLHAGLPVERGEKWIATKWL 283
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 112/205 (54%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ ++
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 217 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 256 TLDDNTLHAGLPVERGEKWIATKWL 280
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 112/205 (54%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ ++
Sbjct: 88 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 147
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 148 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 207
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 208 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 246
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 247 TLDDNTLHAGLPVERGEKWIATKWL 271
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 112/205 (54%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ ++
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 217 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 255
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 256 TLDDNTLHAGLPVERGEKWIATKWL 280
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 117/211 (55%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ EC LI ++ +LKRS + L+G + RTS G + +G+D +
Sbjct: 96 RPQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRRGEDQL 155
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLAT 155
IA +E +IA+ T P ENGE +QVL Y +Y PH+D+F+ V+ +GG R+AT
Sbjct: 156 IARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQRVAT 215
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DVA GGETVFP A G++V + G A+ F ++
Sbjct: 216 LIIYLNDVADGGETVFPTA---------------------GLSVAAQAGGAVYFRYMNAE 254
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP +LH G PV+ G+KW TKW+ ++
Sbjct: 255 RQLDPSTLHGGAPVLAGDKWIMTKWMRERAY 285
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 116/210 (55%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ G L++ ECD L+ LA+ +L RS DN +G S+++ RTS G F +G+ +I
Sbjct: 37 PRVVVFGGLLSEQECDELVALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFERGETPLI 96
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKVNIV-RGGHRLATV 156
IE +IA P E GE +QVL Y G +Y+PH+D+F NI+ RGG R+ TV
Sbjct: 97 ERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTV 156
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ A GG T FP G+ V+P +G+A +FFS +
Sbjct: 157 VIYLNTPAGGGATTFPEV---------------------GLEVQPIKGNA-VFFS-YERP 193
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ +LH G PV++GEKW ATKW+ F
Sbjct: 194 LASTRTLHGGAPVLDGEKWVATKWLREGVF 223
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 111/211 (52%), Gaps = 28/211 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V L+ ECD +I AK +L RS +G +L+ RTSSG F +G+ +
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADRTSSGMFFTRGQTPEV 168
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
+E +IA P ENGE +QVL Y G +Y+PHYDYF K + RGG R+AT+
Sbjct: 169 TAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRVATL 228
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL++ A+GG T FP+ G+ V P +G A +FFS +
Sbjct: 229 VMYLNEPARGGGTTFPDV---------------------GLEVAPVKGSA-VFFS-YDRP 265
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
P SLH G PV+EGEKW ATKW+ F
Sbjct: 266 HPTTRSLHGGAPVLEGEKWVATKWLREREFQ 296
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 112/205 (54%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI L + +LKRS V + +GE L RTS G G+ ++
Sbjct: 96 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE QVL Y G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 156 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLDVGGQRVATL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 216 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 254
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 255 TLDDNTLHAGLPVERGEKWIATKWL 279
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/209 (39%), Positives = 115/209 (55%), Gaps = 22/209 (10%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
KP + FL++ ECD LI +++ +LK S V D +GE K + RTS G ++
Sbjct: 108 KPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTSKGMSFYLQENEF 167
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVRGGHRLATVLMY 159
I +E +IA P ENGE +QVL Y G++Y+ H+DYF KV +GG R+ T L+Y
Sbjct: 168 IKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGTFLIY 227
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV GGETVFP A G+++ P++G A+ F ++ D
Sbjct: 228 LNDVPAGGETVFPKA---------------------GVSIVPKKGSAVYFQYGNSKGEVD 266
Query: 220 PVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
+SLHS PV EGEKW ATKWI ++ K
Sbjct: 267 RMSLHSSIPVSEGEKWVATKWIRQENIYK 295
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 117/211 (55%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ ECD +I ++ +LKRS + D +G+ + RTS G + +G+DA
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAF 169
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +IA+ P ENGE +Q+L Y +Y PH+DYF V+ RGG R+AT
Sbjct: 170 IERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 229
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DVA GGET+FP A G++V ++G A+ F ++
Sbjct: 230 LVVYLNDVADGGETIFPAA---------------------GLSVAAKQGGAVYFRYMNGQ 268
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP++LH G PV G+KW TKW+ ++
Sbjct: 269 RQLDPLTLHGGAPVHAGDKWIMTKWMRERAY 299
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 85/220 (38%), Positives = 118/220 (53%), Gaps = 28/220 (12%)
Query: 34 KVKQIS--WKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
KV+ +S +P A FL+ EC+ LI LA+ +L RS V D ++G + ++ R+S G
Sbjct: 92 KVRVLSRLQRPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNVVAGHRSSHGM 151
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNI 146
F G+ +I IE +IA T P ENGE +Q+L YE G + PH DY +++ +I
Sbjct: 152 FFRLGETPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRESI 211
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
R G R+ T+LMYL DV GGETVFP + G +V P+RG A
Sbjct: 212 ARSGQRMGTLLMYLKDVEGGGETVFP---------------------QIGWSVAPQRGHA 250
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
L F + + DP SLH+ P+ G+KW ATKWI F
Sbjct: 251 LYFEYGNRFGLCDPSSLHASTPLRVGDKWVATKWIRTRRF 290
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 117/211 (55%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ ECD +I ++ +LKRS + D +G+ + RTS G + +G+DA
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAF 169
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +IA+ P ENGE +Q+L Y +Y PH+DYF V+ RGG R+AT
Sbjct: 170 IERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 229
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DVA GGET+FP A G++V ++G A+ F ++
Sbjct: 230 LVVYLNDVADGGETIFPAA---------------------GLSVAAKQGGAVYFRYMNGQ 268
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP++LH G PV G+KW TKW+ ++
Sbjct: 269 RQLDPLTLHGGAPVRAGDKWIMTKWMRERAY 299
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 118/212 (55%), Gaps = 10/212 (4%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGTFIPK 95
+SW PR F F T +C+ +I++AK +LK S +A GE+ +VRT +
Sbjct: 69 LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALR-KGETAETTQNVRTRLKK-TDE 126
Query: 96 GKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLAT 155
+ I+A IE+KIA T +P + E +LRY+ GQKY+ HYD F + R+ T
Sbjct: 127 DESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYGPQISQRVVT 186
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++LS V +GGET+FP R D C G+ VKPR+GDA+ F++L N
Sbjct: 187 FILFLSSVEEGGETMFPFENG----RNMNGRYDYETCI--GLRVKPRQGDAIFFYNLLPN 240
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
D SLH CPVI+GEKW ATKWI ++D
Sbjct: 241 RTIDQTSLHGSCPVIKGEKWVATKWIRDQTYD 272
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 88/243 (36%), Positives = 136/243 (55%), Gaps = 28/243 (11%)
Query: 20 IRKSFSSTAIINPS---------KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRS 70
+ + F T ++PS ++ +S +P EGFL+D ECD++ +A Q+K S
Sbjct: 237 VHEGFRRTVELDPSASSKSQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYS 296
Query: 71 AVADNLSGESK-LSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEH 129
+V+ + + K S+ RTS F+ D ++ I+ ++A+ T +P+ + E +QVLRY
Sbjct: 297 SVSLKDADKGKDSSEWRTSQSAFLSARDDEVLTEIDHRVASLTRIPRNHQEYVQVLRYGA 356
Query: 130 GQKYEPHYDYF------SDK--VNIVRGG--HRLATVLMYLSDVAKGGETVFPNAEEPPR 179
G+KY+ H+DYF SDK + ++ G +R ATV YL+DV GGET+FP P
Sbjct: 357 GEKYDSHHDYFDPSAYRSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGAP- 415
Query: 180 RRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGE-KWSAT 238
P ++ D S G+ VKP++G ++F+SL + DP SLH CPV E KW+A
Sbjct: 416 --APRSHKDCS----IGLKVKPQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAAN 469
Query: 239 KWI 241
KWI
Sbjct: 470 KWI 472
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/211 (34%), Positives = 116/211 (54%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ ECD +I ++ +LKRS + D +G+ + RTS G + +G+DA
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQRGEDAF 176
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +IA+ P ENGE +Q+L Y +Y PH+DYF V+ RGG R+AT
Sbjct: 177 IERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQRVAT 236
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DV GGET+FP A G++V ++G A+ F ++
Sbjct: 237 LVIYLNDVPDGGETIFPEA---------------------GLSVAAKQGGAVYFRYMNGQ 275
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP++LH G PV G+KW TKW+ ++
Sbjct: 276 RQLDPLTLHGGAPVRAGDKWIMTKWMRERAY 306
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/211 (38%), Positives = 112/211 (53%), Gaps = 28/211 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
KPR V+ L+ ECD LI A ++ RS +G +++D RTS G F +G++ +
Sbjct: 111 KPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGGEEVNDDRTSDGMFFQRGENPV 170
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
+ IE++IA P ENGE +QVL Y G +Y+PHYDYF + RGG R+ T
Sbjct: 171 VQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDPGEPGTPTILKRGGQRVGT 230
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
++MYL+ KGG T FP+ + V P+RG+A +FFS +
Sbjct: 231 LVMYLNTPEKGGGTTFPDVH---------------------VEVAPQRGNA-VFFS-YER 267
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
A P +LH G PVI GEKW ATKW+ F
Sbjct: 268 AHPATRTLHGGAPVIAGEKWIATKWLREREF 298
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 77/205 (37%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI + +++LKRS V + +GE L RTS G G+ +I
Sbjct: 96 PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA +P E+GE QVL Y+ G +Y+PH+D+F+ + + GG R+AT+
Sbjct: 156 AKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 216 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 254
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 255 TLDEDTLHAGLPVERGEKWIATKWL 279
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/205 (37%), Positives = 113/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ FL+D ECD LI + +++LKRS V + +GE L RTS G G+ +I
Sbjct: 96 PRIVLFQHFLSDAECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA +P E+GE QVL Y+ G +Y+PH+D+F+ + + GG R+AT+
Sbjct: 156 AKIEVRIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ V GG T FP K G+ V P +G+A+ F +
Sbjct: 216 VIYLNSVQAGGATGFP---------------------KLGLEVAPVKGNAVFFVYKRPDG 254
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 255 TLDEDTLHAGLPVERGEKWIATKWL 279
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 117/210 (55%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P +V++ L+ EC+ LI A+S+L RS D +G +L+ RTS G F +G++ +I
Sbjct: 90 PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGGEELNHDRTSHGMFYTRGENEVI 149
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P +NGE +QVLRY G +Y+PHYDYF + RGG R+A++
Sbjct: 150 RRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAAILRRGGQRVASL 209
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL + +GG TVFP+ G+ V+P++G A +FFS + A
Sbjct: 210 IMYLREPGEGGATVFPDI---------------------GLKVRPQQGSA-VFFS-YALA 246
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P ++LH G PV GEKW ATKW+ F
Sbjct: 247 HPASLTLHGGEPVKSGEKWIATKWLREREF 276
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 114/206 (55%), Gaps = 26/206 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ EC +I ++ +LKRS + D +G + RTS G + +G+DA+
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQRGEDAL 182
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +IA+ P ENGE +Q+L Y +Y PH+DYF V+ RGG R+AT
Sbjct: 183 IERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQRVAT 242
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DV GGET+FP A G++V ++G A+ F ++
Sbjct: 243 LVVYLNDVPDGGETIFPEA---------------------GLSVAAQQGGAVYFRYMNGR 281
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
DP++LH G PV+ G+KW TKW+
Sbjct: 282 RQLDPLTLHGGAPVLSGDKWIMTKWV 307
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 119/207 (57%), Gaps = 26/207 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V + F++ EC+ LI ++ +L SA+ D +G+ ++ R+S GT+ +G+ +
Sbjct: 95 RPDIVVVDEFMSGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQRGESPL 154
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLAT 155
I+ ++ +I+ P+++GE IQ+L Y G +Y+PH+DYF + + + + G R+AT
Sbjct: 155 ISRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVAT 214
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
++MYL++V +GGETVFP+ GI++ P+RG A F ++
Sbjct: 215 LVMYLNEVTEGGETVFPDV---------------------GISITPKRGSAAYFAYCNSL 253
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIH 242
DP +LH G PV+ GEKW ATKW+
Sbjct: 254 GQVDPATLHGGAPVLTGEKWIATKWMR 280
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 87/205 (42%), Positives = 112/205 (54%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD LI LA+ +L RS DN +GE + RTS G + G+DA+
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHLVHAARTSDGMCLRVGQDALC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-DKVN----IVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF D V + GG R+A++
Sbjct: 156 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQAGGQRVASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 216 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 252
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 253 HPMTRSLHAGAPVLAGEKWVATKWL 277
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 113/220 (51%), Gaps = 32/220 (14%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V Q PR V+ L+ EC+ LI A+ +L RS + +G L+ RTS G F
Sbjct: 85 QVLQTMRHPRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFF 144
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVR 148
+G++ I+A +E ++A P E GE +Q+LRY G +Y PHYDYF + R
Sbjct: 145 ERGENEIVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPNEPGTPTILKR 204
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GG R+AT++MYL + +GG T FP+ G+ V P RG +
Sbjct: 205 GGQRVATLVMYLQEPEQGGATTFPDV---------------------GLEVAPVRGTG-V 242
Query: 209 FFSLHTNAIPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
FFS PDPV +LH G PV+ GEKW ATKW+ F
Sbjct: 243 FFSYDR---PDPVTRTLHGGAPVLAGEKWVATKWLREREF 279
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 116/212 (54%), Gaps = 28/212 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+PR ++ L+ ECD +I+ A+ ++ RS +G +++D RTS+G F + ++ +
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPV 180
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
+A +E +IA P ENGE +QVL Y G +Y+PHYDYF + RGG R+AT
Sbjct: 181 VARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQRVAT 240
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+D KGG T FP+ + V PRRG+A +FFS +
Sbjct: 241 IVIYLNDPEKGGGTTFPDVH---------------------LEVAPRRGNA-VFFS-YER 277
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
P +LH G PV+ G+KW ATKW+ F+
Sbjct: 278 PHPSTRTLHGGAPVVAGDKWIATKWLRERRFE 309
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 114/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P+ +++ LTD ECD L+ L++ +L RS V + +G+ L D RTS G + A+I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 164
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P E+GE +Q+L Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 165 ARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 224
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ GG T FP + G+ V P +G+A+ F L +
Sbjct: 225 VIYLNTPEAGGATAFP---------------------RVGLEVAPVKGNAVYFSYLLPDG 263
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 264 TLDERTLHAGLPVASGEKWIATKWL 288
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 79/212 (37%), Positives = 122/212 (57%), Gaps = 13/212 (6%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA--DNLSGESKLSDVRTSSGTF 92
+K +S +P F FL D EC H+ A +K S V+ D+ G+ ++ RTS+ F
Sbjct: 193 MKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPD-TNWRTSTTYF 251
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-VNIVRGG- 150
+P +D ++ GI+ ++ +T +PK + E +QVL+Y+ GQ+Y H+D+ ++ + + GG
Sbjct: 252 MPSTRDPLLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDERTMRNMDGGR 311
Query: 151 -HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
+R+ TV YLSDV +GGET+F PR D S+C G+ VKP G +F
Sbjct: 312 KNRMITVFWYLSDVEEGGETIF------PRYGGRTGRVDFSDCT-TGLKVKPVEGKVAMF 364
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+SL + D SLH CPVI G+KW+A KW+
Sbjct: 365 YSLKPDGQFDDFSLHGACPVITGQKWAANKWV 396
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 112/206 (54%), Gaps = 26/206 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ ++ L+ EC LI A+ +LKRS + +G + +RTS G + + +DA
Sbjct: 95 RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAF 154
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +I+ P E+GE +Q+L Y G +Y PH+DYF ++ RGG R+AT
Sbjct: 155 IERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVAT 214
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YLSDV GGETVFP+A G+AV R+G A+ F ++
Sbjct: 215 LIVYLSDVEGGGETVFPDA---------------------GLAVMARQGGAIYFRYMNGR 253
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
DP++LH G PV G+KW TKW+
Sbjct: 254 RQLDPLTLHGGAPVTSGDKWIMTKWM 279
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 116/212 (54%), Gaps = 28/212 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+PR ++ L+ ECD +I+ A+ ++ RS +G +++D RTS+G F + ++ +
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPM 169
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
+A +E +IA P ENGE +QVL Y G +Y+PHYDYF + RGG R+AT
Sbjct: 170 VAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQRVAT 229
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+D KGG T FP+ + V PRRG+A +FFS +
Sbjct: 230 IVIYLNDPEKGGGTTFPDVH---------------------LEVAPRRGNA-VFFS-YER 266
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
P +LH G PV+ G+KW ATKW+ F+
Sbjct: 267 PHPSTRTLHGGAPVVAGDKWIATKWLRERRFE 298
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 112/206 (54%), Gaps = 26/206 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ ++ L+ EC LI A+ +LKRS + +G + +RTS G + + +DA
Sbjct: 92 RPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAF 151
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +I+ P E+GE +Q+L Y G +Y PH+DYF ++ RGG R+AT
Sbjct: 152 IERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVAT 211
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YLSDV GGETVFP+A G+AV R+G A+ F ++
Sbjct: 212 LIVYLSDVEGGGETVFPDA---------------------GLAVMARQGGAIYFRYMNGR 250
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
DP++LH G PV G+KW TKW+
Sbjct: 251 RQLDPLTLHGGAPVTSGDKWIMTKWM 276
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/210 (36%), Positives = 114/210 (54%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR ++ L+ EC +I A++++ RS SG +++ RTS G F +G++ +
Sbjct: 97 PRVVLFGNLLSPEECQAVIEAARTRMARSLTVQAASGGEEVNKDRTSDGMFFQRGENEAV 156
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN-----IVRGGHRLATV 156
A +E++IA P ENGE +QVL Y G +Y+PHYDYF + RGG R+AT+
Sbjct: 157 ARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPRLLRRGGQRVATL 216
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+D +GG T FP+ + + PR+G+A +FFS + A
Sbjct: 217 VIYLNDPVRGGGTTFPDVP---------------------LEIGPRQGNA-VFFS-YGRA 253
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH G PVIEGEKW ATKW+ F
Sbjct: 254 HPSSRTLHGGAPVIEGEKWIATKWLREREF 283
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 140 bits (353), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 113/211 (53%), Gaps = 28/211 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+PR V+ L+ ECD LI A+ +L RS +G +++D RTS G F +G+ +
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 173
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKVNIV-RGGHRLAT 155
I IE++IA P ENGE +QVL Y G +Y+PHYDYF +IV RGG R+ T
Sbjct: 174 IQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQRVGT 233
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
++MYL+ KGG T FP+ + V P+RG+A +FFS +
Sbjct: 234 LVMYLNTPEKGGGTTFPDVH---------------------LEVAPQRGNA-VFFS-YER 270
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH G PVI GEKW ATKW+ F
Sbjct: 271 PHPSTRTLHGGAPVIAGEKWIATKWLREREF 301
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 140 bits (352), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 112/211 (53%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P ++ FL + ECD L+ +++ L S V + G +L RTS GT +G+ +
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFELKPSRTSGGTHFARGETPL 161
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
IA IE +IA+ +P+ +GE +Q+L Y +Y PHYD+F ++ + GG R+ T
Sbjct: 162 IADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQRVGT 221
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
++MYLSDV GG TVFP + G+ V+P++G AL F + +
Sbjct: 222 LIMYLSDVESGGATVFP---------------------RVGLEVQPQKGAALFFSYVGEH 260
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH G PV+ GEKW ATKW+ +
Sbjct: 261 GKLDLQSLHGGSPVLAGEKWIATKWLRAAEY 291
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 78/210 (37%), Positives = 111/210 (52%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L+ EC+ LI A ++ RS +G +++D RTS G F +G+ ++
Sbjct: 131 PRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVNDDRTSHGMFFQRGESPLV 190
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE++IA+ P ENGE +QVL Y G +Y+PHYDYF I RGG R+ T+
Sbjct: 191 QRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTVIQRGGQRVGTL 250
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A+ I V P+RG+A FFS +
Sbjct: 251 VMYLNTPEQGGGTTFPDAQ---------------------IEVAPQRGNA-AFFS-YERP 287
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH G PV+ G+KW ATKW+ F
Sbjct: 288 TPSTRTLHGGAPVLAGDKWIATKWLREREF 317
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/220 (37%), Positives = 113/220 (51%), Gaps = 32/220 (14%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V Q PR V+ L+ EC+ LI A+ +L RS + +G L+ RTS G F
Sbjct: 85 QVLQTMRHPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFF 144
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVR 148
+G++ I+A +E ++AT P E GE +Q+LRY G +Y PHYDYF + R
Sbjct: 145 ERGENEIVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKR 204
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GG R+AT++MYL + GG T FP+ G+ V P RG +
Sbjct: 205 GGQRVATLVMYLQEPEGGGATTFPDV---------------------GLEVAPVRGCG-V 242
Query: 209 FFSLHTNAIPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
FFS PDPV +LH G PV+ GEKW ATKW+ F
Sbjct: 243 FFSYDR---PDPVTRTLHGGAPVLAGEKWVATKWLREREF 279
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 75/205 (36%), Positives = 114/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ L EC+ LI L++ +L RS V + +G+ L D RTS G G+ +I
Sbjct: 127 PRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVGEHPLI 186
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
+E +IA T +P E+GE +Q+L Y+ G +Y+PHYD+F+ + + GG R+AT+
Sbjct: 187 ERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFNPQRPGEARQLRVGGQRMATL 246
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+DV GG T FP K G+ V P +G+A+ F L +
Sbjct: 247 VIYLNDVPAGGATAFP---------------------KLGLRVNPVQGNAVFFAYLGEDG 285
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV +GEKW ATKW+
Sbjct: 286 SLDERTLHAGLPVEQGEKWIATKWL 310
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 109/205 (53%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD +I LA+ +L RS DN +G + RTS + G+DA+
Sbjct: 96 PRVMVLGGFLSDAECDAMIALAQPRLARSRTVDNANGAHVVHAARTSDSMCLQLGQDALC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ENGE +QVLRY G +Y+PHYDYF V + GG R+A++
Sbjct: 156 QRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRVASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+ L A KG AV FFS +
Sbjct: 216 VMYLNTPDRGGATRFPDVH-------------LDIAAIKGNAV---------FFS-YDRP 252
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 253 HPMTRSLHAGAPVLAGEKWVATKWL 277
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/220 (37%), Positives = 112/220 (50%), Gaps = 32/220 (14%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V Q PR V+ L+ EC+ LI A+ +L RS + +G L+ RTS G F
Sbjct: 85 QVLQTMRHPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFF 144
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVR 148
+G++ I+A +E +IA P E GE +Q+LRY G +Y PHYDYF + R
Sbjct: 145 ERGENEIVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDYFDPSEPGTPTILKR 204
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GG R+AT++MYL + GG T FP+ G+ V P RG +
Sbjct: 205 GGQRVATLVMYLQEPEGGGATTFPDV---------------------GLEVAPARGCG-V 242
Query: 209 FFSLHTNAIPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
FFS PDPV +LH G PV+ GEKW ATKW+ F
Sbjct: 243 FFSYDR---PDPVTRTLHGGAPVLAGEKWVATKWLREREF 279
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 110/205 (53%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD LI LA+ +L RS DN +GE + RTS + G+DA+
Sbjct: 96 PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF + + GG R+A++
Sbjct: 156 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQAGGQRVASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 216 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 252
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 253 HPMTRSLHAGAPVLAGEKWVATKWL 277
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 110/205 (53%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD LI LA+ +L RS DN +GE + RTS + G+DA+
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALC 175
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF + + GG R+A++
Sbjct: 176 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQAGGQRVASL 235
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 236 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 272
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 273 HPMTRSLHAGAPVLAGEKWVATKWL 297
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/211 (37%), Positives = 110/211 (52%), Gaps = 28/211 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+PR V+ L+ ECD LI A ++ RS +G +++D RTS G F +G+ +
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 174
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I IE++IA P ENGE +QVL Y G +Y+PHYDYF + RGG R+ T
Sbjct: 175 IQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQRVGT 234
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
++MYL+ KGG T FP+ + V P+RG+A +FFS +
Sbjct: 235 LVMYLNTPEKGGGTTFPDVH---------------------VEVAPQRGNA-VFFS-YER 271
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH G PV+ GEKW ATKW+ F
Sbjct: 272 PHPSTRTLHGGAPVLAGEKWIATKWLREREF 302
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/220 (36%), Positives = 112/220 (50%), Gaps = 32/220 (14%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+V Q PR V+ ++ EC+ LI A+ +L RS + +G L+ RTS G F
Sbjct: 84 QVLQTMRHPRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSEGMFF 143
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVR 148
+G++ I+A +E +IA P E GE +Q+LRY G +Y PHYDYF + R
Sbjct: 144 ERGENDIVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKR 203
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GG R+AT++MYL + +GG T FP+ G+ V P RG +
Sbjct: 204 GGQRVATLVMYLQEPGQGGATTFPDV---------------------GLEVAPVRGTG-V 241
Query: 209 FFSLHTNAIPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
FFS PDP +LH G PV+ GEKW ATKW+ F
Sbjct: 242 FFSYEE---PDPATRTLHGGAPVLAGEKWVATKWLREREF 278
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/205 (41%), Positives = 111/205 (54%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD LI LA+ +L RS DN +GE + RTS + G+DA+
Sbjct: 26 PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHLVHAARTSDSMCLRVGQDALC 85
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-DKVN----IVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF D V + GG R+A++
Sbjct: 86 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQAGGQRVASL 145
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 146 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 182
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 183 HPMTRSLHAGAPVLAGEKWVATKWL 207
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 110/205 (53%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD LI LA+ +L RS DN +GE + RTS + G+DA+
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 165
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 166 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASL 225
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 226 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 262
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ G+KW ATKW+
Sbjct: 263 HPMTRSLHAGAPVLAGDKWVATKWL 287
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 109/205 (53%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD LI LA+ L RS DN +GE + RTS + G+DA+
Sbjct: 96 PRVVVLGGFLSDEECDALIALARPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 156 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 216 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 252
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ G+KW ATKW+
Sbjct: 253 HPMTRSLHAGAPVLAGDKWVATKWL 277
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/201 (35%), Positives = 117/201 (58%), Gaps = 27/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI L+K ++ RS +A+ + + ++RTSS TFI + ++ I
Sbjct: 38 EPLIVVLGNVLSDEECDELIRLSKDRINRSKIAN-----ANVDNMRTSSSTFIEENENII 92
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
++ IE +I+ +P E GE +Q+L Y+ GQ+Y+ H+D+FS N + R++T++MYL
Sbjct: 93 VSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFSSPHNAINNP-RISTLVMYL 151
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
SDV +GGET FP K +V P++G A+ F + + +
Sbjct: 152 SDVEQGGETYFP---------------------KLHFSVSPQKGMAVYFEYFYNDQTLNE 190
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PVI G+KW+AT+W+
Sbjct: 191 LTLHGGAPVIVGDKWAATQWM 211
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 109/205 (53%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V GFL+D ECD LI LA+ L RS DN +GE + RTS + G+DA+
Sbjct: 96 PRVVVLGGFLSDEECDALIALAQPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 156 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 216 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 252
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ G+KW ATKW+
Sbjct: 253 HPMTRSLHAGAPVLAGDKWVATKWL 277
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 85/222 (38%), Positives = 124/222 (55%), Gaps = 25/222 (11%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFI 93
++ +S KP GFL D ECD+++ A +K S V+ + + + SD RTS TF+
Sbjct: 267 IETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKGRPASDWRTSQSTFV 326
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF------SD--KVN 145
D I+ IE + A+ T +P + E +QVLRY +KY+ H+D+F SD +
Sbjct: 327 AAMGDPILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHHDFFDPSSYRSDPGTLQ 386
Query: 146 IVRGG--HRLATVLMYLSDVAKGGETVFP--NAEEPPRRRTPATNDDLSECAKKGIAVKP 201
++ G +R ATV YL+DVA+GGET FP PPR D S C G+ VKP
Sbjct: 387 LIENGKKNRYATVFWYLTDVARGGETCFPRHGGAPPPR--------DFSMCT--GLKVKP 436
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGE--KWSATKWI 241
++G ++F+SL + DP+SLH CPV+ E KW+A KW+
Sbjct: 437 QKGKVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 72/201 (35%), Positives = 116/201 (57%), Gaps = 30/201 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI L+K +LKRS + + + +D+RTSS TF+ +G+ +
Sbjct: 36 EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNE----NDMRTSSSTFMEEGESEV 91
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ +E +I+ +P ENGE +Q+L Y+ GQ+Y+ H+D+F + N R++T++MYL
Sbjct: 92 VTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNASN-----PRISTLVMYL 146
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K +V P++G A+ F + N +
Sbjct: 147 NDVEEGGETYFP---------------------KLNFSVSPQKGMAVYFEYFYDNQELND 185
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PVI G+KW+AT+W+
Sbjct: 186 LTLHGGAPVIIGDKWAATQWM 206
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/210 (36%), Positives = 113/210 (53%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V+ L+D EC+ L+ +A+ +L RS + +G + + RTS G F +G++ ++
Sbjct: 90 PDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRDRTSQGMFFARGENPLV 149
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
+E +IA P + GE +QVLRY G +Y+PHYDYF + RGG R+AT+
Sbjct: 150 QRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRVATL 209
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL++ +GG TVFP+ G+ V PRRG A +FFS + A
Sbjct: 210 IMYLNEPEQGGATVFPDI---------------------GLQVTPRRGTA-VFFS-YPAA 246
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P ++ H G PV GEKW ATKW+ F
Sbjct: 247 NPASLTRHGGEPVKAGEKWIATKWLREREF 276
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 114/205 (55%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P+ +++ L+D ECD L+ L++ +L RS V + +G+ L D RTS G + A+I
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 163
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
A IE +IA T +P ++GE +Q+L Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 164 ARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 223
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ GG T FP + G+ V P +G+A+ F L +
Sbjct: 224 VIYLNTPEAGGATAFP---------------------RVGLEVAPVKGNAVYFSYLLPDG 262
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 263 TLDDRTLHAGLPVAAGEKWIATKWL 287
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 122/223 (54%), Gaps = 18/223 (8%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAV---ADNLSGESKLSD-VRTSS 89
+VK++S PR FV E FL+ EC+ +I A L S V D +GE K+ D VRTS
Sbjct: 19 EVKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTVLKQGDQSNGEEKVKDEVRTSE 78
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
++ K I+A I ++ +P ED+QVL+Y Q Y HYD+F K+ R
Sbjct: 79 TAWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPKMYPGRW 138
Query: 149 --GGHRLATVLMYLSDVAKGGETVFP----NAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
G +RL TV YL+ V KGGET+FP +AEE + ++ ++ E + I VKP
Sbjct: 139 SSGHNRLVTVFFYLTSVEKGGETIFPFGNTSAEEHHKIQSWGPCENAVESS---IKVKPV 195
Query: 203 RGDALLFFSL----HTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
RG A++F+ + HT+ D SLH GC I GEKW+A WI
Sbjct: 196 RGSAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWI 238
>gi|224107311|ref|XP_002314441.1| predicted protein [Populus trichocarpa]
gi|222863481|gb|EEF00612.1| predicted protein [Populus trichocarpa]
Length = 84
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/85 (77%), Positives = 70/85 (82%), Gaps = 1/85 (1%)
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASCERWAALGE 271
LH A+PD SLH+GCPVIEGEKWSATKWIHVDSFDK VE GG+CTD N SCERWAALGE
Sbjct: 1 LHPTAVPDISSLHAGCPVIEGEKWSATKWIHVDSFDKNVEAGGNCTDQNESCERWAALGE 60
Query: 272 CTKNPEYMVGSAQLPGFCRRSCKVC 296
TKN EY VGS LPG+CR S KVC
Sbjct: 61 RTKNTEYTVGSPDLPGYCRSS-KVC 84
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR + + L D ECD ++ LA+ +L+RS V + +G+ L D RTS G G+ A++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
IE +IA T P E+GE QVL Y+ G +Y+PH+D+F+ K + GG R+AT+
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRVATM 220
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF-FSLHTN 215
++YL+ A GG T FP + G+ V P +G+A+LF + L
Sbjct: 221 VIYLNSPASGGATAFP---------------------RIGLEVAPVKGNAVLFSYGLPDG 259
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
A+ D +LH+G PV GEKW ATKW+
Sbjct: 260 AL-DERTLHAGLPVEAGEKWIATKWL 284
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 106/205 (51%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V G L+D ECD LI LA+ QL RS DN G + RTS + G+DA+
Sbjct: 96 PRVVVLGGLLSDDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P E+GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 156 QRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+ L A KG AV FFS +
Sbjct: 216 VMYLNTPERGGATRFPDVH-------------LDVAAVKGNAV---------FFS-YDRP 252
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P +LH+G PV+ GEKW ATKW+
Sbjct: 253 HPMTRTLHAGAPVLAGEKWVATKWL 277
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR + + L D ECD ++ LA+ +L+RS V + +G+ L D RTS G G+ A++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
IE +IA T P E+GE QVL Y+ G +Y+PH+D+F+ K + GG R+AT+
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRVATM 220
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF-FSLHTN 215
++YL+ A GG T FP + G+ V P +G+A+LF + L
Sbjct: 221 VIYLNSPASGGATAFP---------------------RIGLEVAPVKGNAVLFSYGLPDG 259
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
A+ D +LH+G PV GEKW ATKW+
Sbjct: 260 AL-DERTLHAGLPVEAGEKWIATKWL 284
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 105/205 (51%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V G L D ECD LI LA+ QL RS DN G + RTS + G+DA+
Sbjct: 98 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P E+GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 158 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 217
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+ L A KG AV FFS +
Sbjct: 218 VMYLNTPERGGATRFPDVH-------------LDVAAVKGNAV---------FFS-YDRP 254
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P +LH+G PV+ GEKW ATKW+
Sbjct: 255 HPMTRTLHAGAPVLAGEKWVATKWL 279
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 105/205 (51%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V G L D ECD LI LA+ QL RS DN G + RTS + G+DA+
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P E+GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 178 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 237
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+ L A KG AV FFS +
Sbjct: 238 VMYLNTPERGGATRFPDVH-------------LDVAAVKGNAV---------FFS-YDRP 274
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P +LH+G PV+ GEKW ATKW+
Sbjct: 275 HPMTRTLHAGAPVLAGEKWVATKWL 299
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 111/205 (54%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P +Y+ L+D ECD L+ LA+ +L RS V + +G+ L D RTS G G+ +I
Sbjct: 90 PSIRLYQHLLSDAECDALVELARGRLARSPVINPDTGDENLIDARTSMGAMFQVGEHTLI 149
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
IED+IA +P ++GE +Q+L Y+ G +Y+PH+D+F+ K + GG R AT+
Sbjct: 150 QRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRTATL 209
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ GG T FP + G+ V P +G+A+ F L +
Sbjct: 210 VIYLNTPQAGGATAFP---------------------RIGLEVAPVKGNAVYFSYLQPDG 248
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 249 KLDERTLHAGLPVQSGEKWIATKWL 273
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 119/201 (59%), Gaps = 26/201 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + L+D ECD LI +K +++RS VA++L ++ ++RTSS TF +G++ I
Sbjct: 37 EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVANSL----EVDELRTSSSTFFHEGENEI 92
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+A IE +I+ +P E+GE +Q+L Y+ GQ+Y+ H+D+FS + R++T++MYL
Sbjct: 93 VARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFS-STSRAASNPRISTLVMYL 151
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K +V P++G A+ F + + +
Sbjct: 152 NDVEQGGETYFP---------------------KLNFSVSPQKGMAVYFEYFYNDQNLND 190
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PV+ G+KW+AT+W+
Sbjct: 191 LTLHGGAPVVMGDKWAATQWM 211
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 113/206 (54%), Gaps = 26/206 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V +G L+ ECD LI A ++L+RS + D +G+ + R+S GTF D
Sbjct: 94 QPVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTFFEINADDF 153
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
IA ++ +I+ LP ++GE +Q+L Y G +Y+PH+D+F V + GG R++T
Sbjct: 154 IARLDRRISALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVST 213
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
++MYL++V GG T+FP + G++V P++G A+ F ++
Sbjct: 214 LVMYLNEVEDGGATIFP---------------------ELGLSVLPKKGSAVYFEYTNSR 252
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
DP +LH G PV+ GEKW TKW+
Sbjct: 253 GQLDPRTLHGGAPVLRGEKWIVTKWM 278
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 115/212 (54%), Gaps = 32/212 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L+ ECD +I A+ ++ RS SG +++D RTS+G F +G+ I+
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
+ +E++IA P ++GE +QVL Y G +Y+PH+DYF+ + RGG R+ T+
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL++ +GG T+FP + V PRRG+A +FFS
Sbjct: 227 VIYLNEPERGGATIFPEVP---------------------LQVVPRRGNA-VFFSYER-- 262
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
PDP +LH G PV+ GEKW ATKW+ F
Sbjct: 263 -PDPSTRTLHGGAPVLAGEKWIATKWLREREF 293
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/226 (32%), Positives = 126/226 (55%), Gaps = 30/226 (13%)
Query: 18 LLIRKSFSSTAIINPSKVKQISW--KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADN 75
L I T + +++ IS +P V E L+D EC+ LI L+K +KRS +
Sbjct: 17 LTIFNHIGDTIVTEDREIQIISRVEEPLIVVLENVLSDEECESLIELSKDSMKRSKIG-- 74
Query: 76 LSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEP 135
++ ++RTSSGTF+ + + +A IE ++++ +P E+GE + +L+Y GQ+Y+
Sbjct: 75 --ASREVDNIRTSSGTFLEE--NETVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKA 130
Query: 136 HYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK 195
HYDYF++ +R++T++MYL+DV +GGET FP K
Sbjct: 131 HYDYFAEHSRAAEN-NRISTLVMYLNDVEEGGETFFP---------------------KL 168
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++ P++G A+ F + + + ++LH G PVI+GEKW AT+W+
Sbjct: 169 NLSIAPKKGSAVYFEYFYNDKSLNELTLHGGAPVIKGEKWVATQWM 214
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 115/212 (54%), Gaps = 32/212 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V+ L+ ECD +I A+ ++ RS SG +++D RTS+G F +G+ I+
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
+ +E++IA P ++GE +QVL Y G +Y+PH+DYF+ + RGG R+ T+
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL++ +GG T+FP + V PRRG+A +FFS
Sbjct: 227 VIYLNEPERGGATIFPEVP---------------------LQVVPRRGNA-VFFSYER-- 262
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWIHVDSF 246
PDP +LH G PV+ GEKW ATKW+ F
Sbjct: 263 -PDPSTRTLHGGAPVLAGEKWIATKWLREREF 293
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 79/211 (37%), Positives = 110/211 (52%), Gaps = 28/211 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+PR V+ L+ ECD LI A+ ++ RS +G +++ RTS G F +G+ +
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINADRTSDGMFFQRGQSPL 174
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN-----IVRGGHRLAT 155
I IE++IA P ENGE +QVL Y G +Y+PHYDYF I RGG R+ T
Sbjct: 175 IQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQRVGT 234
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
++MYL+ KGG T FP+ + V P+RG+A +FFS +
Sbjct: 235 LVMYLNTPDKGGGTTFPDVH---------------------LEVAPQRGNA-VFFS-YER 271
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH G PVI G+KW ATKW+ F
Sbjct: 272 PHPSTRTLHGGAPVIAGDKWIATKWLREREF 302
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 122/218 (55%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K ++ + PR +Y L D E + + +A+ +LKR+ V + +GE + +D R S
Sbjct: 31 LIAPLKEEEAFFSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRIS 90
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIV 147
++ + +D ++A + ++ T L E E++QV+ Y G Y+PHYD+ ++++N
Sbjct: 91 KSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAF 150
Query: 148 RG---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+ATVL Y+SDVA+GG TVFP G+A++P +G
Sbjct: 151 KSLGTGNRIATVLFYMSDVAQGGATVFPWL---------------------GVALQPVKG 189
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A ++F+L+ + D + H+ CPV++G KW KW+H
Sbjct: 190 TAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 227
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/211 (33%), Positives = 115/211 (54%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ EC +I ++ +LKRS + +G+ + RTS G + +G+DA
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQRGEDAF 164
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +I++ P ENGE +Q+L Y +Y PH+DYF V+ +GG R+AT
Sbjct: 165 IERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVAT 224
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DV GGET+FP A GI+V R+G A+ F ++
Sbjct: 225 LVIYLNDVPDGGETIFPEA---------------------GISVAARQGGAVYFRYMNGQ 263
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP++LH G PV+ G+KW TKW+ ++
Sbjct: 264 RQLDPLTLHGGAPVLGGDKWIMTKWMRERAY 294
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 122/218 (55%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K ++ + PR +Y L D E + + +A+ +LKR+ V + +GE + +D R S
Sbjct: 334 LIAPLKEEEAFFSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRIS 393
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIV 147
++ + +D ++A + ++ T L E E++QV+ Y G Y+PHYD+ ++++N
Sbjct: 394 KSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAF 453
Query: 148 RG---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+ATVL Y+SDVA+GG TVFP G+A++P +G
Sbjct: 454 KSLGTGNRIATVLFYMSDVAQGGATVFPWL---------------------GVALQPVKG 492
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A ++F+L+ + D + H+ CPV++G KW KW+H
Sbjct: 493 TAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 530
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 121/218 (55%), Gaps = 18/218 (8%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL-SDVRTSSGTFI 93
++ +S P F E FL D E D ++NL+ LK S V E++ +D RTS+ F+
Sbjct: 201 LETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTTYFL 260
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVR 148
P I I+ +++ T +P ++ ED+QVLRYE QKY+ H DYF + +I+
Sbjct: 261 PSDAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYFPVEHHKNAPHILE 320
Query: 149 G-----GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+R+ TV Y+SDVAKGG T+FP A PR + + +C G+ V P++
Sbjct: 321 SIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGAPRPTS------MKDCTT-GLNVPPKK 373
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
++F+S+ N DP+SLH GCPV EG K+S KW+
Sbjct: 374 RKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWV 411
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 122/218 (55%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K ++ + PR +Y L D E + + +A+ +LKR+ V + +GE + +D R S
Sbjct: 316 LIAPLKEEEAFFSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRIS 375
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIV 147
++ + +D ++A + ++ T L E E++QV+ Y G Y+PHYD+ ++++N
Sbjct: 376 KSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAF 435
Query: 148 RG---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+ATVL Y+SDVA+GG TVFP G+A++P +G
Sbjct: 436 KSLGTGNRIATVLFYMSDVAQGGATVFPWL---------------------GVALQPVKG 474
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A ++F+L+ + D + H+ CPV++G KW KW+H
Sbjct: 475 TAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 512
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 108/205 (52%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V FL+D ECD LI LA+ +L RS DN +G + RTS + G+DA+
Sbjct: 96 PRVVVLGDFLSDAECDALIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQLGQDALC 155
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y+PHYDYF V + GG RLA++
Sbjct: 156 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASL 215
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+ L A KG AV FFS +
Sbjct: 216 VMYLNTPERGGATRFPDVH-------------LDVAAVKGNAV---------FFS-YDRP 252
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 253 HPMTRSLHAGAPVLAGEKWVATKWL 277
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 115/209 (55%), Gaps = 22/209 (10%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
KV + KP + L+ ECD LI+L++S+L+ S V D SGE + RTS
Sbjct: 88 KVLSRNEKPFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMAF 147
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV-NIVRGGHR 152
++ ++ IE +IA T P ENGE +Q+L Y G++Y+PH+D+F + + +GG R
Sbjct: 148 RLKENELVERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKGGQR 207
Query: 153 LATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL 212
+ T L+YL+DV GGETVF +K G++ P++G A+ F
Sbjct: 208 VGTFLIYLNDVEDGGETVF---------------------SKAGLSFVPKKGAAIYFHYG 246
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ D +S+HS PV +GEKW+ATKWI
Sbjct: 247 NAQGQLDRLSVHSSVPVRKGEKWAATKWI 275
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 119/202 (58%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+++RS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDELIELSKSKMERSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQLLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 112/205 (54%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P+ +++ LTD ECD L+ L++ +L RS V + +G+ L D RTS G + +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHPLI 164
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
IE +IA T +P E+GE +Q+L Y+ G +Y+PH+DYF+ + + GG R+AT+
Sbjct: 165 TRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 224
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ GG T FP + G+ V P +G+A+ F L +
Sbjct: 225 VIYLNTPEAGGATAFP---------------------RVGLEVAPVKGNAVYFSYLLPDG 263
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
D +LH+G PV GEKW ATKW+
Sbjct: 264 ALDERTLHAGLPVAFGEKWIATKWL 288
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/143 (48%), Positives = 94/143 (65%), Gaps = 2/143 (1%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ++W PR + FL+ ECD+L LA +L S V D +G+ SDVRTSSG F+
Sbjct: 80 KPEVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFL 139
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ K ++ IE +I+ ++ +P ENGE +QVLRYE Q Y+PH+DYFSD N+ RGG
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199
Query: 152 RLATVLMYLSDVAKGGETVFPNA 174
R+AT+LMYLSD + GET FP A
Sbjct: 200 RIATMLMYLSDNIERGETYFPLA 222
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 118/202 (58%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS++KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDELIELSKSKMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 104/205 (50%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V G L D ECD LI LA+ QL RS DN G + RTS + G+DA+
Sbjct: 98 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P E+GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 158 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 217
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T P+ L A KG AV FFS +
Sbjct: 218 VMYLNTPERGGATRVPDVH-------------LDVAAVKGNAV---------FFS-YDRP 254
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P +LH+G PV+ GEKW ATKW+
Sbjct: 255 HPMTRTLHAGAPVLAGEKWVATKWL 279
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 105/207 (50%), Gaps = 32/207 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V G L D ECD LI LA+ QL RS DN G + RTS + G+DA+
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P E+GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 178 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 237
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T P+ L A KG AV FFS
Sbjct: 238 VMYLNTPERGGATRVPDVH-------------LDVAAVKGNAV---------FFSYDR-- 273
Query: 217 IPDPV--SLHSGCPVIEGEKWSATKWI 241
P P+ +LH+G PV+ GEKW ATKW+
Sbjct: 274 -PHPMTRTLHAGAPVLAGEKWVATKWL 299
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 112/209 (53%), Gaps = 26/209 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +++ LTD ECD L+ LA+ +L RS V + +G+ L + RTS G G+ +I
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQVGEHPLI 191
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATV 156
IED IA T + E GE +Q+L Y+ G +Y+PHYD+F+ + + GG R+ T+
Sbjct: 192 ERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRVGTL 251
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+ GG T FP K G+ V P +G+A+ F ++
Sbjct: 252 VIYLNSPLAGGATAFP---------------------KLGLEVAPVKGNAVYFSYRKSDG 290
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDS 245
D +LH+G PV GEKW ATKW++ +
Sbjct: 291 ALDERTLHAGLPVEAGEKWIATKWLNART 319
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 114/201 (56%), Gaps = 27/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI LA ++KRS + + +++RTSS FI ++ I
Sbjct: 32 EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTT----REENELRTSSSMFIEDDENLI 87
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ ++ +I+ +P E+GE +Q+LRY GQ+Y+ H+D+FS I +R++T++MYL
Sbjct: 88 VTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSDSKIT--NNRISTLVMYL 145
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP+ + +V PR+G A+ F +++ +
Sbjct: 146 NDVEQGGETFFPHLK---------------------FSVSPRKGMAVYFEYFYSDQTLND 184
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
+LH G PV+EGEKW AT+W+
Sbjct: 185 FTLHGGAPVVEGEKWVATQWM 205
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 118/202 (58%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+++RS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDELIELSKSKMERSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 114/211 (54%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ EC +I ++ +LKRS + +G+ + RTS G + +G+D
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPF 161
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +I++ P ENGE +Q+LRY +Y PH+DYF V+ +GG R+AT
Sbjct: 162 IERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQRVAT 221
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DV GGET+FP A G++V +G A+ F ++
Sbjct: 222 LVIYLNDVPDGGETIFPEA---------------------GMSVAASQGGAVYFRYMNGR 260
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP++LH G PV+ G+KW TKW+ ++
Sbjct: 261 RQLDPLTLHGGAPVLSGDKWIMTKWMRERAY 291
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 118/202 (58%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K+++KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDELIELSKNKMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 72/219 (32%), Positives = 122/219 (55%), Gaps = 23/219 (10%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
+ S+ + I P K++++ KP+ ++ L+D E + L LAK L+R+ +A+ +G+++
Sbjct: 310 NLSAFSKIGPFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAER 369
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
S R S ++ P + I I ++A T L + E++QV+ Y G +Y+PH+D+F
Sbjct: 370 SKDRVSKSSWFPDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFH- 428
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
++ +R+ATVL Y+SDV+ GG TVFP K G+ ++ R
Sbjct: 429 -WGKLKEVNRIATVLFYMSDVSIGGATVFP---------------------KLGVTLEAR 466
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A +++LH++ D +LH CPV+ GEKW A KWI
Sbjct: 467 KGTAAFWYNLHSSGELDYSTLHGACPVLIGEKWVANKWI 505
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 118/202 (58%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+++RS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDELIELSKSKMERSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 104/182 (57%), Gaps = 16/182 (8%)
Query: 62 LAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENG 119
+AKS+LK S +A L T+ TFI +D + IE KIA T +P+ +G
Sbjct: 1 MAKSKLKPSTLA--------LRKGETTESTFIGGSEDKTGTLDFIERKIAKATMIPQSHG 52
Query: 120 EDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPR 179
E +LRYE GQKY+ HYD F+ + R+A+ L+YLS V +GGET+FP
Sbjct: 53 EAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFE----N 108
Query: 180 RRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATK 239
++ + +C G+ VKPR+GD LLF+SL N D SLH CPVI+GEKW ATK
Sbjct: 109 GSAVSSGFEYKQCV--GLKVKPRQGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATK 166
Query: 240 WI 241
WI
Sbjct: 167 WI 168
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 108/205 (52%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V GFL+ ECD LI LA+ +L RS DN +GE + RTS + G+DA+
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALC 165
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 166 QRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQAGGQRVASL 225
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 226 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 262
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 263 HPMTRSLHAGAPVLAGEKWVATKWL 287
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI L+KS +KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQLLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 21 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDV 75
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTSSG F+ + + A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 76 NDIRTSSGAFLDD--NELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAE 133
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 134 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 171
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F H + + ++LH G PV +GEKW AT+W+
Sbjct: 172 KGMAVYFEYFHQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI L+KS +KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQLLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 108/205 (52%), Gaps = 28/205 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V GFL+ ECD LI LA+ +L RS DN +GE + RTS + G+DA+
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALC 165
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
IE +IA P ++GE +QVLRY G +Y PHYDYF V + GG R+A++
Sbjct: 166 QRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQAGGQRVASL 225
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+ +GG T FP+A L A KG AV FFS +
Sbjct: 226 VMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV---------FFS-YDRP 262
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
P SLH+G PV+ GEKW ATKW+
Sbjct: 263 HPMTRSLHAGAPVLAGEKWVATKWL 287
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/209 (37%), Positives = 110/209 (52%), Gaps = 26/209 (12%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+ +KP V FL+ ECD LI+ A +LK S V D G RTS+ T +G+
Sbjct: 75 VCYKPFVTVINDFLSPEECDALISDADQKLKASRVVDPEDGSFVEHSARTSTSTGYHRGE 134
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHR 152
II IE +IA P ++GE +QVLRYE G +Y PH+D+F S ++ +GG R
Sbjct: 135 IDIIKTIEARIADLINWPVDHGEGLQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQR 194
Query: 153 LATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL 212
+ T LMYLS+V GG T FPN ++P +G AL F +
Sbjct: 195 VGTFLMYLSEVDSGGSTRFPNL---------------------NFEIRPNKGSALYFANT 233
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ A +P++LH+G PV EG K+ ATKW+
Sbjct: 234 NLKAEIEPLTLHAGMPVTEGVKYLATKWL 262
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 117/201 (58%), Gaps = 27/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + L+D EC +LI+ A S+L+RS +A + ++S +RTSSG F + ++ +
Sbjct: 29 EPLIVKFLNVLSDEECQNLIDCASSRLERSKLA-----KKEISSIRTSSGMFFEENENPL 83
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
I+ IE +I++ LP E+ E +QVL YE GQ+++PH+D+F + +R+ T+++YL
Sbjct: 84 ISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGPN-HPSSSNNRICTLVVYL 142
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GG T FPN GI P++G A+ F + + +
Sbjct: 143 NDVEEGGVTTFPNL---------------------GIVNVPKKGTAVYFEYFYNDQKLNE 181
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LHSG PVI+GEKW AT+W+
Sbjct: 182 LTLHSGEPVIQGEKWVATQWM 202
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKMKRSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKMKRSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K+ +KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDKLIELSKNNMKRSKV-----GSSRDVNDIRTSSGAFLEENE-- 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI L+KS +KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQLLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/213 (35%), Positives = 113/213 (53%), Gaps = 33/213 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V +G L++ EC LI LA+ +L+R+ D+ G+ ++ RTS G F G+ ++
Sbjct: 93 PALRVLDGLLSERECADLIELARPRLQRALTVDS-DGKQQIDQRRTSEGMFFRAGETPLV 151
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS------DKVNIVRGGHRLAT 155
A IE ++A +P +GE +Q+L Y GQ+YEPHYD+F DK+ R G R+A+
Sbjct: 152 AAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPALPGYDKLT-ARAGQRIAS 210
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
V+MYL+ +GG T FP G+ V RRG A ++F+
Sbjct: 211 VVMYLNTPERGGGTAFPEI---------------------GLTVTARRG-AAVYFAYEGG 248
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
D SLH+G PV++GEKW AT W+ F +
Sbjct: 249 ---DQSSLHAGLPVLQGEKWIATHWLRERPFGQ 278
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 111/210 (52%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR ++ L+ EC +I+ A+ ++ RS +G +++ RTS G F +G+ ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
+E++IA P +NGE +QVL Y G +Y+PHYDYF + RGG R+AT+
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRVATL 221
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL++ KGG T FP+ + V PR+G+A +FFS +
Sbjct: 222 VIYLNNPRKGGGTTFPDVP---------------------LEVAPRQGNA-VFFS-YERP 258
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH G VIEGEKW ATKW+ F
Sbjct: 259 HPSTRTLHGGASVIEGEKWIATKWLREREF 288
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 21 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDV 75
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTSSG F+ D + A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 76 NDIRTSSGAFLDD--DELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAE 133
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 134 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 171
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 172 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 111/210 (52%), Gaps = 28/210 (13%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR ++ L+ EC +I+ A+ ++ RS +G +++ RTS G F +G+ ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
+E++IA P +NGE +QVL Y G +Y+PHYDYF + RGG R+AT+
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRVATL 221
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL++ KGG T FP+ + V PR+G+A +FFS +
Sbjct: 222 VIYLNNPLKGGGTTFPDVP---------------------LEVAPRQGNA-VFFS-YERP 258
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P +LH G VIEGEKW ATKW+ F
Sbjct: 259 HPSTRTLHGGASVIEGEKWIATKWLREREF 288
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 110/212 (51%), Gaps = 26/212 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V G L+D ECD +I L++ ++K S V D SG S S VR S G+ +G++ ++
Sbjct: 121 PNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFERGENELV 180
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVRGGHRLATV 156
IE +++ LP GE +Q+L Y G +Y+ H D+F K V GG R+ TV
Sbjct: 181 RRIEARLSALVDLPVNRGEPLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGGQRIGTV 240
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
+MYL+DV +GGET FP+ G + KP +G A+ F + +
Sbjct: 241 VMYLNDVPEGGETAFPDI---------------------GFSAKPIKGSAVYFEYQNADG 279
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
D LH+G PVI G+KW TKW+ +++
Sbjct: 280 QLDYRCLHAGMPVIRGDKWIMTKWLRERPYEQ 311
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 118/202 (58%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC+ LI L+K+++KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECEELIELSKNKMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI L+K+ +KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECAELIELSKNNMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQLLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+L RS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+L RS V G S+ ++D+RTSSG F+ +
Sbjct: 54 EPLIVVLGNVLSDEECDELIELSKSKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWV 226
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 113/211 (53%), Gaps = 26/211 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ V+ L+ EC +I ++ +LKRS + +G+ + RTS G + +G+D
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPF 161
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLAT 155
I ++ +I++ P ENGE +Q+L Y +Y PH+DYF V+ +GG R+AT
Sbjct: 162 IERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVAT 221
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+++YL+DV GGET+FP A G++V +G A+ F ++
Sbjct: 222 LVIYLNDVPDGGETIFPEA---------------------GMSVAASQGGAVYFRYMNDR 260
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
DP++LH G PV+ G+KW TKW+ ++
Sbjct: 261 RQLDPLTLHGGAPVLAGDKWIMTKWMRERAY 291
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 123/228 (53%), Gaps = 32/228 (14%)
Query: 16 FSLLIRKSFSST--AIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA 73
S+L +SF S ++ + + Q +P +E L+D EC LI A +LK S +
Sbjct: 5 MSILPVQSFYSLDDGVVEATVLHQ---EPLIVRFERLLSDDECRQLIETAAPRLKESKLV 61
Query: 74 DNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKY 133
+ + +SD+RTS G F + + I IE +IA +P E+ E +QVL Y GQ+Y
Sbjct: 62 NKV-----VSDIRTSRGMFFEEEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEY 116
Query: 134 EPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA 193
+ H+D+F+ R +R++T+++YL+DV +GGETVFP
Sbjct: 117 KAHHDFFAPGSPAARN-NRISTLIVYLNDVEEGGETVFPLL------------------- 156
Query: 194 KKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
GIA+KP+RG AL F + N + ++LHS PV+ GEKW AT+W+
Sbjct: 157 --GIAMKPKRGAALYFEYFYRNQALNDLTLHSSVPVVRGEKWVATQWM 202
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 78/218 (35%), Positives = 122/218 (55%), Gaps = 18/218 (8%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL-SDVRTSSGTFI 93
++ +S P F + FL D E D ++ L+ LK S V E + +D RTS+ F+
Sbjct: 108 LETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTTYFL 167
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK--------VN 145
K + + I+ ++A T +P ++ ED+QVLRYE QKY+ H DYF + +
Sbjct: 168 SSSKHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYFPVEHHKNSPHVLE 227
Query: 146 IVRGGH--RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ G+ R+ TV Y+SDVAKGG T+FP A PR ++ + +C+ G+ V P++
Sbjct: 228 SIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGAPRPQS------MKDCS-TGLKVSPKK 280
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
++F+S+ N DP+SLH GCPV +G K+S KW+
Sbjct: 281 RKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI L+KS +KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQLLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 73/197 (37%), Positives = 109/197 (55%), Gaps = 26/197 (13%)
Query: 55 ECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFL 114
EC+ LI LA+ +L S D LSG + + R+S G F ++A IA ++ +++ L
Sbjct: 112 ECEELIALARPRLAPSTTVDPLSGRDLVGEQRSSLGMFFRLRENAFIARLDQRVSELMNL 171
Query: 115 PKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATVLMYLSDVAKGGET 169
P ENGE +QVL Y G + PH+D+ ++K ++ R G R++T++ YL++V +GGET
Sbjct: 172 PVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSYLNEVEEGGET 231
Query: 170 VFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPV 229
+FP EC G +V PRRG A+ F ++ D SLH+G PV
Sbjct: 232 IFP------------------EC---GWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPV 270
Query: 230 IEGEKWSATKWIHVDSF 246
+ GEKW ATKW+ F
Sbjct: 271 LHGEKWVATKWMRQRRF 287
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 114/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+L RS V G S+ ++D+RTS G F+ +
Sbjct: 51 EPLIVVLGNVLSDEECDELIELSKSKLARSKV-----GSSRDVNDIRTSKGAFLDDNE-- 103
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 104 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 162
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 163 LNDVEEGGETFFP---------------------KLNLSVNPRKGMAVYFEYFYQDQSLN 201
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 202 ELTLHGGAPVTKGEKWIATQWV 223
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI L+K+ +KRS V G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECAELIELSKNNMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ T +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP + ++V PR+G A+ F + + + +
Sbjct: 150 LNDVEEGGETFFP---------------------QLNLSVHPRKGMAVYFEYFYQDQLLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 54 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWV 226
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ K
Sbjct: 54 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLEDNK-- 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWV 226
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 116/201 (57%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D EC+ LI ++K+++KRS + + K +D+RTSSG F+ + + I
Sbjct: 41 EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGIS----RKTNDIRTSSGAFLEESE--I 94
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
IE +IA+ +P +GE +Q+L+Y GQ+Y+ HYD+F + + +R++T++MYL
Sbjct: 95 TTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFVEN-SAAASNNRMSTLVMYL 153
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+ V +GGET FP K ++V P++G A+ F + + +
Sbjct: 154 NHVEEGGETFFP---------------------KLNLSVSPKKGMAVYFEYFYQDESINK 192
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PVI+GEKW AT+W+
Sbjct: 193 LTLHGGAPVIKGEKWVATQWM 213
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 113/201 (56%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI ++K+++KRS V ++D+RTSSG F+ + +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKVG----SARDVNDIRTSSGAFLED--NEL 123
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MYL
Sbjct: 124 TSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMYL 182
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 183 NDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLNE 221
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 222 LTLHGGAPVTKGEKWIATQWV 242
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 114/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+L RS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKLARSKV-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 113/201 (56%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI ++K+++KRS V ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMKRSKVG----SARDVNDIRTSSGAFLED--NEL 91
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MYL
Sbjct: 92 TSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMYL 150
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 151 NDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLNE 189
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 190 LTLHGGAPVTKGEKWIATQWV 210
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 116/201 (57%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D EC+ LI ++K+++KRS + + K +D+RTSSG F+ + + I
Sbjct: 41 EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGVS----RKTNDIRTSSGAFLEESE--I 94
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
IE +IA+ +P +GE +Q+L+Y GQ+Y+ HYD+F + + +R++T++MYL
Sbjct: 95 TTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFVEN-SAAASNNRMSTLVMYL 153
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+ V +GGET FP K ++V P++G A+ F + + +
Sbjct: 154 NHVEEGGETFFP---------------------KLNLSVSPKKGMAVYFEYFYQDESINK 192
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PVI+GEKW AT+W+
Sbjct: 193 LTLHGGAPVIKGEKWVATQWM 213
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 78/219 (35%), Positives = 121/219 (55%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 21 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDV 75
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTSSG F+ + + A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 76 NDIRTSSGAFLDD--NELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAE 133
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 134 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 171
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 172 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 117/201 (58%), Gaps = 27/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + L+D EC +LI+ A S+L+RS +A + ++S +RTSSG F + ++ +
Sbjct: 29 EPLIVKFLNVLSDEECQNLIDCASSRLERSKLA-----KKEISSIRTSSGMFFEENENPL 83
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
I+ IE +I++ LP E+ E +QVL YE GQ+++ H+D+F + +R++T+++YL
Sbjct: 84 ISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGPN-HPSSSNNRISTLVVYL 142
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GG T FPN GI P++G A+ F + + +
Sbjct: 143 NDVEEGGVTTFPNL---------------------GIVNVPKKGTAVYFEYFYNDQKLNE 181
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LHSG PVI+GEKW AT+W+
Sbjct: 182 LTLHSGEPVIQGEKWVATQWM 202
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 78/219 (35%), Positives = 121/219 (55%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 37 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDV 91
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTSSG F+ + + A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 92 NDIRTSSGAFLDD--NELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAE 149
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 150 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 187
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 188 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 78/219 (35%), Positives = 121/219 (55%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 21 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDV 75
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTSSG F+ + + A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 76 NDIRTSSGAFLDD--NELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAE 133
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 134 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 171
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 172 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 210
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 117/207 (56%), Gaps = 30/207 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLEDSE-- 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWIHVDSF 246
++LH G PV +GEKW AT+W+ ++
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWVRRGTY 247
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K+++KRS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDGLIELSKNKIKRSKI-----GSSRDVNDIRTSSGAFLEENE-- 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWVATQWV 210
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 115/217 (52%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K+++ +P +Y ++D E + + + A+ + +R+ V + +GE + ++ R S
Sbjct: 337 IGPLKLEEAYLRPYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFANYRISK 396
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ +D +I I ++ T L E E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 397 SAWLKDAEDEMIRTISQRVEDMTGLTMETAEELQVVNYGIGGHYEPHFDFARREERNAFK 456
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVFP+ +A+ PR+G
Sbjct: 457 SLGTGNRIATVLFYMSDVTQGGATVFPSL---------------------NLALWPRKGT 495
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +F+LH + D + H+ CPV+ G KW + KWIH
Sbjct: 496 AAFWFNLHASGRGDYATRHAACPVLTGTKWVSNKWIH 532
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/219 (35%), Positives = 121/219 (55%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 21 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDV 75
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTSSG F+ + + A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 76 NDIRTSSGAFLDD--NELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAE 133
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 134 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 171
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 172 KGMAVYFEYFYQDRSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLEDSE-- 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSAVN-NRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-VNNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLEDSE-- 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/225 (33%), Positives = 116/225 (51%), Gaps = 33/225 (14%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K++Q S++P + + L+D E + + LAK L RS V L ++S+VRTS
Sbjct: 306 LLMPIKIEQHSFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVSNVRTS 365
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLP----KENGEDIQVLRYEHGQKYEPHYDYF-SDK 143
++P+G ++ + +I T L ++ E +QV Y G Y PH+DY DK
Sbjct: 366 KTAWLPEGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMKDK 425
Query: 144 VNI-------VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+ ++ G R+AT + YL+DV +GG T FP A G
Sbjct: 426 ADFEYMHHRELQAGDRIATFMFYLNDVERGGSTAFPRA---------------------G 464
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+AVKP +G A +F+L + PDP++LH CPV+ G KW + KWI
Sbjct: 465 VAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWI 509
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 114/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+L RS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++ Y
Sbjct: 91 LTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVXY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGXAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/197 (35%), Positives = 110/197 (55%), Gaps = 26/197 (13%)
Query: 55 ECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFL 114
EC+ LI LA+ +L S D L+G ++L R+S G F ++A +A ++++++ L
Sbjct: 112 ECEALIALARPRLAPSTSVDPLTGRNRLGAQRSSLGMFFRLRENAFVARLDERLSELMNL 171
Query: 115 PKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATVLMYLSDVAKGGET 169
P ENGE +QVL Y G + PH+D+ +++ ++ R G R++T++ YL++V +GGET
Sbjct: 172 PVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAYLNEVEEGGET 231
Query: 170 VFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPV 229
VFP + G +V P+RG A+ F ++ D SLH+G PV
Sbjct: 232 VFP---------------------ETGWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPV 270
Query: 230 IEGEKWSATKWIHVDSF 246
+ GEKW ATKW+ F
Sbjct: 271 LSGEKWVATKWMRQRRF 287
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI ++ +L+RS + ++ S ++ +RTSSG F + +
Sbjct: 35 EPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDRS----VNSIRTSSGVFCEQTE--T 88
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
I IE +I+ +P E+G+ +QVLRY GQ+Y+PHYD+F++ + +R++T++MYL
Sbjct: 89 ITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAE-TSRASTNNRISTLVMYL 147
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGETVFP ++V P +G A+ F + N +
Sbjct: 148 NDVEQGGETVFPLLH---------------------LSVFPTKGMAVYFEYFYRNQEVNE 186
Query: 221 VSLHSGCPVIEGEKWSATKWIHVDSF 246
+LH+G VI GEKW AT W+ SF
Sbjct: 187 FTLHAGAQVIHGEKWVATMWMRRQSF 212
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V ++D ECD LI ++K+++KRS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V ++D ECD LI ++K+++KRS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V ++D ECD LI ++K+++KRS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 114/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTS G F+ +
Sbjct: 54 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSKGAFLDD--NE 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWV 226
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I P K++ S KPR +Y +TD E + LA+S+L+RS V ++L+G S+ + R +
Sbjct: 329 FIQPIKMELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTKYRIA 388
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
F+ + I + +I T L E++QV Y G YEPHYD+ + K + +
Sbjct: 389 KAAFLQNSEHDHIVKMTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDH-ARKGEVQK 447
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+AT + Y+SDV GG TVFP +A+ P++G
Sbjct: 448 DFGWGNRIATWMFYMSDVEAGGATVFPQI---------------------NLALWPQKGS 486
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +F+LH N D ++ H+ CPV+ G KW + KWIH
Sbjct: 487 AAFWFNLHPNGEGDDLTQHAACPVLTGSKWVSNKWIH 523
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 116/201 (57%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI+L+K ++ RS +A N + +D+RTS+ F+P+ +
Sbjct: 32 EPFVAVLGNVLSDEECDELISLSKDRMNRSKIAGN-----QENDIRTSTSVFLPEDASEV 86
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ +E +I+ +P E+GE +Q+L Y+ GQ+Y+ H+D+FS K I R++T+++YL
Sbjct: 87 VQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLI--ENPRISTLVLYL 144
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GG+T FPN + ++V P +G A+ F + + + +
Sbjct: 145 NDVEEGGDTYFPNLK---------------------LSVSPHKGMAVYFEYFYDDPMLNE 183
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PV G+KW+AT W+
Sbjct: 184 LTLHGGAPVTIGDKWAATMWM 204
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T+++Y
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVIY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 114/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 54 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDDNE-- 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWV 226
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI ++ +L+RS + ++ S ++ +RTSSG F + +
Sbjct: 40 EPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDGS----VNSIRTSSGVFCEQTET-- 93
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
I IE +I+ +P E+G+ +QVLRY GQ+Y+PHYD+F++ + +R++T++MYL
Sbjct: 94 ITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAE-TSRASTNNRISTLVMYL 152
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGETVFP ++V P +G A+ F ++N +
Sbjct: 153 NDVEQGGETVFPLLH---------------------LSVFPTKGMAVYFEYFYSNQELND 191
Query: 221 VSLHSGCPVIEGEKWSATKWIHVDSF 246
+LH+G VI GEKW AT W+ SF
Sbjct: 192 FTLHAGTQVIHGEKWVATMWMRRQSF 217
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 77/189 (40%), Positives = 106/189 (56%), Gaps = 11/189 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES--KLSDVRTSSGTFIPK 95
+SW+PRA + F T +C +I AK LK SA+A GE+ RTSSGTFI
Sbjct: 30 LSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALR-KGETAENTKGTRTSSGTFISA 88
Query: 96 GKDAIIA--GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+++ A +E KIA T +P+ +GE +LRYE GQKY+ HYD F+ + R+
Sbjct: 89 SEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQRI 148
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH 213
A+ L+YLSDV +GGET+FP D +C G+ VKPR+GD LLF+S+
Sbjct: 149 ASFLLYLSDVEEGGETMFPFENG----SNMGIGYDYKQCI--GLKVKPRKGDGLLFYSVF 202
Query: 214 TNAIPDPVS 222
N D V+
Sbjct: 203 PNGTIDQVN 211
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 54 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDDNE-- 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T+++Y
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVIY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWV 226
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++++RS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDGLIELSKNKIERSKI-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWVATQWV 210
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 114/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTS G F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDVNDIRTSKGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 114/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTS G F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDVNDIRTSKGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQGQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 113/202 (55%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+KS+L RS V G S+ ++D+RTS G F+ +
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKLARSKV-----GSSRDVNDIRTSKGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 112/213 (52%), Gaps = 33/213 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V E L+ EC+ LI LA+ +L+R+ D+ G ++ RTS G F + ++
Sbjct: 95 PALRVLENILSTQECEELIALARPRLQRALTVDS-EGRQQVDRRRTSEGMFFTLNEVPLV 153
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK------VNIVRGGHRLAT 155
IE ++A +P +GE +Q+L Y GQ+YEPH+D+F + + V GG R+A+
Sbjct: 154 GRIEQRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAV-GGQRIAS 212
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
V+MYL+ A+GG T FP G+ V RRG A ++F+
Sbjct: 213 VVMYLNTPARGGGTAFPEL---------------------GLTVTARRGSA-VYFAYEGG 250
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDK 248
DP SLH+G PV++GEKW ATKW+ + +
Sbjct: 251 ---DPSSLHAGLPVLDGEKWIATKWLRERPYKR 280
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 54 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLEDNE-- 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWV 226
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECDELIEISKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWV 242
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI ++K+++KRS V G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVLSDEECGELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 117/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++++RS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLANVLSDEECDGLIELSKNKIERSKI-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWVATQWM 210
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 54 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLEDNE-- 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 205 ELTLHGGAPVTKGEKWIATQWM 226
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/219 (34%), Positives = 119/219 (54%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 21 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDV 75
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTS G F+ + + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 76 NDIRTSKGAFLDD--NELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAE 133
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 134 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 171
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 172 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/219 (34%), Positives = 119/219 (54%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 37 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDV 91
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTS G F+ + + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 92 NDIRTSKGAFLDDNE--LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAE 149
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 150 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 187
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 188 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 109/221 (49%), Gaps = 29/221 (13%)
Query: 29 IINPSKVKQISW---KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
+ P +V ++ + +P+ + L+D ECD +I ++ RS V G S + +
Sbjct: 67 VAAPDRVAEVLFVLKQPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEG 126
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----- 140
RTS FI +G+ + IE ++A P E E Q+ +Y+ Q+Y PHYD+
Sbjct: 127 RTSEMAFIQRGEAEVAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSS 186
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
+ ++ RGG RLAT ++YLSDV +GG TVFP G+ V
Sbjct: 187 GHRSHLARGGQRLATFILYLSDVEQGGGTVFPGL---------------------GLEVY 225
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
P++G AL F + N PD +LH G PV+ G K A KW+
Sbjct: 226 PKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTKIIANKWL 266
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/219 (34%), Positives = 119/219 (54%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D ECD LI L+K++L RS V G S+ +
Sbjct: 37 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDV 91
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTS G F+ + + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 92 NDIRTSKGAFLDD--NELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAE 149
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 150 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 187
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 188 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V ++D ECD LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVISDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 120/221 (54%), Gaps = 26/221 (11%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
SS + P K + ++ P VY +D E LI LAKS++ R+ + D+ GE ++S+
Sbjct: 292 SSFLRLAPLKEEVLNLDPFITVYHDVASDREISKLIELAKSRISRATIRDD--GEPQVSN 349
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTF-LPKENGEDIQVLRYEHGQKYEPHYDYFSDK 143
RTS ++ G D ++ ++ ++ T L +++ E +QV Y G Y H+D+ +
Sbjct: 350 ARTSQNAWLDAGDDRVVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEA 409
Query: 144 VNI--VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
V +R G+R+ATV+ YLSDV GG TVFP + G+AV P
Sbjct: 410 VPYAGLRVGNRIATVMFYLSDVEIGGATVFP---------------------QLGLAVFP 448
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R+G A+L+++L+ N D +LH+ CPV+ G KW A +WIH
Sbjct: 449 RKGSAILWYNLYRNGKGDRRTLHAACPVLSGSKWVANQWIH 489
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 77/219 (35%), Positives = 120/219 (54%), Gaps = 33/219 (15%)
Query: 27 TAIINPSKVKQISWK---PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-L 82
AI+ + QI K P V L+D EC LI L+K++L RS V G S+ +
Sbjct: 21 NAIMTEDREIQIISKFEEPLIVVLGNVLSDEECGELIELSKNKLARSKV-----GSSRDV 75
Query: 83 SDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
+D+RTSSG F+ + + A IE +I++ +P +GE + +L YE Q+Y+ HYDYF++
Sbjct: 76 NDIRTSSGAFLDD--NELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFAE 133
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+R++T++MYL+DV +GGET FP K ++V PR
Sbjct: 134 HSRSA-ANNRISTLVMYLNDVEEGGETFFP---------------------KLNLSVHPR 171
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G A+ F + + + ++LH G PV +GEKW AT+W+
Sbjct: 172 KGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 210
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 114/201 (56%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L+D ECD LI L+K++++RS + + ++D+RTSSG F+ + +
Sbjct: 76 EPLIVVLANVLSDEECDELIELSKNKMERSKIGSS----RNVNDIRTSSGAFLEE--NEF 129
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +I++ T +P +GE + +L Y Q+Y+ HYDYF++ +R++T++MYL
Sbjct: 130 TSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYFAEHSRSA-ANNRISTLVMYL 188
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 189 NDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLNE 227
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 228 LTLHGGAPVTKGEKWIATQWM 248
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 113/202 (55%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D ECD LI L+K++L RS V G S+ ++D+RTSSG F+ +
Sbjct: 54 EPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGAFLDD--NE 106
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 107 LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 165
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 166 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 204
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW T+W+
Sbjct: 205 ELTLHGGAPVTKGEKWITTQWV 226
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 108/212 (50%), Gaps = 24/212 (11%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K++ + P V+ L+ E D+L NLA+ LKR+ V +++G+ VRTS G
Sbjct: 307 PLKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTV--HVNGKYVSRRVRTSKGA 364
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVRGG 150
++ + + + IE ++ T L + E ++ Y G Y HYD+F + K G
Sbjct: 365 WLERDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSETG 424
Query: 151 HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+ATVL YLSDV +GG TVFPN + +AV P RG AL ++
Sbjct: 425 DRIATVLFYLSDVEQGGATVFPNLK---------------------LAVSPERGMALFWY 463
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+L N D +LH GCPV+ G KW T WIH
Sbjct: 464 NLLDNGTGDTRTLHGGCPVLVGSKWVMTLWIH 495
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 67/205 (32%), Positives = 110/205 (53%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V L+D ECD + +++++ RS DN SG ++ D RTS I +G+ +I
Sbjct: 92 PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQRGETELI 151
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV-----NIVRGGHRLATV 156
A I+ ++A + P ++GE +Q+ +Y+ G +Y PH+D+F + ++ + G RLAT+
Sbjct: 152 ARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRLATI 211
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL+DV +GG T FP G+ V P++G AL F +
Sbjct: 212 ILYLTDVEEGGGTSFPGI---------------------GLDVHPQKGGALFFRNTTPYG 250
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
+PD + H+G PV +G K A KW+
Sbjct: 251 VPDRKTQHAGLPVEKGTKIIANKWL 275
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 114/204 (55%), Gaps = 33/204 (16%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + L++ ECD LI+L+K +++RS +++ + D+RTSS F ++ +
Sbjct: 43 EPLIVLLGNVLSEEECDQLISLSKDRIERSKISN-----KSVHDLRTSSSMFFDDAENDV 97
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS---DKVNIVRGGHRLATVL 157
++ +E +++ +P ++GE IQ+L Y GQ+Y+ HYDYFS KVN R++T++
Sbjct: 98 VSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKVN----NPRISTLV 153
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
MYL+DV GGET FP K V P++G A+ F + +
Sbjct: 154 MYLNDVEAGGETYFP---------------------KLNFYVAPKKGMAVYFEYFYNDTT 192
Query: 218 PDPVSLHSGCPVIEGEKWSATKWI 241
+ ++LH G PV+ G+KW+AT+W+
Sbjct: 193 LNELTLHGGAPVVIGDKWAATQWM 216
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 112/215 (52%), Gaps = 23/215 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++ + P V+ L+D E D++ +A+ + +R+ V D +GE + R S
Sbjct: 10 LAPVRMEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPAHYRISK 69
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
++ + A++A + ++A T L E++QV+ Y G Y+PH+D+ + N
Sbjct: 70 SAWLKDEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAFEK 129
Query: 149 -GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+ATVL Y+SDVA+GG TVF + G++V PRRG A+
Sbjct: 130 FNGNRIATVLFYMSDVAQGGATVF---------------------TELGLSVFPRRGSAV 168
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +LH + D + H+ CPV+ G KW KWIH
Sbjct: 169 FWLNLHPSGEGDLATRHAACPVLRGSKWVCNKWIH 203
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 114/201 (56%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V ++D EC+ LI ++K+++KRS + ++D+RTSSG F+ + + +
Sbjct: 38 EPLIVVLGNVISDEECNELIEMSKNKIKRSTIG----SARDVNDIRTSSGAFLEE--NEL 91
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MYL
Sbjct: 92 TSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMYL 150
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 151 NDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLNE 189
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 190 LTLHGGAPVTKGEKWIATQWV 210
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V ++D ECD LI ++K+++KRS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G V +GEKW AT+W+
Sbjct: 189 ELTLHGGASVTKGEKWIATQWV 210
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 117/207 (56%), Gaps = 30/207 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 70 EPLIVVLANVLSDEECGELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 181
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 182 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSIN 220
Query: 220 PVSLHSGCPVIEGEKWSATKWIHVDSF 246
++LH G PV +GEKW AT+W+ ++
Sbjct: 221 ELTLHGGAPVTKGEKWIATQWVRRGTY 247
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 116/202 (57%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V ++D EC LI ++K+++KRS + G S+ ++D+RTSSG F+ + +
Sbjct: 38 EPLIVVLGNVISDEECGELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 115/202 (56%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI ++K++++RS + G S+ ++D+RTSSG F+ +
Sbjct: 38 EPLIVVLANVLSDEECGELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ + IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETYFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSIN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 112/200 (56%), Gaps = 28/200 (14%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P + +++ EC+ LI L+K+++ RS + + ++SD+RTSS TF+P+ D +
Sbjct: 34 PLIAILGNVVSEEECEELIFLSKNKMNRSKIG----SQHEVSDIRTSSSTFLPE--DDLT 87
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLS 161
IE ++A +P E+GE + +L Y+ GQ+Y+ HYDYF K R++T+++YL+
Sbjct: 88 NRIEKRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAANNP-RISTLVLYLN 146
Query: 162 DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPV 221
DV +GGET FP+ +++ P +G A+ F +++ + +
Sbjct: 147 DVEEGGETYFPHM---------------------NLSISPHKGMAVYFEYFYSDPLINER 185
Query: 222 SLHSGCPVIEGEKWSATKWI 241
+LH G PV GEKW+AT W+
Sbjct: 186 TLHGGSPVTSGEKWAATMWV 205
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 114/201 (56%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + + L+ ECD LI+LA ++++R+ + + +S+VRTSS F + ++
Sbjct: 31 EPLILILDNVLSWAECDLLIDLASARMQRAKIGSS----HDVSEVRTSSSMFFEESENEC 86
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
I +E ++A +P + E +QVLRY+ G++Y PH+DYF+ ++ +R++T++MYL
Sbjct: 87 IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQGSSM---NNRISTLVMYL 143
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP+ +V P++G A+ F + + +
Sbjct: 144 NDVEEGGETYFPSLH---------------------FSVTPKKGSAVYFEYFYNDTRLNE 182
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH+G PV GEKW AT+W+
Sbjct: 183 LTLHAGHPVEAGEKWVATQWM 203
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 106/205 (51%), Gaps = 26/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V + F+T EC LI LA+ +++ + V D +GE RTS + + +I
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAEHPLI 150
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATV 156
A +E +IA P ENGE +QVLRY G +Y+ H+DYF + N+ GG R+ T
Sbjct: 151 ARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQRVGTF 210
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
L+YL DV GG T F PA N ++P++G AL F + N
Sbjct: 211 LVYLCDVDAGGATRF-----------PALN----------FEIRPKKGMALFFANTLPNG 249
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
+P++LH+G PV+ G K+ A+KW+
Sbjct: 250 EGNPLTLHAGVPVVSGVKYLASKWL 274
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/225 (33%), Positives = 121/225 (53%), Gaps = 27/225 (12%)
Query: 23 SFSSTAI----INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
S+++TA + P K + +S P +Y +T LE L NL+K +KR A+ N
Sbjct: 362 SYNTTAAPFLRLAPFKTELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQK 421
Query: 79 ESKLSDV-RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHY 137
L D RTS+ ++ ++A++ +E ++ T EN E Q++ Y G Y+PH
Sbjct: 422 LRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHT 481
Query: 138 DYF-SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
D+F + ++ GG R+ATVL YLSDV +GG T+FP +
Sbjct: 482 DHFETPQLEHRGGGDRIATVLFYLSDVPQGGATLFP---------------------RLN 520
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
I+V+PR+GDALL+++L+ + ++H+ CP+I+G KW+ KWI
Sbjct: 521 ISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWALVKWI 565
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 116/217 (53%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P KV++ +P F++ L D E + +A+ + KR+ V + +GE +++ R S
Sbjct: 320 IAPFKVEEAHHRPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQYRISK 379
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + IA + +++ T L E++QV+ Y G YEPH+D+ D+ N +
Sbjct: 380 SAWLKEEEHKHIADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAFK 439
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVFP+ + +++ P++G
Sbjct: 440 SLGTGNRIATVLFYMSDVEQGGATVFPSIQ---------------------VSLWPQKGS 478
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++LH + D ++ H+ CPV+ G KW + KWIH
Sbjct: 479 AAFWYNLHPSGDGDKMTRHAACPVLTGSKWVSNKWIH 515
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 112/202 (55%), Gaps = 30/202 (14%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDA 99
+P V L+D EC LI L+KS+L RS V G S+ ++D+RTS G F+ +
Sbjct: 38 EPLIVVLGNVLSDEECGELIELSKSKLARSKV-----GSSRDVNDIRTSKGAFLDD--NE 90
Query: 100 IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMY 159
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MY
Sbjct: 91 LTTKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMY 149
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
L+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 150 LNDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLN 188
Query: 220 PVSLHSGCPVIEGEKWSATKWI 241
++LH G PV +GEKW AT+W+
Sbjct: 189 ELTLHGGAPVTKGEKWIATQWV 210
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + L++ ECD LI +K +L+RS + + E ++ +RTSSG F + +
Sbjct: 35 EPLIVILGNVLSNEECDELIEHSKERLQRSKIGE----ERSVNQIRTSSGVFCEE--NET 88
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+A IE +I+ +P E+G+ +QVL Y GQ+Y+PH+D+F+D + +R++T++MYL
Sbjct: 89 VAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFAD-TSRASANNRISTLVMYL 147
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP ++V P +G A+ F ++N +
Sbjct: 148 NDVEEGGETTFPML---------------------NLSVFPSKGMAVYFEYFYSNHELNE 186
Query: 221 VSLHSGCPVIEGEKWSATKWIHVDSF 246
+LH+G PV +GEKW AT W+ +F
Sbjct: 187 RTLHAGAPVRKGEKWVATMWMRRQTF 212
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 119/224 (53%), Gaps = 27/224 (12%)
Query: 23 SFSSTAI----INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
S+++TA + P K + +S P +Y +T LE L NL+K +KR A+ N
Sbjct: 299 SYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQK 358
Query: 79 ESKLSDV-RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHY 137
L D RTS+ ++ ++A++ +E ++ T EN E Q++ Y G Y+PH
Sbjct: 359 LRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHT 418
Query: 138 DYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGI 197
D+F + GG R+ATVL YLSDV +GG T+FP + I
Sbjct: 419 DHFETPQH-RGGGDRIATVLFYLSDVPQGGATLFP---------------------RLNI 456
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+V+PR+GDALL+++L+ + ++H+ CP+I+G KW+ KWI
Sbjct: 457 SVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWALVKWI 500
>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
Length = 540
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 117/217 (53%), Gaps = 27/217 (12%)
Query: 30 INPSKVKQISWKPR-AFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P K++Q++ P A+V+E L D E D ++ K +KRS V SG S +++RTS
Sbjct: 324 LAPFKIEQLNLDPYVAYVHE-VLWDSEIDMIMEHGKGNMKRSMVGQ--SGNSTTTEIRTS 380
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
T++ + +A I+ ++ T L E+ E +Q++ Y G +YEPH+D+ D V
Sbjct: 381 QNTWLWYDANPWLAKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDFMEDDGQKVF 440
Query: 149 G--GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G G+RLAT L YL+DVA GG T FP +AV P +G
Sbjct: 441 GWKGNRLATALFYLNDVALGGATAFPFLR---------------------LAVPPVKGSL 479
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L++++LH++ D + H+GCPV++G KW +W HV
Sbjct: 480 LIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHV 516
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 111/219 (50%), Gaps = 26/219 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ +P V+ IS +P +Y L DLE + L LA L+R+ V + +G+ + + R S
Sbjct: 321 LFSPINVEVISLQPYILIYHNLLNDLEVEALKTLAAPMLQRATVHNKDTGKLEYATYRIS 380
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDK 143
++ ++ I I T L E+ E +Q+ Y G YEPH+D+ +D
Sbjct: 381 KSAWLNDDDHPLVRRISTLIEDVTGLTMESAEALQIANYGIGGHYEPHFDHADVRSGTDV 440
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+GG+R+AT+L+YLS V GG TVF +A G+ ++PR+
Sbjct: 441 FKTWKGGNRIATMLIYLSSVELGGATVFSSA---------------------GVRIEPRQ 479
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A +++LH N + ++ H+ CPV+ G KW A KWIH
Sbjct: 480 GSAAFWYNLHRNGNGNNLTRHAACPVLIGSKWIANKWIH 518
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 110/217 (50%), Gaps = 22/217 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K++ + P +Y +TD E H+I AK L+R+ V D ++G+ +D R S
Sbjct: 320 ILKPLKMEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYADYRVS 379
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD---KVN 145
T+I + D I A I ++ T L E +QV Y +YEPH+D+ + K
Sbjct: 380 KNTWIAEDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRPKHF 439
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
GG+R+AT+L+YLSDV GG TVF N G+ P +G
Sbjct: 440 DRWGGNRIATMLLYLSDVDWGGRTVFTN-------------------TAPGVGTDPIKGA 480
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L N +P + H+GCPV+ G+KW A WIH
Sbjct: 481 GVFWYNLLRNGKSNPKTQHAGCPVVLGQKWVANLWIH 517
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 113/201 (56%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V ++D EC+ LI ++K+++KRS + ++D+RTSSG F+ + + +
Sbjct: 38 EPLIVVLGNVISDEECNELIEMSKNKIKRSTIG----SARDVNDIRTSSGAFLEE--NEL 91
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +I++ +P +GE + +L YE Q+Y+ HYDYF++ +R++T++MYL
Sbjct: 92 TSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFAEHSRSA-ANNRISTLVMYL 150
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K ++V PR+G A+ F + + +
Sbjct: 151 NDVEEGGETFFP---------------------KLNLSVHPRKGMAVYFEYFYQDQSLNE 189
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G V +GEKW AT+W+
Sbjct: 190 LTLHGGASVTKGEKWIATQWV 210
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 117/217 (53%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K+++ +P +Y ++D E + + LAK + +R+ V + +GE ++++ R S
Sbjct: 330 IAPLKLEEAHLEPYIVIYHEVMSDAEIEVIKRLAKPRFRRATVQNYKTGELEVANYRISK 389
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + +++ + ++ T L E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 390 SAWLKDEEHSVVRTVGQRVEDMTGLTMTTAEELQVVNYGIGGHYEPHFDFARREEKNAFK 449
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV++GG TVFP+ +A++P++G
Sbjct: 450 SLGTGNRIATVLFYMSDVSQGGATVFPSIR---------------------VALRPKKGT 488
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++LH + D + H+ CPV+ G KW + KWIH
Sbjct: 489 AAFWYNLHASGHGDYATRHAACPVLTGTKWVSNKWIH 525
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 33/206 (16%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V E L+ ECD LI LA+ +L+R+ D+ G ++ RTS G F + ++
Sbjct: 95 PALRVLENILSARECDELIALARPRLQRALTVDS-EGRQQVDRRRTSEGMFFTLDEVPLV 153
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS------DKVNIVRGGHRLAT 155
IE ++A +P +GE +Q+L Y GQ YEPH+D+F + + V GG R+A+
Sbjct: 154 GRIERRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAV-GGQRIAS 212
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
V+MYL+ A+GG T FP G+ V RRG A ++F+
Sbjct: 213 VVMYLNTPARGGGTAFPAL---------------------GLTVTARRGAA-VYFAYEGG 250
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
D SLH+G PV+EGEKW ATKW+
Sbjct: 251 ---DCSSLHAGLPVLEGEKWIATKWL 273
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 107/201 (53%), Gaps = 27/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P +E LTD EC LI A +L+ S + + + +S++RTS G F + ++
Sbjct: 29 EPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKV-----VSEIRTSRGMFFEEEENPF 83
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
I IE +I+ +P E+ E +QVL Y GQ+Y+ HYD+F + +R++T+++YL
Sbjct: 84 IHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN-SPSASNNRISTLIIYL 142
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV GGETVFP + + VKP RG AL F + +
Sbjct: 143 NDVEAGGETVFPLLD---------------------LEVKPERGSALYFEYFYRQQELNN 181
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LHS PV+ GEKW AT+W+
Sbjct: 182 LTLHSSVPVVRGEKWVATQWM 202
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 113/201 (56%), Gaps = 26/201 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L++ ECD LI L+K +++RS + E +++ +RTSSG F + ++ +
Sbjct: 38 EPLVLVLGNVLSNEECDELIQLSKDKMQRSKIG----AEREVNSIRTSSGMFFEESENEL 93
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +++ E E +Q+L+Y Q+Y+ H+DYF+ + +R++T++MYL
Sbjct: 94 VHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASKN-NRISTLVMYL 152
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K G+++ P +G A+ F +++A +
Sbjct: 153 NDVEEGGETYFP---------------------KLGLSISPTKGMAVYFEYFYSDAELND 191
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
+LH G PVI+GEKW AT+W+
Sbjct: 192 RTLHGGAPVIKGEKWVATQWM 212
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 114/201 (56%), Gaps = 26/201 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P + L+D EC+ LI +++ +LKRS + + + + D+RTSS F +G++ +
Sbjct: 38 EPLIVILGNVLSDEECEGLIRMSEDKLKRSKIGNTRT----VDDIRTSSSMFFEEGENEL 93
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+A IE +++ +P E+GE +Q+L Y GQ+Y+ H+D+FS R++T++MYL
Sbjct: 94 VARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSRAASNP-RISTLVMYL 152
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K +V P++G A+ F + N +
Sbjct: 153 NDVEEGGETYFP---------------------KLNFSVNPQKGSAVYFEYFYDNQDLND 191
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
++LH G PVI+G KW+AT+W+
Sbjct: 192 LTLHGGAPVIKGSKWAATQWM 212
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/211 (36%), Positives = 113/211 (53%), Gaps = 18/211 (8%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRTSSGTFIPKGKDAI 100
P F E FL D E D ++ L+ L S V E++ +D RTS+ ++ +
Sbjct: 3 PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHPV 62
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS--------DKVNIVRGGH- 151
+ I+ + A +P + E +QVLRYE Q Y+ H DYFS D + + G+
Sbjct: 63 VQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYGYK 122
Query: 152 -RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF 210
R+ TV Y+SDVAKGG T F + PR P++N D S+ GI+V P++ ++F+
Sbjct: 123 NRMITVFWYMSDVAKGGHTNFARSGGLPR---PSSNKDCSQ----GISVAPKKRKVVVFY 175
Query: 211 SLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
S+ N DP+SLH+GCPV EG K S KWI
Sbjct: 176 SMLPNGEGDPMSLHAGCPVEEGIKLSGNKWI 206
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/227 (32%), Positives = 119/227 (52%), Gaps = 29/227 (12%)
Query: 23 SFSSTAI----INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA--DNL 76
S+++TA + P K + +S P +Y +T LE L NL+K +KR A+ +NL
Sbjct: 299 SYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNL 358
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
+ RTS+ ++ ++A++ +E ++ T EN E Q++ Y G Y+PH
Sbjct: 359 KVRPFIDSGRTSNSVWLASHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPH 418
Query: 137 YDYFSDKVNIVR--GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK 194
D+F GG R+ATVL YLSDV +GG T+FP +
Sbjct: 419 TDHFETPQAPEHRGGGDRIATVLFYLSDVPQGGATLFP---------------------R 457
Query: 195 KGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
I+V+PR+GDALL+++L+ + ++H+ CP+I+G KW+ KWI
Sbjct: 458 LNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWALVKWI 504
>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
Length = 545
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 115/217 (52%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K+++ + KP +Y +++ E + + LAK + +R+ V + +GE ++++ R S
Sbjct: 330 IAPLKLEEANLKPYIVIYHDVISEAEMELVKRLAKPRFRRATVQNYKTGELEVANYRISK 389
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + I I +++ T L E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 390 SAWLKDHEHPYIKAIGERVEDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARREETNAFK 449
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVFP+ +A+ P++G
Sbjct: 450 SLGTGNRIATVLFYMSDVTQGGATVFPSLR---------------------LALWPKKGA 488
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +F+LH + D + H+ CPV+ G KW + KWIH
Sbjct: 489 AAFWFNLHASGQGDYSTRHAACPVLTGTKWVSNKWIH 525
>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
Length = 550
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 112/215 (52%), Gaps = 23/215 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+Q+ KP F++ +TD E + + AK + KR+ V D +GE + R S
Sbjct: 325 LAPIKVEQMYVKPDIFMFHEVMTDDEIEFIKKRAKPRFKRAVVHDPKTGELTPAHYRISK 384
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
+++ + +IA I ++ T L + E++QV+ Y G YEPH+D+ + N
Sbjct: 385 SSWLRDEESPVIARITQRVTDMTGLSMLHAEELQVVNYGIGGHYEPHFDFARKRENPFTK 444
Query: 149 -GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
GG+R+ATVL Y+SDVA+GG TVF + G+++ P + A
Sbjct: 445 FGGNRIATVLFYMSDVAQGGATVF---------------------TELGLSLFPIKRAAA 483
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 484 FWLNLHASGEGDLATRHAACPVLRGSKWVSNKWIH 518
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 112/201 (55%), Gaps = 26/201 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L++ ECD LI L+K +++RS + +++ +RTSSG F + ++ +
Sbjct: 38 EPLVLVLGNVLSNEECDELIQLSKDKMQRSKIG----AAREVNSIRTSSGMFFEESENEL 93
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +++ E E +QVL+Y Q+Y+ H+DYF+ + +R++T++MYL
Sbjct: 94 VHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASKN-NRISTLVMYL 152
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K G++V P +G A+ F +++A +
Sbjct: 153 NDVEEGGETYFP---------------------KLGLSVSPTKGMAVYFEYFYSDAELND 191
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
+LH G PVI+GEKW AT+W+
Sbjct: 192 RTLHGGAPVIKGEKWVATQWM 212
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 116/221 (52%), Gaps = 36/221 (16%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K+ ++S PR + GFLTD EC+ LI+ +K++L+ E R+ G
Sbjct: 22 PVKLIELSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPC-------NEISSGVHRSGWGL 74
Query: 92 FIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKV 144
F+ +G++ I I +K+ ++ + E+ E +QV+RY G++ H+DYF+ +
Sbjct: 75 FMKEGEEDHQITKNIFNKMKSFVNI-SESCEVMQVIRYNQGEETSSHFDYFNPLTTNGSM 133
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
I G R+ T+LMYL DV +GGET FP GI VKP +G
Sbjct: 134 KIGLYGQRVCTILMYLCDVEEGGETTFPEV---------------------GIKVKPIKG 172
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDS 245
DA+LF++ N DP+SLH G PV++G KW A K I+ S
Sbjct: 173 DAVLFYNCKPNGDVDPLSLHQGDPVLKGNKWVAIKLINQKS 213
>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
Length = 540
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 110/214 (51%), Gaps = 24/214 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+Q++ P + ++D E D LI Q+KRS V G S +S+VRTS
Sbjct: 320 LGPFKVEQLNLDPYVAYFHNVISDDETDDLIEHGMGQVKRSRVGT--VGNSTVSEVRTSQ 377
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV-R 148
T++ + + ++ ++ T L E+ E +Q++ Y G YEPHYD+ DKV
Sbjct: 378 NTWLWYEQQPWLKNLKLRLEDITGLGMESAEPLQLVNYGIGGHYEPHYDFVEDKVTTFGW 437
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+RL T L+YL++V GG T FP + +AV P +G L+
Sbjct: 438 KGNRLLTALLYLNEVPMGGATAFPYLK---------------------LAVPPVKGSLLV 476
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + PD + H+GCPV+ G KW +W H
Sbjct: 477 WYNLHRSLDPDFRTKHAGCPVLMGSKWVCNEWFH 510
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 106/208 (50%), Gaps = 27/208 (12%)
Query: 45 FVYEGFLTDL-ECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAG 103
F G + D EC LI +AK +L S + D +SG +SD R S G F ++ ++A
Sbjct: 102 FAALGNVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDKRASWGMFFRLCENDLVAR 161
Query: 104 IEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRLATVLM 158
++ +++ LP ENGE + +L Y G EPH+DY +++ +I R G R++T++
Sbjct: 162 LDRRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVT 221
Query: 159 YLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIP 218
YL+D +GG+TVFP + G+AV P RG+A F N
Sbjct: 222 YLNDAPEGGQTVFP---------------------QLGLAVSPIRGNACYFEYCDGNGRV 260
Query: 219 DPVSLHSGCPVIEGEKWSATKWIHVDSF 246
D SLH+ PV G+KW TKW+ F
Sbjct: 261 DARSLHASAPVTRGDKWVMTKWMRERRF 288
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 33/206 (16%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P V E L EC+ LI LA+ +LKR+ + G +++ RTS G F + ++
Sbjct: 95 PALRVLENLLAAEECEELIALAQPRLKRALTVAS-DGSNQVDQRRTSEGMFFTLNELPLV 153
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS------DKVNIVRGGHRLAT 155
IE ++AT +P +GE +Q+L Y GQ+YEPH+D+F D + V GG R+A+
Sbjct: 154 GRIEQRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAV-GGQRVAS 212
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
V+MYL+ A+GG T FP + G+ V RRG A ++F+
Sbjct: 213 VVMYLNTPAQGGGTAFP---------------------ELGLTVTARRG-AAVYFAYEGG 250
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
D SLH+G PV GEKW ATKW+
Sbjct: 251 ---DQQSLHAGLPVQRGEKWIATKWL 273
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 69/214 (32%), Positives = 111/214 (51%), Gaps = 24/214 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
++P KV+Q+S P + L+D E + +I K Q+ RS + +G S +SD+RTS
Sbjct: 334 LSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEIGQ--TGNSTVSDIRTSQ 391
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-KVNIVR 148
T++ + +A I+ ++ T L + E +Q++ Y G +YEPH+D+ D + N
Sbjct: 392 NTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFGW 451
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+RL T L YL+DV GG T FP +AV P +G L+
Sbjct: 452 KGNRLLTALFYLNDVPLGGATAFPFLH---------------------LAVPPVKGSLLV 490
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW +W H
Sbjct: 491 WYNLHRSLHKDFRTKHAGCPVLKGSKWICNQWFH 524
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 112/201 (55%), Gaps = 26/201 (12%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P V L++ ECD LI L+K +++RS + +++ +RTSSG F + ++ +
Sbjct: 38 EPLVLVLGNVLSNEECDELIRLSKDKMQRSKIG----AAREVNSIRTSSGMFFDESENEL 93
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ IE +++ E E +Q+L+Y Q+Y+ H+DYF+ + +R++T++MYL
Sbjct: 94 VHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASKN-NRISTLVMYL 152
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP K G++V P +G A+ F +++A +
Sbjct: 153 NDVEEGGETYFP---------------------KLGLSVSPTKGMAVYFEYFYSDAELND 191
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
+LH G PVI+GEKW AT+W+
Sbjct: 192 RTLHGGAPVIKGEKWVATQWM 212
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 72/226 (31%), Positives = 109/226 (48%), Gaps = 25/226 (11%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L ++ S I+ P K++ + P +Y L+ E L +A LKR+ V SG
Sbjct: 97 LYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIKELQGMATPSLKRATVYQASSG 156
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+++ RTS + P G + + + +I+ T E +Q++ Y G Y+ HYD
Sbjct: 157 RNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYD 216
Query: 139 YFSDKVN---IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK 195
+F +K N G R+ATVL YL+DV +GG TVFPN +
Sbjct: 217 FF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPNIRK------------------- 256
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
AV P+RG +++++L N D +LH+ CPVI G KW KWI
Sbjct: 257 --AVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWI 300
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/231 (34%), Positives = 121/231 (52%), Gaps = 38/231 (16%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
++ K SST N K+ ++S KPR + FLTD EC+HLI +K++LK
Sbjct: 43 VVNKDKSSTD--NIPKLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPC-------N 93
Query: 79 ESKLSDVRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
E R+ G F+ +G++ + I +++ T+ L E+ E +QV+RY G++ H
Sbjct: 94 EISSGVHRSGWGLFMKEGEEDHPVTQNIFNRMKTFVNL-TESSEVMQVIRYNPGEETSAH 152
Query: 137 YDYFS-----DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
+DYF+ + I G R+ T+LMYL+DV +GGET FP
Sbjct: 153 FDYFNPLTTNGAMKIGLYGQRICTILMYLADVEEGGETSFPEV----------------- 195
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ VKP +GDA+LF++ N DP+SLH G PVI+G KW A K ++
Sbjct: 196 ----NVKVKPIKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVN 242
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 110/225 (48%), Gaps = 23/225 (10%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L ++ S I+ P K++ + +P +Y L+ E L +A LKR+ V SG
Sbjct: 306 LYNRTTSPFLILAPLKMELVGLEPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSG 365
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+++ RTS + P G + + + +I+ T E +Q++ Y G Y+ HYD
Sbjct: 366 RNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYD 425
Query: 139 YFSDKVN--IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+F++ + G R+ATVL YL+DV +GG TVFPN +
Sbjct: 426 FFNNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPNIRK-------------------- 465
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
AV P+RG +++++L N D +LH+ CPVI G KW KWI
Sbjct: 466 -AVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWI 509
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 105/220 (47%), Gaps = 29/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I P K+++ KP +Y G + D E D + LA+ + KR+ V D +G S R +
Sbjct: 316 FIQPVKMEEALLKPLLVIYHGVIFDAEIDVVKKLAQPRFKRTGVTDRDTGRSMPVQYRIA 375
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
F+ + +I + ++ T L ED+QV Y G Y PH+DY + +
Sbjct: 376 KAAFLKDSEHNLIVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDY--ARQGEIH 433
Query: 149 G------GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
G G+R+AT L Y+SDV GG TVFP G A+ P+
Sbjct: 434 GPRDLDWGNRIATWLFYMSDVEAGGATVFPAV---------------------GAALWPQ 472
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+G A +++L N D +LH+GCPV+ G KW + KWIH
Sbjct: 473 KGSAAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIH 512
>gi|363543307|ref|NP_001241869.1| prolyl 4-hydroxylase 6-4 precursor [Zea mays]
gi|347978826|gb|AEP37755.1| prolyl 4-hydroxylase 6-4 [Zea mays]
Length = 145
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 75/105 (71%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
+P+ V Q+S +PRAF+Y GFL+D ECDH+++LAK +++S VADN SG+S S RTSSG
Sbjct: 31 DPASVTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSG 90
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEP 135
TF+ K +D I++ IE ++A WTFLP+EN E +Q + + P
Sbjct: 91 TFLAKREDEIVSAIEKRVAAWTFLPEENAESLQSCATKPARSMTP 135
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 107/218 (49%), Gaps = 28/218 (12%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+P K+++++ P +Y +++ E D +I+++K + RS V D+ E +S RTSS
Sbjct: 304 ISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDD--HEKAVSKTRTSS 361
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKV 144
++ ++ + + T L E +QV Y G Y PHYDY +
Sbjct: 362 NAWLDDVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYDYAVAEEGKEVY 421
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+ATV+ YLSDVA GG TVFP G+ V P++G
Sbjct: 422 PSIGKGNRIATVMYYLSDVAIGGATVFPQL---------------------GLGVFPQKG 460
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++LH N D +LH CPV G KW KWIH
Sbjct: 461 SAIFWYNLHANGTVDHRTLHGACPVFVGSKWVGNKWIH 498
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 117/227 (51%), Gaps = 36/227 (15%)
Query: 26 STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
S + P K+ ++S KPR + FLT+ EC+HLI+ +K++L+ E
Sbjct: 16 SCKVEKPIKLIELSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPC-------NEISSGVH 68
Query: 86 RTSSGTFIPKGKDA--IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-- 141
R+ G F+ +G++ + I +K+ + + ++ E +Q++RY G++ HYDYF+
Sbjct: 69 RSGWGLFMKEGEEEHPVTKNIFNKMKNFVNI-SDSCEVMQIIRYNPGEETSAHYDYFNPL 127
Query: 142 ---DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
+ I G R+ T+LMYL DV +GGET FP GI
Sbjct: 128 TTNGSMKIGLYGQRICTILMYLCDVEEGGETSFPEV---------------------GIK 166
Query: 199 VKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDS 245
VKP RGDA+LF++ N DP+SLH G PV +G KW A K I+ S
Sbjct: 167 VKPIRGDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKWVAIKLINQKS 213
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 72/226 (31%), Positives = 109/226 (48%), Gaps = 25/226 (11%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L ++ S I+ P K++ + P +Y L+ E L +A LKR+ V SG
Sbjct: 306 LYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSG 365
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+++ RTS + P G + + + +I+ T E +Q++ Y G Y+ HYD
Sbjct: 366 RNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYD 425
Query: 139 YFSDKVN---IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK 195
+F +K N G R+ATVL YL+DV +GG TVFPN +
Sbjct: 426 FF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPNIRK------------------- 465
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
AV P+RG +++++L N D +LH+ CPVI G KW KWI
Sbjct: 466 --AVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWI 509
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 109/211 (51%), Gaps = 34/211 (16%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K +++ +P+ Y ++D E + L ++A+ +L RS +G +SD+RTS
Sbjct: 298 PVKEEELWDEPKIIRYHDVISDTEIETLKDIARPELTRSQ-----TGWGVISDIRTSQSV 352
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
F+ + +A I +IA T L E+ E + V Y G +Y PH+D D+VN
Sbjct: 353 FLEEV--GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT-GDEVN-----E 404
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R AT L+Y+SDV GG TVF N G+AVKP +G A+ +++
Sbjct: 405 RTATFLIYMSDVEVGGATVFTNV---------------------GVAVKPEKGSAVFWYN 443
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
LH N D + H+GCPV+ G KW A KWIH
Sbjct: 444 LHKNGELDLKTKHAGCPVLVGNKWVANKWIH 474
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 109/211 (51%), Gaps = 34/211 (16%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K +++ +P+ Y ++D E + L ++A+ +L RS +G +SD+RTS
Sbjct: 316 PVKEEELWDEPKIIRYHDVISDTEIETLKDIARPELTRSQ-----TGWGVISDIRTSQSV 370
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
F+ + +A I +IA T L E+ E + V Y G +Y PH+D D+VN
Sbjct: 371 FLEEV--GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT-GDEVN-----E 422
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R AT L+Y+SDV GG TVF N G+AVKP +G A+ +++
Sbjct: 423 RTATFLIYMSDVEVGGATVFTNV---------------------GVAVKPEKGSAVFWYN 461
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
LH N D + H+GCPV+ G KW A KWIH
Sbjct: 462 LHKNGELDLKTKHAGCPVLVGNKWVANKWIH 492
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 25/216 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K+++ KPR VY ++D E + + LA+ + +R+ V SGE + S R +
Sbjct: 334 IQPIKMEEALLKPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSRYRIAK 393
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
++ + ++ I ++ T L ED+QV Y G YEPHYDY + K + +
Sbjct: 394 SAWLKHEEHDYVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDY-ARKGEVQQD 452
Query: 150 ---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G R+AT L Y+SDV GG TVFP K +++ P++G A
Sbjct: 453 FGWGGRIATWLFYMSDVEAGGATVFP---------------------KLNLSLWPQKGSA 491
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+F+L+ N + ++ H+GCPV+ G KW A WIH
Sbjct: 492 AFWFNLYPNGEGNEMTQHAGCPVLTGSKWVANYWIH 527
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 112/239 (46%), Gaps = 46/239 (19%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES-------- 80
++ P +++++ KP ++ F+TD E + LA +LKR+ V D ++GE
Sbjct: 297 VLKPGRIERVFVKPEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRIS 356
Query: 81 --------------KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLR 126
+ ++ R S ++ +D ++ I ++ ++ L ED+QV+
Sbjct: 357 KRRATIQHPVTGKLEFANYRISKSGWLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVN 416
Query: 127 YEHGQKYEPHYDYF---SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTP 183
Y G YEPHYD+ DK + G+R+AT L YLSDV GG TVF
Sbjct: 417 YGIGGHYEPHYDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFTRV--------- 467
Query: 184 ATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G V P++GDA +++L + D + H+ CPV+ G KW A KWIH
Sbjct: 468 ------------GATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIH 514
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/226 (31%), Positives = 109/226 (48%), Gaps = 25/226 (11%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L ++ S I+ P K++ + P +Y L+ E L +A LKR+ V SG
Sbjct: 306 LYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIKELQGMATPGLKRATVYQASSG 365
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+++ RTS + P G + + + +I+ T E +Q++ Y G Y+ HYD
Sbjct: 366 RNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYD 425
Query: 139 YFSDKVN---IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK 195
+F +K N G R+ATVL YL+DV +GG TVFPN +
Sbjct: 426 FF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPNIRK------------------- 465
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
AV P+RG +++++L N D +LH+ CPVI G KW KWI
Sbjct: 466 --AVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWI 509
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/214 (34%), Positives = 106/214 (49%), Gaps = 30/214 (14%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV--RTSSGTF 92
V+ +S PRA + L+ ECD LI A+S+L S V + SG+ +++ S +F
Sbjct: 125 VQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCSCASF 184
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF------SDKVNI 146
P+ + I ++ A P + E + RY G+++ PH DYF +DK+ +
Sbjct: 185 PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLNNDKI-M 243
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
GHR+ATVL+YL++V GG T FPN G V+P++G A
Sbjct: 244 GSSGHRIATVLLYLNEVEAGGATFFPN---------------------PGFEVRPQKGGA 282
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKW 240
L F + DP SLH GC V +GEKW AT W
Sbjct: 283 LYFAYQQADGSMDPTSLHEGCAVTQGEKWIATLW 316
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 111/214 (51%), Gaps = 24/214 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
++P KV+Q+S P + L+D E + +I K Q+ RS + +G S +S++RTS
Sbjct: 334 LSPFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEIGQ--TGNSTVSEIRTSQ 391
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-KVNIVR 148
T++ + +A I+ ++ T L + E +Q++ Y G +YEPH+D+ D + N
Sbjct: 392 NTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFGW 451
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+RL T L YL+DV GG T FP +AV P +G L+
Sbjct: 452 KGNRLLTALFYLNDVPLGGATAFPFLH---------------------LAVPPVKGSLLV 490
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW +W H
Sbjct: 491 WYNLHRSLHKDFRTKHAGCPVLKGSKWICNEWFH 524
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 109/214 (50%), Gaps = 33/214 (15%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRS-AVADNLSGESKLSDVRTSSGTF 92
+V + P V++G L+D EC LI LAK +L+R+ VA++ G ++ + RTS G F
Sbjct: 87 RVMLAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAED--GAQQIDEHRTSDGMF 144
Query: 93 IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-----KVNIV 147
G+ +I IE +IA +P ++GE +QVL Y GQ+YEPH D+F
Sbjct: 145 FGLGEQPLIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQDWFDPTQPGYAAITA 204
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
GG R+A++++YL+ GG T FP G+ V RG A+
Sbjct: 205 TGGQRIASLVIYLNTPDAGGGTAFPEI---------------------GLTVTALRGSAV 243
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
F T D SLH+G PV GEKW ATKW+
Sbjct: 244 CF----TYESGDVFSLHAGLPVTRGEKWIATKWL 273
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/194 (38%), Positives = 106/194 (54%), Gaps = 27/194 (13%)
Query: 67 LKRSAVAD-NLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVL 125
+ RS +A+ ++ + RTSS ++ K D ++A I ++A LP E ED+QVL
Sbjct: 1 MGRSTIAEAGNEAKNGVGSARTSSTAWLSKTADPLVAKIRTRVAELVKLPMELAEDMQVL 60
Query: 126 RYEHGQKYEPHYDYFSDKVNIVR------GGHRLATVLMYLSDVAKGGETVFPNAEEPPR 179
Y Q Y H+D+F NI R G +R TV YLSDV +GGETVFP A R
Sbjct: 61 HYSKNQHYWAHHDFFDP--NIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDR 118
Query: 180 RRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN-----AIPDPV-------SLHSGC 227
R T D ++C+ +G+ VKP+ G+A++F+S+ PD + SLH GC
Sbjct: 119 RVT-----DFADCS-RGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGC 172
Query: 228 PVIEGEKWSATKWI 241
VI+G+KW+A WI
Sbjct: 173 DVIKGDKWAANYWI 186
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 108/215 (50%), Gaps = 24/215 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++ IS P +Y ++ E L +LA LKR+ V + S + + RTS
Sbjct: 318 LAPLKMELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSK 377
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV--NIV 147
T++ + + + +I T E +QV+ Y G Y+ HYDYF+ V ++
Sbjct: 378 VTWLLDTLNQLTIRLNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLT 437
Query: 148 R-GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
R G R+ATVL YL+DV +GG TVFPN E+ AV P+ G A
Sbjct: 438 RLNGDRIATVLFYLTDVEQGGATVFPNIEK---------------------AVFPKSGTA 476
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++++L + DP +LH+ CPVI G KW KWI
Sbjct: 477 VVWYNLRHDGNGDPQTLHAACPVIVGSKWVCNKWI 511
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 108/218 (49%), Gaps = 29/218 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K +++S P +Y + E D + L ++LKR+ + + ES +S+VRTS
Sbjct: 290 IAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATITS--TNESVVSNVRTSQ 347
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF------SDK 143
TF+P +D ++A I+ ++A T ED Q Y G Y H D+F +
Sbjct: 348 FTFLPVTEDKVLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGL 407
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
V+ G+R+ATVL YLSDV +GG T FP+ + +KP++
Sbjct: 408 VSSPEMGNRIATVLFYLSDVTQGGGTAFPHLR---------------------VLLKPKK 446
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
A +++LH + + DP + H CP+I G KW +WI
Sbjct: 447 YAAAFWYNLHASGVGDPRTQHGACPIISGSKWVQNRWI 484
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I P K+++ KP VY ++D E + + +AK + KR+ + ++ +GE + ++ R S
Sbjct: 323 FIQPIKMEEALLKPMIVVYHDVMSDDEIETVKKMAKPRFKRATIRNSKTGELEPANYRIS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY----FSDKV 144
++ + I + ++ T L ED+QV+ Y G YEPH+DY ++
Sbjct: 383 KSAWLKSEEHDHILKVTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTEAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP G AV PR+G
Sbjct: 443 KELGWGNRIATWLFYMSDVEAGGATVFPPT---------------------GAAVWPRKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L+ N + ++ H+ CPV+ G KW + +WIH
Sbjct: 482 SAAFWYNLYPNGKGNELTRHAACPVLSGSKWVSNRWIH 519
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 70/213 (32%), Positives = 109/213 (51%), Gaps = 23/213 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P KV+Q P ++ L+D E + + LA+S+L + + S + +L R S
Sbjct: 315 IAPFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFPFRISK 374
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
++ + +A + ++A T L E+ QV+ Y G YEPH+D+ S +
Sbjct: 375 VAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVDPAI-- 432
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
G R+ TVL YLSDV +GG TVFP + ++V P++G A+++
Sbjct: 433 GSRIETVLFYLSDVEQGGATVFPEIQ---------------------VSVWPQKGSAVVW 471
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
F+LH + D + H+GCPV+ G KW ATKWIH
Sbjct: 472 FNLHPSGDGDQRTKHAGCPVLIGSKWIATKWIH 504
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 113/219 (51%), Gaps = 29/219 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P KV+++S P ++ + E D L+ LAK+++ R+ V + S S +S+ RTS
Sbjct: 321 LLAPLKVEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVTGHNS--SVVSNARTS 378
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF------SD 142
TF+PK + ++ I+ ++A T L E ED Q+ Y G Y H D+F +
Sbjct: 379 QFTFLPKTRHKVLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETK 438
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+V+ G+R+ TVL YLSDV +GG T FP ++ ++P+
Sbjct: 439 QVSNPEMGNRIGTVLFYLSDVEQGGATAFPALKQ---------------------LLRPK 477
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ A +++LH + + D ++H CP+I G KW +WI
Sbjct: 478 KHAAAFWYNLHASGVGDARTMHGACPIIVGSKWVLNRWI 516
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 110/213 (51%), Gaps = 29/213 (13%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
++ +S P + FL EC+ LI+LA+ ++KR+ V +L G S +S RT S ++
Sbjct: 50 METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALV--SLDGSSGVSQGRTGSNCWLR 107
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-DKVNIVR----G 149
++ + I +++A P E E +QV+ Y H Q+Y PHYD + D +R G
Sbjct: 108 YQEEPLARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCTRQG 167
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
G R+ T L+YL++V +GG T FPNA G+ V PR+G +F
Sbjct: 168 GQRMVTALLYLNEVEEGGATAFPNA---------------------GVEVAPRKGRIAIF 206
Query: 210 FSLHTN-AIPDPVSLHSGCPVIEGEKWSATKWI 241
++ + P P SLH G PV GEKW+A+ W
Sbjct: 207 NNVGADPGRPHPRSLHGGMPVKSGEKWAASIWF 239
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 70/213 (32%), Positives = 109/213 (51%), Gaps = 23/213 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P KV+Q P ++ L+D E + + LA+S+L + + S + +L R S
Sbjct: 309 IAPFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFPFRISK 368
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
++ + +A + ++A T L E+ QV+ Y G YEPH+D+ S +
Sbjct: 369 VAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVDPAI-- 426
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
G R+ TVL YLSDV +GG TVFP + ++V P++G A+++
Sbjct: 427 GSRIETVLFYLSDVEQGGATVFPEIQ---------------------VSVWPQKGSAVVW 465
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
F+LH + D + H+GCPV+ G KW ATKWIH
Sbjct: 466 FNLHPSGDGDQRTKHAGCPVLIGSKWIATKWIH 498
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/118 (50%), Positives = 75/118 (63%), Gaps = 7/118 (5%)
Query: 124 VLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTP 183
VLRYE GQKY HYD F + R+A+ L+YLSDV +GGET+FP +
Sbjct: 4 VLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYEND-----NI 58
Query: 184 ATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+N D +C G+ VKPR+GD LLF+SL +N DP S+H CPVI+GEKW ATKWI
Sbjct: 59 DSNYDYVQCI--GLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWI 114
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 71/217 (32%), Positives = 109/217 (50%), Gaps = 33/217 (15%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
KV + PR +++ L+D ECD LI ++S+L+RS V N + D RTS G +
Sbjct: 118 KVVMVCTAPRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEFVDDTRTSYGAYF 177
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-----VNIVR 148
KG+++++A I+ +IA T P + E +Q+L Y G +Y PH+DYF + +
Sbjct: 178 NKGENSLVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLES 237
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GG R+ATV+MYL+DV GG T+FP+ + +PR+G A+
Sbjct: 238 GGQRIATVVMYLNDVEAGGGTIFPHLN---------------------LETRPRKGGAIY 276
Query: 209 FFSLHTNAIPDPVSLHSGCPV---IEGEKWSATKWIH 242
F + + S+ S C I KW AT+W
Sbjct: 277 F----SYQLAVARSIRSRCMAARRIARRKWIATQWFR 309
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 107/201 (53%), Gaps = 28/201 (13%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P+ + +++ EC+ LI L+K ++ RS + + +SD+RTSS F+P D +
Sbjct: 33 EPKIAILGNVVSEEECEALIRLSKDKVNRSKIG----SDHDVSDIRTSSSAFLPD--DEL 86
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
IE ++A +P E+GE I +L Y+ GQ+Y+ H+DYF + R++T+++YL
Sbjct: 87 TGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAKNP-RISTLVLYL 145
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+DV +GGET FP + + V P +G A+ F + + +
Sbjct: 146 NDVEEGGETYFP---------------------EMNLTVSPHKGMAVYFEYFYNDPAINE 184
Query: 221 VSLHSGCPVIEGEKWSATKWI 241
+LH G PV GEKW+AT W+
Sbjct: 185 RTLHGGSPVTAGEKWAATMWV 205
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/217 (32%), Positives = 110/217 (50%), Gaps = 25/217 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K++ PR Y L+D E + + LAK +L R+ V D +G +++ R S
Sbjct: 323 ILGPIKMEDEWDSPRIVRYLDVLSDEEIEKIKELAKPRLARATVRDPKTGVLTVANYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK---VN 145
++ + D +I + ++ T L K+ E +QV Y G +YEPH+D FS + N
Sbjct: 383 KSAWLEEYDDPVIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSN 441
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+RLAT L Y+SDV GG TVFP+ G A+ PR+G
Sbjct: 442 LKTEGNRLATYLNYMSDVEAGGATVFPDF---------------------GAAIWPRKGT 480
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 481 AVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFH 517
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 114/219 (52%), Gaps = 27/219 (12%)
Query: 29 IINPSKVKQISW-KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
++ P K +Q W +P Y ++D E + + LAK +L+R+ +++ ++G + + R
Sbjct: 403 LLAPVK-QQDEWDRPYIVRYLDIISDAEIERVKQLAKPRLRRATISNPITGVLETASYRI 461
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DK 143
S ++ + D +I I D+I T L + E++QV Y G +YEPH+D+ D
Sbjct: 462 SKSAWLTEYDDPMIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDA 521
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ G+R+AT L Y+SDV+ GG TVFP+ G AV P++
Sbjct: 522 FKELGTGNRIATWLFYMSDVSAGGATVFPDV---------------------GAAVWPQK 560
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 561 GTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIH 599
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 108/215 (50%), Gaps = 26/215 (12%)
Query: 29 IINPSKVKQI-SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
+ P KV+ + + R V+ F + EC HL + K +L+R+ + G + + R
Sbjct: 324 FLKPIKVEHLHEGRQRLQVFRQFASPEECRHLQHAGKRRLERAVAWTD--GRFQPVEFRI 381
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
S+ ++ DAI+ I +I T + E E +Q+ Y G YEPH+D+ S N
Sbjct: 382 STAAWLQPDHDAIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDHSSRGTNP- 440
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G RLAT ++YL+ V +GG T FP + G AV+P GDA+
Sbjct: 441 -DGERLATFMIYLNPVKQGGFTAFP---------------------RLGAAVQPGYGDAV 478
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++L + + DP++LH CPV+ G KW A KWIH
Sbjct: 479 FWYNLQPSGVGDPLTLHGACPVLRGSKWVANKWIH 513
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 113/221 (51%), Gaps = 27/221 (12%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
+F++T + P K++ + P VY L+D E ++ +A+ ++ R++ + S
Sbjct: 298 NFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTS 357
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
S RT+ G ++ + +A+ I ++ + L E E +QV+ Y G Y PH D+F
Sbjct: 358 --SPTRTAMGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWF 415
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
+ ++ G+RLATVL YL+DV +GG T+F AE V
Sbjct: 416 TQHPEVM--GNRLATVLFYLTDVEQGGATMFNKAEH---------------------KVL 452
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
PRRG AL +++LHT+ D + H+ CP+I G KW T+WI
Sbjct: 453 PRRGTALFWYNLHTDGEGDWSTTHAACPIIVGSKWVLTQWI 493
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 107/226 (47%), Gaps = 25/226 (11%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L ++ S ++ P K++ + P +Y L+ E L +A LKR+ V SG
Sbjct: 309 LYNRTTSPFLMLAPLKMELVGLDPYMVLYHDVLSAKEIKELQGMATPGLKRATVFQAASG 368
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+++ RTS + P G + + +I T E +Q++ Y G Y+ HYD
Sbjct: 369 RNEVVRTRTSKVAWFPDGYSPLTVRLNARITDMTGFNLHGSEMLQLMNYGLGGHYDQHYD 428
Query: 139 YFSDKVN---IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK 195
YF + +N G R+ATVL YL+DV +GG TVFPN +
Sbjct: 429 YF-NTINSNLTAMSGDRIATVLFYLTDVEQGGATVFPNIRK------------------- 468
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
AV P+RG +++++L + D +LH+ CPVI G KW KWI
Sbjct: 469 --AVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKWVCNKWI 512
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 106/216 (49%), Gaps = 24/216 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++ + P +Y ++ LE L ++A LKR+ V S++ RTS
Sbjct: 283 LAPLKMELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSK 342
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF--SDKVNIV 147
+ P + + + +IA T E +Q + Y G Y+ HYD+F S N+
Sbjct: 343 VAWFPDTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTAANLT 402
Query: 148 R-GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+ G R+ATVL YL+DV +GG TVFPN + AV P+RG A
Sbjct: 403 QMNGDRIATVLFYLTDVEQGGATVFPNIRK---------------------AVFPQRGSA 441
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++++L + P+P +LH+ CPV+ G KW KWI
Sbjct: 442 IIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIR 477
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR +Y + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 331 IAPFKEEEAYLDPRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 390
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T L E E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 391 SAWLQEHEHKHVAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFK 450
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N I++ PR+G
Sbjct: 451 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------ISLWPRKGS 489
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +F+L N D + H+ CPV+ G KW A KW+H
Sbjct: 490 AAFWFNLKPNGEGDLRTRHAACPVLTGSKWVANKWLH 526
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 105/202 (51%), Gaps = 22/202 (10%)
Query: 46 VYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIE 105
+ G LT C +LI + +S L+ + V D +G+ R S + + I+ +
Sbjct: 67 AWAGLLTPENCQNLIAIGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLA 126
Query: 106 DKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVRGGHRLATVLMYLSDVA 164
+ IA T +P + E +Q+L Y G +Y+PHYD F +D + +GG+R AT+++YL+ V
Sbjct: 127 EGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAADAPTLRQGGNRQATLILYLNAVE 186
Query: 165 KGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLH 224
+GGET FP + G+ V P G + F +L+ P+SLH
Sbjct: 187 EGGETAFP---------------------ELGLQVSPIPGGGVFFRNLNEEGQRHPLSLH 225
Query: 225 SGCPVIEGEKWSATKWIHVDSF 246
+G PV +GEKW AT+WI +++
Sbjct: 226 AGLPVRKGEKWIATQWIRQEAY 247
>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 113/217 (52%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P +Y + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 333 LGPLKLEEAHKDPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISK 392
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ +D++IA + + A T L E+ E++QV+ Y G Y PH+D+ ++
Sbjct: 393 SAWLKTEEDSVIAKVVQRTADMTGLDMESAEELQVVNYGIGGHYAPHFDFARREEKRAFE 452
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF RT A+ P+RG
Sbjct: 453 GLNLGNRIATVLFYMSDVEQGGATVFTTL------RT---------------ALWPKRGT 491
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 492 AAFWMNLHRDGEGDKRTQHAACPVLTGTKWVSNKWIH 528
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 111/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I++P+K + KPR Y ++D E + LAK +L+R+ +++ ++G + + R +
Sbjct: 314 ILSPTKQEDEWDKPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQYRIT 373
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D ++A + +I T L E++QV Y G +YEPH+D+ D
Sbjct: 374 KSAWLSGYEDPVVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLRKYEPDAF 433
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP G AV P++G
Sbjct: 434 KKLGTGNRVATWLFYMSDVEAGGATVFPEV---------------------GAAVYPKKG 472
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 473 TAVFWYNLLESGEGDYSTRHAACPVLVGNKWVSNKWIH 510
>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
Length = 487
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 111/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P +Y + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 271 LGPLKLEEAHKDPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISK 330
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ +D++IA + + A T L E+ E++QV+ Y G Y PH+D+ ++
Sbjct: 331 SAWLKTEEDSVIAKVVQRTADMTGLDMESAEELQVVNYGIGGHYAPHFDFARREEKRAFE 390
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF A+ P+RG
Sbjct: 391 GLNLGNRIATVLFYMSDVEQGGATVFTTLR---------------------TALWPKRGT 429
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 430 AAFWMNLHRDGEGDKRTQHAACPVLTGTKWVSNKWIH 466
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/216 (34%), Positives = 113/216 (52%), Gaps = 36/216 (16%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K+ ++S PR + FL+ EC+HLI+++K++L+ E R+ G F+
Sbjct: 25 KLIEMSQCPRVYRVPDFLSPAECEHLIDISKNKLRPC-------NEISSGVHRSGWGLFM 77
Query: 94 PKGKDA--IIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNI 146
+G++ ++ I ++ L EN E +QV+RY G++ HYDYF+ + I
Sbjct: 78 KEGEEDHDVVKKIFQRMKMLVNL-TENCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKI 136
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G R+ T+LMYLS+V +GGET FP G+ VKP +GDA
Sbjct: 137 GLYGQRVCTILMYLSEVEEGGETSFPEV---------------------GVKVKPVKGDA 175
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+LF++ N DP+SLH G PVI+G KW A K I+
Sbjct: 176 VLFYNCKPNGEVDPLSLHQGDPVIKGTKWVAIKLIN 211
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 113/221 (51%), Gaps = 27/221 (12%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
+F++T + P K++ + P VY L+D E ++ +A+ ++ R++ + S
Sbjct: 323 NFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTS 382
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
S RT+ G ++ + +A+ I ++ + L E E +QV+ Y G Y PH D+F
Sbjct: 383 --SPTRTALGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWF 440
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
+ ++ G+RLATVL YL+DV +GG T+F AE V
Sbjct: 441 TQHPEVM--GNRLATVLFYLTDVEQGGATMFNKAEH---------------------KVL 477
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
PRRG AL +++LHT+ D + H+ CP+I G KW T+WI
Sbjct: 478 PRRGTALFWYNLHTDGEGDWSTTHAACPIIVGSKWVLTQWI 518
>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 490
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 114/226 (50%), Gaps = 25/226 (11%)
Query: 21 RKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
R + S ++ P+K + + KPR +Y ++ E D + LA+ +LKR+ V + SGE
Sbjct: 270 RTNGSPFLLLQPAKEEVMFPKPRIVIYHDVMSKHEMDVVKLLAQPRLKRATVQNYKSGEL 329
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
++++ R S ++ + +IA + +I T L + E++QV+ Y G YEPH+D+
Sbjct: 330 EVANYRISKSAWLRNEEHGVIARVTRRIEHITGLSADTAEELQVVNYGIGGHYEPHFDFA 389
Query: 141 ----SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
+ + G+R+AT L Y+SDV GG TVFP
Sbjct: 390 RREEKNAFQSLGTGNRIATWLNYMSDVPAGGATVFPQLR--------------------- 428
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ + P +G A +++LH + D ++ H+ CPV+ G KW + KW H
Sbjct: 429 LTLWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKWVSNKWFH 474
>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
Length = 487
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 111/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P +Y + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 271 LGPLKLEEAHADPYIVIYHDAMYDSEIDVIKRMARPRFRRATVQNSVTGALETANYRISK 330
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ +D +I + + A T L E+ E++QV+ Y G YEPH+D+ ++
Sbjct: 331 SAWLKTHEDRVIGTVVQRTADMTGLDMESAEELQVVNYGIGGHYEPHFDFARKEEERAFE 390
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF + A+ PR+G
Sbjct: 391 GLNLGNRIATVLFYMSDVEQGGATVFTSLH---------------------TALFPRKGT 429
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 430 AAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIH 466
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 108/227 (47%), Gaps = 25/227 (11%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L ++ S+ ++ P K++ + P +Y L+ E L +A L R+ V SG
Sbjct: 306 LYNRTTSAFLMLAPLKMELVGLDPYMVLYHDVLSAKEIKELQGMATPGLTRATVFQASSG 365
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
+++ RTS + P + + + +IA T E +Q++ Y G Y+ HYD
Sbjct: 366 RNEVVKTRTSKVAWFPDSYNPLTVRLNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYD 425
Query: 139 YFSDKVN---IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKK 195
+F + +N G R+ATVL YL+DV +GG TVFPN +
Sbjct: 426 FF-NTINSNLTAMSGDRIATVLFYLTDVEQGGATVFPNIRK------------------- 465
Query: 196 GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AV P+RG +++++L N D +LH+ CPVI G KW KWI
Sbjct: 466 --AVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKWVCNKWIR 510
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 72/192 (37%), Positives = 91/192 (47%), Gaps = 41/192 (21%)
Query: 85 VRTSSGTFIPKGKD--AIIAGIEDKIATWTFLPKENGE---------------------- 120
+R SG FI +D + IE KIA +P+ +GE
Sbjct: 90 IRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVM 149
Query: 121 -----------DIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGET 169
+LRYE GQ+Y HYD F + HR+AT L+YLSDV +GGET
Sbjct: 150 KRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGET 209
Query: 170 VFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPV 229
+FP + D C G+ VKP +GD LLF+S+ N DP SLH CPV
Sbjct: 210 MFPFENG----LNMDKDYDFQRCI--GLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPV 263
Query: 230 IEGEKWSATKWI 241
I+GEKW ATKWI
Sbjct: 264 IKGEKWVATKWI 275
>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
Length = 487
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 80/234 (34%), Positives = 122/234 (52%), Gaps = 22/234 (9%)
Query: 19 LIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
L++ +F ++ ++ IS P F E FL D E D ++ L+ L S V
Sbjct: 256 LVKNTFGRGDLV----METISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGH 311
Query: 79 ESK-LSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHY 137
E++ +D RTS+ ++ ++ I+ + A +P + E +QVLRYEH Q Y+ H
Sbjct: 312 ENRPATDWRTSTTYWLESSSHPVVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHL 371
Query: 138 DYFS--------DKVNIVRGGH--RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATND 187
DYFS D + + G+ R+ TV Y+SDVAKGG T F A P P TN
Sbjct: 372 DYFSVKRHRNSADVLKKIEHGYKNRMITVFWYMSDVAKGGHTNFARAGGLP---PPPTN- 427
Query: 188 DLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ +G++V P++ ++F+S+ N DP+SLH+GCPV EG K S KW+
Sbjct: 428 ---KGCTQGLSVVPKKRKVVVFYSMLPNGEGDPMSLHAGCPVEEGIKMSGNKWV 478
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 108/214 (50%), Gaps = 24/214 (11%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P KV+ ++ P +Y L D E + L LA + RS + D + + RTS+
Sbjct: 277 PFKVEILNNLPFVAIYHDVLYDREIEELKRLAVPTITRSTIYDYDKEGNVPVNFRTSNSV 336
Query: 92 FIPKGKDAIIAGIEDKIATWTFLP--KENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
F+ ++ + ++A T L K + +D+QV+ Y G Y H+D+F D+
Sbjct: 337 FLLNNASYLVDILRQRVADMTHLNVFKNSSDDLQVMNYGLGGYYRYHFDFFGKDESPNKL 396
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G R+ TVL+Y++DV +GG TVFP I P++G AL+
Sbjct: 397 LGDRIITVLIYMTDVQQGGATVFPALR---------------------ITNFPKKGSALI 435
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
F +L N PDP +LH+GCPV+ G KW+ATKWI+
Sbjct: 436 FRNLDNNISPDPSTLHAGCPVLFGSKWAATKWIY 469
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 103/213 (48%), Gaps = 27/213 (12%)
Query: 36 KQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI-P 94
+ + PR + GFL+ EC H+ A+ L+ S V D SG +RTS G I P
Sbjct: 134 RTVRADPRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHPIRTSDGGAIGP 193
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
++ ++ I +IA T E GE + VLRY GQ+Y H D + N R+A
Sbjct: 194 TNENLVVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAEN-----QRIA 248
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T ++YL+D +GGET FP I V+PR GDA+ F ++
Sbjct: 249 TFIVYLNDGFEGGETHFP---------------------LLNIQVRPRIGDAIRFDTIRP 287
Query: 215 NAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
+ PDP +H+G PV G KW AT+WI + D
Sbjct: 288 DGTPDPRLVHAGQPVRNGVKWIATRWIRREPVD 320
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR Y ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 281 ILAPAKQEDEWDKPRIVRYHEIISDAEIETVKDLAKPRLSRATVHDPETGKLTTAQYRVS 340
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 341 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 400
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 401 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 439
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 440 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIH 477
>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
Length = 550
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 111/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++++ P +Y + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 334 LGPLKLEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISK 393
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK----VN 145
++ +D +I + + A T L ++ E++QV+ Y G YEPH+D+ +
Sbjct: 394 SAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEERAFE 453
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+R+ATVL Y+SDV +GG TVF + A+ P++G
Sbjct: 454 GINLGNRIATVLFYMSDVEQGGATVFTSLH---------------------TALFPKKGT 492
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 493 AAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIH 529
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K+++ KP +Y + D E + + +A+ + KR+ V ++++G + ++ R S
Sbjct: 332 IQPVKMEEAFHKPLIVIYHNVINDDEIETVKKMAQPRFKRATVQNSVTGNLEPANYRISK 391
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + + ++ T L ED+QV+ Y G YEPH+DY ++VN +
Sbjct: 392 SAWLKSEEHDHVFKVTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFK 451
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+AT L Y+S+V GG TVFP K +A+ P++G
Sbjct: 452 DLGWGNRVATWLFYMSEVEAGGATVFP---------------------KLNLALWPQKGS 490
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++LH N + ++ H+ CPV+ G KW + KWIH
Sbjct: 491 AAFWYNLHPNGEGNELTRHAACPVLTGSKWVSNKWIH 527
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 104/202 (51%), Gaps = 22/202 (10%)
Query: 46 VYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIE 105
+ G LT C +LI + +S L+ + V D +G+ R S + + I+ +
Sbjct: 15 AWAGLLTPENCQNLIAIGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDHPILQSLA 74
Query: 106 DKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVRGGHRLATVLMYLSDVA 164
+ IA T +P + E +Q+L Y G +Y+PHYD F +D + +GG+R T+++YL+ V
Sbjct: 75 EGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAADAPTLRQGGNRQGTLILYLNAVE 134
Query: 165 KGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLH 224
+GGET FP + G+ V P G + F +L+ P+SLH
Sbjct: 135 EGGETAFP---------------------ELGLQVSPIPGGGVFFRNLNEEGQRHPLSLH 173
Query: 225 SGCPVIEGEKWSATKWIHVDSF 246
+G PV +GEKW AT+WI +++
Sbjct: 174 AGLPVRKGEKWIATQWIRQEAY 195
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR VY + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 257 IAPLKEEEAYLDPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 316
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T + E E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 317 SAWLQEHEHKHVAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFK 376
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N I++ PR+G
Sbjct: 377 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------ISLWPRKGS 415
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW A KW+H
Sbjct: 416 AAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLH 452
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++++ P +Y + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 334 LGPLKLEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISK 393
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ +D +I + + A T L ++ E++QV+ Y G YEPH+D+ ++
Sbjct: 394 SAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEQRAFE 453
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF + A+ P++G
Sbjct: 454 GLNLGNRIATVLFYMSDVEQGGATVFTSLH---------------------TALFPKKGT 492
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 493 AAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIH 529
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 89/159 (55%), Gaps = 5/159 (3%)
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
VRTS GTF+ + +EDKIA T LP+ NGE VL Y+H Q Y+ H D F K
Sbjct: 20 VRTSKGTFLGGDSSPALRWLEDKIAAVTLLPRTNGEFWNVLNYKHSQHYDSHMDSFDPKE 79
Query: 145 NIVRGGHRLATVLMYLSDVA-KGGETVFPNAEEPPRRRTPATNDDLSEC-AKKGIAVKPR 202
+ R+ATV++ LSD GGETVF E P +N ++C A G+ KPR
Sbjct: 80 YGPQYSQRIATVIVVLSDDGLMGGETVF-KREGKSSINKPISN--WTDCDADGGLKYKPR 136
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
GDA+LF+S + DP +LH CPV+ G KW A KW+
Sbjct: 137 AGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWL 175
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 105/214 (49%), Gaps = 22/214 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++ + P +Y L+ E D + +A LKR+ V G++++ RTS
Sbjct: 315 LAPLKMEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKTRTSK 374
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
+ P +++ + +I T E +Q++ Y G Y+ HYD+F + + +
Sbjct: 375 VAWFPDSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSSL 434
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G R+ATVL Y+SDV +GG TVFPN + V P+RG A++
Sbjct: 435 TGDRIATVLFYMSDVEQGGATVFPNIYK---------------------TVYPQRGTAVM 473
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++L + PD +LH+ CPV+ G KW KWI
Sbjct: 474 WYNLKDDGQPDEQTLHAACPVLVGSKWVCNKWIR 507
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 113/219 (51%), Gaps = 27/219 (12%)
Query: 29 IINPSKVKQISW-KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
+++P K +Q W +P Y ++D E + + LAK +L+R+ +++ ++G + + R
Sbjct: 336 VLSPVK-QQDEWDRPYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLETASYRI 394
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DK 143
S ++ + +I I +I T L + E++QV Y G +YEPH+D+ D
Sbjct: 395 SKSAWLTGYEHPVIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDA 454
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ G+R+AT L Y+SDVA GG TVFP+ G AV P++
Sbjct: 455 FKELGTGNRIATWLFYMSDVAAGGATVFPDV---------------------GAAVWPQK 493
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A+ +++L N D + H+ CPV+ G KW + KWIH
Sbjct: 494 GTAVFWYNLFANGEGDYSTRHAACPVLVGNKWVSNKWIH 532
>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
Length = 550
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++++ P +Y + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 334 LGPLKLEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISK 393
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ +D +I + + A T L ++ E++QV+ Y G YEPH+D+ ++
Sbjct: 394 SAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEERAFE 453
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF + A+ P++G
Sbjct: 454 GLNLGNRIATVLFYMSDVEQGGATVFTSLH---------------------TALFPKKGT 492
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 493 AAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIH 529
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 108/219 (49%), Gaps = 29/219 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P KV+++S P +Y + E D L L K+++ R+ V N S +S+ RTS
Sbjct: 317 LLAPLKVEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVTGN--NASVVSNARTS 374
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS------D 142
TFIPK + ++ I+ ++A T L ED Q+ Y G Y H D+FS
Sbjct: 375 QFTFIPKTRHKVLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHYAQHMDWFSPNAFETK 434
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+V G+R+ATVL YL+DV +GG T FP ++ +KP+
Sbjct: 435 QVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQ---------------------LLKPK 473
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ A +++LH + D ++H CP+I G KW +WI
Sbjct: 474 KYAAAFWYNLHASGAGDVRTMHGACPIIVGSKWVLNRWI 512
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 79/210 (37%), Positives = 109/210 (51%), Gaps = 12/210 (5%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA----DNLSGESKLSDVRTSSGTFI 93
+S PR V+ F+ + +I LA + S +A + + E + VRTS GTF+
Sbjct: 218 LSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEAEQQ---VRTSKGTFL 274
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRL 153
+ +E KIA T +P++NGE VL Y+H Q Y+ H D F K + R+
Sbjct: 275 GGDSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDSFDPKEYGQQYSQRI 334
Query: 154 ATVLMYLSDVA-KGGETVFPNAEEPPRRRTPATNDDLSEC-AKKGIAVKPRRGDALLFFS 211
ATV++ LSD GGETVF E P TN ++C A G+ KPR GDA+LF+S
Sbjct: 335 ATVIVVLSDEGLVGGETVF-KREGKANIDKPITN--WTDCDADGGLRYKPRAGDAVLFWS 391
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ D +LH CPV+ G KW A KWI
Sbjct: 392 AFPDGRLDQHALHGSCPVVTGNKWVAVKWI 421
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 108/215 (50%), Gaps = 24/215 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K + ++ P +Y +T E L LA LKR+ V + G + + RTS
Sbjct: 317 LAPLKTELLALDPYMVLYHDVITPSEIRELQYLAVPTLKRATVFNQKMGRNTVVKTRTSK 376
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV--NIV 147
T++ + + + +I+ T E +QV+ Y G Y+ H+DYF+ + ++
Sbjct: 377 VTWLTDSLNPLTVRLNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFNATIAKDLT 436
Query: 148 R-GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+ G R+ATVL YL+DV +GG TVFPN ++ A+ P++G A
Sbjct: 437 KLNGDRIATVLFYLTDVEQGGATVFPNIKQ---------------------AIFPKKGTA 475
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++++L N DP +LH+ CPVI G KW KWI
Sbjct: 476 VMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWI 510
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 107/217 (49%), Gaps = 24/217 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P +++Q+ KP+ +V LTD E + + LA+ +L+R+ V +GE +L+ R S
Sbjct: 309 LLAPIRLEQVFDKPKLWVLHNILTDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRIS 368
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV- 147
++ + +I + ++ T L E E +QV+ Y G YEPH+D +
Sbjct: 369 KSAWLYDWEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFAL 428
Query: 148 --RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G R+AT+L Y+SDV GG TVFP G V P +G
Sbjct: 429 DPNEGDRIATMLFYMSDVEAGGATVFPQV---------------------GARVVPEKGA 467
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++L + D ++ H+GCPV+ G KW + KWIH
Sbjct: 468 GAFWYNLLKSGEGDMLTEHAGCPVLVGSKWVSNKWIH 504
>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 750
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 116/234 (49%), Gaps = 55/234 (23%)
Query: 46 VYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP--KGKDAIIAG 103
V++ FL+ +ECD L+ +A L+RS V D KLS+ RTSS TF+ K ++ ++
Sbjct: 533 VFDHFLSAVECDDLVAIAAPDLRRSRVTDG-----KLSEGRTSSSTFLTGCKQEEPLVRA 587
Query: 104 IEDKI-----------------------------ATWTFLPKEN----GEDIQVLRYEHG 130
IE ++ +T F + N E +QV+RY G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647
Query: 131 QKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLS 190
Q Y HYD +K +R R AT +MYL+DV GG T FP A R D
Sbjct: 648 QMYTAHYD---NKQGCLR---RTATFMMYLTDVHSGGATHFPRAVPVSMR------DGCG 695
Query: 191 ECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVD 244
+ A GI + P+RG AL+F+S+ + I D SLH PVIEGEKW ATKW+ D
Sbjct: 696 DAA--GIRIWPKRGRALVFWSV-SGGIEDVRSLHEAEPVIEGEKWIATKWLRED 746
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 111/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR +Y + D E + + +A+ + KR+ V + +GE ++++ R S
Sbjct: 337 IAPFKEEEAYLDPRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGELEIANYRISK 396
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + + + ++ T + E E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 397 SAWLQEHEHKHVRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFK 456
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF K I++ P++G
Sbjct: 457 SLGTGNRIATVLYYMSDVEQGGGTVF---------------------TKINISLWPKKGS 495
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW A KW+H
Sbjct: 496 AAFWYNLKPNGEGDYKTRHAACPVLTGSKWVANKWLH 532
>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 571
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 24/216 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+++ P F+ +++ + + + A L+R+ + D ++G+ + +D R S
Sbjct: 361 LKPQKVERVWVDPEIFILRNIISEKQINLIKEAASPMLRRATIQDPITGKLRHADYRISK 420
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF---SDKVNI 146
++ K + +E + T L E +QV Y G YEPH+D+ D+
Sbjct: 421 SAWLSTNKYNFLQALEARTQATTGLDLSYAEQLQVANYGLGGHYEPHFDHSRENEDRFTD 480
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+ G+R+ATVL YLSDV GG TVF + AV P +GDA
Sbjct: 481 LGMGNRIATVLFYLSDVEAGGATVFTVGK---------------------TAVFPSKGDA 519
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +F+L N +P + H+ CPV+ G+KW + WIH
Sbjct: 520 VFWFNLKRNGKGNPNTRHAACPVLVGQKWVSNWWIH 555
>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 169
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 55/114 (48%), Positives = 77/114 (67%), Gaps = 2/114 (1%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + ISW PR V+ FL+ ECD+L+ +A+ +L+ S V D +G+ SDVRTSSG F+
Sbjct: 56 KPEVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFV 115
Query: 94 --PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
+ K ++ IE +I+ ++ +PKENGE IQVLRYE Q Y PH+DYFSD V+
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTVS 169
>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
Length = 550
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 112/220 (50%), Gaps = 31/220 (14%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P +Y + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 334 LGPLKLEEAHMDPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISK 393
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-------D 142
++ +D +I + + A T L ++ E++QV+ Y G YEPH+D+ +
Sbjct: 394 SAWLKTEEDQVIGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKRAFE 453
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+N+ G+R+ATVL Y+SDV +GG TVF + A+ P+
Sbjct: 454 GLNL---GNRIATVLFYMSDVEQGGATVFTSLH---------------------AALWPK 489
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+G A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 490 KGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKWVSNKWIH 529
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 173 ILAPAKQEDEWDKPRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 232
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 233 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 292
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 293 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 331
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 332 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIH 369
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 QELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 111/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR VY + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 196 IAPFKEEEAYLDPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 255
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T + E E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 256 SAWLQEHEHKHVAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFK 315
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N I++ PR+G
Sbjct: 316 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------ISLWPRKGS 354
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +L N D + H+ CPV+ G KW A KW+H
Sbjct: 355 AAFWHNLKPNGEGDFKTRHAACPVLTGSKWVANKWLH 391
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 105/215 (48%), Gaps = 24/215 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++ + P +Y ++ E L ++A LKR+ V S++ RTS
Sbjct: 318 LAPLKMELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSK 377
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF--SDKVNIV 147
+ P + + + +IA T E +Q + Y G Y+ HYD+F S N+
Sbjct: 378 VAWFPDTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLT 437
Query: 148 R-GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+ G R+ATVL YL+DV +GG TVFPN + AV P+RG A
Sbjct: 438 QMNGDRIATVLFYLTDVEQGGATVFPNIRK---------------------AVFPQRGSA 476
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++++L + P+P +LH+ CPV+ G KW KWI
Sbjct: 477 IIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWI 511
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 73/211 (34%), Positives = 107/211 (50%), Gaps = 34/211 (16%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K +++ +P+ Y ++D E + L ++A+ +L RS +G +S++RTS
Sbjct: 43 PVKEEELWDEPKIIRYHDVISDTEIETLKDIARPELTRSQ-----TGWGVISEIRTSQSV 97
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
F+ + +A I +IA T L E+ E + V Y G +Y PH+D D VN
Sbjct: 98 FLDE--VGTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDAGGD-VN-----E 149
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
R AT L+Y+SDV GG TVF N G+AVKP +G A+ + +
Sbjct: 150 RTATFLIYMSDVEVGGATVFTNV---------------------GVAVKPEKGSAVFWNN 188
Query: 212 LHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
LH N D + H+GCPV+ G KW A KWIH
Sbjct: 189 LHKNGELDLKTKHAGCPVLVGNKWVANKWIH 219
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G + + R S
Sbjct: 325 ILAPAKQEDEWDKPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLETAHYRIS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 385 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 444
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 445 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 483
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 484 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIH 521
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 111/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR Y + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 196 IAPFKEEEAYLDPRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 255
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T + E E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 256 SAWLQEHEHKHVAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFK 315
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N I++ PR+G
Sbjct: 316 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------ISLWPRKGS 354
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW A KW+H
Sbjct: 355 AAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLH 391
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 QELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G + R S
Sbjct: 295 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRIS 354
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 355 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 414
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 415 QELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 453
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 454 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 491
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 113/219 (51%), Gaps = 27/219 (12%)
Query: 29 IINPSKVKQISW-KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRT 87
++ P K +Q W +P Y +++ E D + LAK +L+R+ +++ ++G + + R
Sbjct: 338 VLAPVK-QQDEWDRPYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLETAPYRI 396
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DK 143
S ++ +D ++ I +I T L + E++QV Y G +YEPH+D+ D
Sbjct: 397 SKSAWLTAYEDPVVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDA 456
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ G+R+AT L Y+SDV+ GG TVFP+ G +V P++
Sbjct: 457 FKELGTGNRIATWLFYMSDVSAGGATVFPDV---------------------GASVGPQK 495
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 496 GTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIH 534
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 77/232 (33%), Positives = 117/232 (50%), Gaps = 30/232 (12%)
Query: 17 SLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
L R S+T + P K++ ++ +P VY ++D E LI LA+ +KRSAV D
Sbjct: 297 QLFCRYETSATPFLRLAPLKLEVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAVGD 356
Query: 75 NLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTF-LPKENGEDIQVLRYEHGQKY 133
S ++S +R S + D I+ + + L + + E +QV Y G Y
Sbjct: 357 TRS--EQISKIRISQNAWFENEHDPIVETLNQRARDMAGGLNEPSYELLQVNNYGLGGFY 414
Query: 134 EPHYDYFSDKVNIV--RG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLS 190
HYD+ S N +G G+R+AT++ YLSDV +GG TVFP
Sbjct: 415 SIHYDW-STSANPFPNKGMGNRIATLMFYLSDVQEGGSTVFP------------------ 455
Query: 191 ECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +AV+PR+G A+ +++LH N + +LH+ CPV+ G KW A KWIH
Sbjct: 456 ---RLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLIGSKWVANKWIH 504
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 325 ILAPAKQEDEWDKPRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 385 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 444
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 445 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 483
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 484 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIH 521
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E D + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ + +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR VY + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 196 IAPFKEEEAYLDPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 255
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T + + E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 256 SAWLQEHEHKHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFK 315
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N IA+ P++G
Sbjct: 316 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------IALWPKKGS 354
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW A KW+H
Sbjct: 355 AAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLH 391
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 109/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K + +PR Y +T+ E + + L+K +L+R+ +++ ++G + + R S
Sbjct: 330 IGPVKQEDEWDRPRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISK 389
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKVN 145
++ + ++ I +I T L + E++QV Y G +YEPH+D+ D
Sbjct: 390 SAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFK 449
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+R+AT L Y+SDVA GG TVFP G AVKP +G
Sbjct: 450 ELGTGNRIATWLFYMSDVAAGGATVFPEV---------------------GAAVKPLKGT 488
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 489 AVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIH 525
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 114/218 (52%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP+ G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPDV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR VY + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 318 IAPFKEEEAYLDPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 377
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T + + E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 378 SAWLQEHEHKHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFK 437
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N IA+ P++G
Sbjct: 438 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------IALWPKKGS 476
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW A KW+H
Sbjct: 477 AAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLH 513
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 242 ILAPAKQEDEWDKPRIIRFHDIISDAENEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 301
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 302 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 361
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 362 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 400
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 401 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 438
>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
Length = 541
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/217 (32%), Positives = 114/217 (52%), Gaps = 27/217 (12%)
Query: 30 INPSKVKQISWKPR-AFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++P K++Q++ P A+V+E L D E D ++ K ++RS V S S S+VR S
Sbjct: 325 LSPFKIEQLNVDPYVAYVHE-VLWDSEIDTIMEHGKGNMERSKVGQ--SENSTTSEVRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
T++ + ++ I+ ++ T L E+ E +Q++ Y G +YEPH+D+ D V
Sbjct: 382 RNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDFVEDDGQSVF 441
Query: 149 G--GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RL T L YL+DVA GG T FP +AV P +G
Sbjct: 442 SWKGNRLLTALFYLNDVALGGATAFPFLR---------------------LAVPPVKGSL 480
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L++++LH++ D + H+GCPV++G KW +W HV
Sbjct: 481 LIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHV 517
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 108/217 (49%), Gaps = 26/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
INP + + +++ P VY ++D + D + LA +L R+ V ++++GE + + R S
Sbjct: 325 INPLREETMNFDPWIAVYHQLMSDKDIDDIKALATPRLARATVVNSVTGELEFAKYRISK 384
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
++ + +A I ++ + T L E++Q+ Y G YEPH+DY S +
Sbjct: 385 SGWLKDEEHPTVAKISNRCSALTNLSLSTVEELQIANYGIGGHYEPHFDY-SRLAEVTSF 443
Query: 149 ---GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ TV+ YLSDV GG TVF A G ++P +G
Sbjct: 444 DHWRGNRILTVIFYLSDVEAGGGTVFMTA---------------------GTKLRPEKGA 482
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A ++++LH + D + H+ CPV+ G KW A KW H
Sbjct: 483 AAVWYNLHPDGTGDDETKHAACPVLTGNKWVANKWFH 519
>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
Length = 540
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/217 (32%), Positives = 114/217 (52%), Gaps = 27/217 (12%)
Query: 30 INPSKVKQISWKPR-AFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++P K++Q++ P A+V+E L D E D ++ K ++RS V S S S+VR S
Sbjct: 324 LSPFKIEQLNVDPYVAYVHE-VLWDSEIDTIMEHGKGNMERSKVGQ--SENSTTSEVRIS 380
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
T++ + ++ I+ ++ T L E+ E +Q++ Y G +YEPH+D+ D V
Sbjct: 381 RNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDFVEDDGQSVF 440
Query: 149 G--GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RL T L YL+DVA GG T FP +AV P +G
Sbjct: 441 SWKGNRLLTALFYLNDVALGGATAFPFLR---------------------LAVPPVKGSL 479
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L++++LH++ D + H+GCPV++G KW +W HV
Sbjct: 480 LIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHV 516
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
Length = 547
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ +P +Y + D E + + +A+ + +R+ V ++++G + ++ R S
Sbjct: 331 LGPLKLEEAHQEPYIVIYHDAMYDSEIELIKRMARPRFRRATVQNSVTGALETANYRISK 390
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ +D +I + + A T L ++ E++QV+ Y G YEPH+D+ ++
Sbjct: 391 SAWLKTEEDHVIGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEKRAFE 450
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF + A+ P++G
Sbjct: 451 GLNLGNRIATVLFYMSDVEQGGATVFTSLH---------------------TALFPKKGT 489
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 490 AAFWMNLHRDGEGDVRTRHAACPVLTGTKWVSNKWIH 526
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 111/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 108/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + +PR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 330 ILRPVKQEDEWDRPRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKLTTAHYRVS 389
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ I+A I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 390 KSAWLSGYENPIVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 449
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V PR+G
Sbjct: 450 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPRKG 488
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 489 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIH 526
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 108/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ PSK++ I +P ++ ++D E +I L+ +LKR+ V + SGE ++++ R S
Sbjct: 313 ILQPSKLEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELEVANYRIS 372
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
++ ++ + + T L E++QV+ Y G YE H+D+ D
Sbjct: 373 KSAWLKNHDHEVVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKDAF 432
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT + Y+SDV GG TVFP + G+ V P +G
Sbjct: 433 KQLGTGNRIATWINYMSDVKAGGATVFP---------------------RLGLTVWPEKG 471
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++LH + D ++ H+ CPV+ G KW + KW H
Sbjct: 472 SAAFWWNLHRSGEGDILTRHAACPVLAGSKWVSNKWFH 509
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR VY + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 338 IAPFKEEEAYLDPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 397
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T + + E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 398 SAWLQEHEHEHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFK 457
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N I++ P++G
Sbjct: 458 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------ISLWPKKGS 496
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW A KW+H
Sbjct: 497 AAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLH 533
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 111/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP+ G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPDV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K + P Y FL++ E + + LAK +L R+ V D SG + R S
Sbjct: 326 LLKPIKEEDEWDSPNIVRYLDFLSNEEIEKIKELAKPKLARATVRDPKSGVLTTASYRVS 385
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D IIA + +I T L + E +QV Y G +YEPH+D+ D
Sbjct: 386 KSAWLEGEEDPIIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDEPDAF 445
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ PR+G
Sbjct: 446 KRLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GAAIWPRKG 484
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 485 TAVFWYNLFKSGEGDYRTRHAACPVLVGNKWVSNKWIH 522
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 32 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 91
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 92 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 151
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 152 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 190
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 191 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 228
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 113/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K ++ PR VY + D E + + +A+ + KR+ V + +G ++++ R S
Sbjct: 338 IAPFKEEEAYLDPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISK 397
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + + +A + ++ T + + E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 398 SAWLQEHEHEHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFK 457
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ATVL Y+SDV +GG TVF A N I++ P++G
Sbjct: 458 SLGTGNRIATVLYYMSDVEQGGGTVF-----------TAIN----------ISLWPKKGS 496
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW A KW+H
Sbjct: 497 AAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLH 533
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 109/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 314 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKYLAKPRLSRATVHDPETGKLTTAQYRVS 373
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 374 KSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 433
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 434 RELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 472
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 473 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 510
>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
[Drosophila melanogaster]
Length = 540
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 71/215 (33%), Positives = 112/215 (52%), Gaps = 27/215 (12%)
Query: 32 PSKVKQISWKPR-AFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
P K++Q++ P A+V+E L D E D ++ K ++RS V S S S+VR S
Sbjct: 326 PFKIEQLNIDPYVAYVHE-VLWDSEIDTIMEHGKGNMERSKVGQ--SENSTTSEVRISRN 382
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG- 149
T++ + ++ I+ ++ T L E+ E +Q++ Y G +YEPH+D+ D V
Sbjct: 383 TWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDFVEDDGQSVFSW 442
Query: 150 -GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+RL T L YL+DVA GG T FP +AV P +G L+
Sbjct: 443 KGNRLLTALFYLNDVALGGATAFPFLR---------------------LAVPPVKGSLLI 481
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
+++LH++ D + H+GCPV++G KW +W HV
Sbjct: 482 WYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHV 516
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/227 (32%), Positives = 113/227 (49%), Gaps = 29/227 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I++P KV+ PR Y L+D E + LAK +L R+ V D +G +++ R S
Sbjct: 325 ILSPVKVEDEWDSPRIVRYLNALSDEEIAKIKELAKPKLARATVRDPKTGVLSVANYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK---VN 145
++ + D +IA + ++ T L + E +QV Y G +YEPH+D FS + N
Sbjct: 385 KSAWLEENDDPVIARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFD-FSRRPFDSN 443
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+RLAT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 444 LKTDGNRLATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKGT 482
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDS--FDKIV 250
A+ +++L + D + H+ CPV+ G KW KW H FD +V
Sbjct: 483 AVFWYNLFRSGEGDYRTRHAACPVLVGSKWG--KWTHTQDHHFDSVV 527
>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
Length = 487
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 110/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P +Y + D E + L +A+ + +R+ V + ++G + ++ R S
Sbjct: 271 LAPLKLEEAFLDPYIVIYHDAMFDSEIEVLKRMARPRFRRATVQNAVTGALETANYRISK 330
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + +I + + A T L ++ E++QV+ Y G YEPH+D+ +++
Sbjct: 331 SAWLKTAEHRVIGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEIRAFE 390
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF + +KP++G
Sbjct: 391 GLNLGNRIATVLFYMSDVEQGGATVFTSLH---------------------AVLKPKKGT 429
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 430 AAFWMNLHRSGEGDVRTRHAACPVLTGSKWVSNKWIH 466
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 90 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 149
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 150 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 209
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 210 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 248
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 249 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 286
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 167 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 226
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 227 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 286
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 287 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 325
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 326 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 363
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 21/212 (9%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++ + P +Y ++ E + L LA +LKR+ V D ++ + + RTS
Sbjct: 309 LAPLKMELLQLDPYMVLYHDAISPREIEDLQFLAMPRLKRAKVVDQVTHRNMMVKERTSK 368
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
T++ +A + +I + E +QV+ Y G Y HYD+ +
Sbjct: 369 VTWLGDATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGGHYASHYDFLNATSKTRLN 428
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
G R+ATV+ YLSDV +GG TVFP ++ AV P+RG A+++
Sbjct: 429 GDRIATVMFYLSDVEQGGATVFPKIQK---------------------AVFPQRGTAIIW 467
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
++L N D ++H+ CPVI G KW KWI
Sbjct: 468 YNLKENGDFDTNTIHAACPVIVGSKWVCNKWI 499
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 276 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 335
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 336 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 395
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 396 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 434
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 435 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 472
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
Length = 538
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 109/215 (50%), Gaps = 24/215 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
++P K +Q++ P + L D E + ++ + ++RS V S SK++D RTS
Sbjct: 323 LSPFKFEQLNLDPYVALVHHVLWDSEMEMIMQHGRGSMERSKVGQ--SENSKIADRRTSQ 380
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
T++ + ++ I+ ++ T L E+ E +Q+L Y G +YEPH+D+ D I
Sbjct: 381 NTWLWYDVNPWLSRIKQRLEDVTGLSTESAEPLQLLNYGIGGQYEPHFDFVEDAEKIFGW 440
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
RL T + Y++DVA GG T FP +AV P +G L+
Sbjct: 441 QDDRLMTAIFYINDVALGGATAFPFLR---------------------LAVPPEKGSLLM 479
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
+ +LH++ D S H+GCP+++G KW T+W HV
Sbjct: 480 WNNLHSSLHKDYRSKHAGCPILQGSKWICTEWFHV 514
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 111/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K + +PR Y L++ E + + LAK +L+R+ +++ ++G + + R S
Sbjct: 333 VLGPVKQEDEWDRPRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLETAHYRIS 392
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D ++ I +I T L + E++QV Y G +YEPH+D+ D
Sbjct: 393 KSAWLTAYEDPVVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 452
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L+Y+SDV GG TVF + G AV P++G
Sbjct: 453 KELGTGNRIATWLIYMSDVPSGGATVFTDV---------------------GAAVWPKKG 491
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 492 SAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIH 529
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVLAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 104/206 (50%), Gaps = 26/206 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLA-KSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
PR +V LT EC+ L +L + ++++ + E S RT++ ++ + +
Sbjct: 88 PRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESSTRTNTAAWLEYHQGPV 147
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV----NIVRGGHRLATV 156
+ +E+ +A T ENGE++Q+L Y+ Q+++ H+DYF N GG+RLAT
Sbjct: 148 VTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFDPATDPPENFEPGGNRLATA 207
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
++YL + +GGET F K VKP G A+LF+ L +
Sbjct: 208 IIYLQNAEEGGETDF---------------------MKIDTKVKPEAGSAVLFYDLKPDG 246
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIH 242
D +++HSG P GEKW ATKWIH
Sbjct: 247 SVDKLTIHSGNPPKGGEKWVATKWIH 272
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 112/218 (51%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVLAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 110/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P KV+ + + P A +++ ++D E + + LA +LKR+ V ++ +GE + + R S
Sbjct: 313 IAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISK 372
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKVN 145
++ D +I + +I +T L + E++QV Y G Y+PH+D+ +
Sbjct: 373 SAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFK 432
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+R+ATVL Y+S +GG TVF + G AV P + D
Sbjct: 433 TLNTGNRIATVLFYMSQPERGGATVFNHL---------------------GTAVFPSKND 471
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 472 ALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIH 508
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 290 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 349
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 350 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 409
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 410 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 448
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 449 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 486
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
Length = 537
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 70/217 (32%), Positives = 113/217 (52%), Gaps = 27/217 (12%)
Query: 30 INPSKVKQISWKPR-AFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++P K++Q++ P A+V+E L D E D +I K ++RS V S ++VR S
Sbjct: 321 LSPFKIEQLNIDPYVAYVHE-VLWDSEIDTIIEHGKGNMERSKVGQ--IENSTTTEVRIS 377
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
T++ + ++ I+ ++ T L E+ E +Q++ Y G +YEPH+D+ D V
Sbjct: 378 RNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDFVEDDGKTVF 437
Query: 149 G--GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RL T L YL+DVA GG T FP +AV P +G
Sbjct: 438 SWKGNRLLTALFYLNDVALGGATAFPFLR---------------------LAVPPVKGSL 476
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
L++++LH++ D + H+GCPV++G KW +W HV
Sbjct: 477 LIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHV 513
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + +P Y L++ E + + LAK +L+R+ V D +G+ + R S
Sbjct: 319 VIGPVKQEDEWDRPHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQLTTAPYRVS 378
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + ++ I +I T L ED+QV Y G +YEPHYD+ D
Sbjct: 379 KSAWLGAFEHPVVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFGRKDEPDAF 438
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L+Y+S+V GG TVF + G +V P++G
Sbjct: 439 KELGTGNRIATWLLYMSEVQAGGATVFTDI---------------------GASVSPKKG 477
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++LH + D + H+ CPV+ G KW + KWIH
Sbjct: 478 SAVFWYNLHPSGDGDYRTRHAACPVLLGNKWVSNKWIH 515
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 110/218 (50%), Gaps = 28/218 (12%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K+++ S P +Y ++D E + +I ++K LKRS V ++ S E +S+ RTS
Sbjct: 1 IAPFKLEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMVGESFSKE--VSNERTSQ 58
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKV 144
++ ++ + + T L +++ E +QV Y G Y PH+D+ +
Sbjct: 59 NAWLADYDFELVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPY 118
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT++ YLSDV +GG TVFP G+ V P++G
Sbjct: 119 KDMGLGNRIATLMYYLSDVEQGGATVFPQI---------------------GVGVFPKKG 157
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D +LH CPV+ G KW A KWIH
Sbjct: 158 SAIFWYNLLPDGTGDERTLHGACPVLLGSKWVANKWIH 195
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 109/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + + KP+ ++ + E + LA +L+R+ + ++++G + ++ R S
Sbjct: 320 ILQPAKEEVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAEYRIS 379
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIV 147
++ + ++ + +I +T L + E++QV Y G YEPH+D+ +++N
Sbjct: 380 KSAWLSEDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEINAF 439
Query: 148 RG---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP G + P +G
Sbjct: 440 KSLNTGNRIATFLFYMSDVEAGGATVFPQV---------------------GARLIPEKG 478
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L N D + H+ CPV+ G KW + KWIH
Sbjct: 479 SAAFWYNLLKNGEGDYSTRHAACPVLVGSKWVSNKWIH 516
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 109/222 (49%), Gaps = 28/222 (12%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P KV+ ++ P Y L D E + L ++ Q++RS + ++ + RTS+
Sbjct: 312 IAPFKVELLNRSPYVAAYYDVLNDSEIEELKLMSSPQIRRSLLYNHTLDIDQADVDRTSN 371
Query: 90 GTFIPKGKDAIIAGIEDKIATWT--FLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
F+ + ++ I + A T ++ + ED+QV+ Y G +Y PH DYF +
Sbjct: 372 SVFMEETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQYTPHCDYFDENA--- 428
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G RLATVL YL+DV +GG TVFP ++ P++G AL
Sbjct: 429 ENGDRLATVLFYLTDVQQGGATVFPFLR---------------------LSYFPKKGSAL 467
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI 249
+F +L D S HS CPV+ G KW ATKWI+ FD++
Sbjct: 468 IFRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIY--HFDQM 507
>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
Length = 557
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 85/250 (34%), Positives = 124/250 (49%), Gaps = 53/250 (21%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
+F+S+AI+ +S P FV+E FL + EC+ L LA LKRS V D KL
Sbjct: 308 TFNSSAIV----AYCVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRVTDG-----KL 358
Query: 83 SDVRTSSGTFI--PKGKDAIIAGIEDK----------IATWTF--LPKENGEDIQVLRYE 128
S+ RTSS F+ KGK+ ++ IE + + T F L + E +Q++RY
Sbjct: 359 SNGRTSSSCFLIGAKGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYG 418
Query: 129 HGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAE----EPP------ 178
+KY H+D N R+AT + YLSD +GG T FP AE EP
Sbjct: 419 KNEKYTSHFD------NKAGSFRRVATFMCYLSDQCEGGCTNFPKAEPLFLEPSFDEHGA 472
Query: 179 ------RRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI-PDPVSLHSGCPVIE 231
+++T A+ + G+ + P+ G A+LFFS+ +P+SLH G V +
Sbjct: 473 FKPFGRKKKTVASE-------QHGVKIHPKLGRAILFFSISEEPFRENPLSLHEGQTVRK 525
Query: 232 GEKWSATKWI 241
GEK+ TKW+
Sbjct: 526 GEKFICTKWL 535
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 115/219 (52%), Gaps = 28/219 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P++ + +S +P +Y F++D E + + + A+ L+RS VA + ++ R S
Sbjct: 74 LLRPARRETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATR--DKQVTAEYRIS 131
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKEN--GEDIQVLRYEHGQKYEPHYDYF---SDK 143
++ + ++ ++ +I+ T L ++ GE +QV+ Y G YEPH+D+ S
Sbjct: 132 KSAWLKGSAQSAVSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSP 191
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
V ++ G+R+ATV++YLS V GG T F A +V +
Sbjct: 192 VFKLKTGNRVATVMIYLSSVEAGGSTAFIYAN---------------------FSVPVMK 230
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++LH N DP +LH+GCPV+ G+KW A KWIH
Sbjct: 231 NAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKWVANKWIH 269
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 33/255 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K +++S P +Y + E D + L +++ R+ V L+ +S +S+VRTS
Sbjct: 315 LLAPLKAEELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMV--TLTNQSTVSNVRTS 372
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK----- 143
TFI K + ++ I+ ++A T L + ED Q Y G Y H D+F++
Sbjct: 373 QITFIAKTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNG 432
Query: 144 -VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
V+ G+R+ATVL YLSDVA+GG T FP ++ ++P+
Sbjct: 433 LVSSTEMGNRIATVLFYLSDVAQGGGTAFPYLKQ---------------------HLRPK 471
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNAS 262
+ A + +LH D + H CP+I G KW +WI + + + C + S
Sbjct: 472 KYAAAFWHNLHAAGRGDARTQHGACPIIAGSKWVLNRWIR----EFVQSDRRPCLLWDDS 527
Query: 263 CERWAALGECTKNPE 277
+A + E KN E
Sbjct: 528 LATYAQIMELAKNQE 542
>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
Length = 531
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 111/222 (50%), Gaps = 30/222 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P KV+++ P ++ + D E +++ A +L+R+ V + +GE + +D R S
Sbjct: 315 IRPLKVEELHSDPPIWMLRDVMYDSEIEYIKRTATPKLRRATVTNLKTGELEFADYRISK 374
Query: 90 GTFIPKGKD----AIIAGIEDKIATWTFLPK--ENGEDIQVLRYEHGQKYEPHYDYFSDK 143
++ +D I+ + + + T L + E +Q++ Y YEPH+D+ ++
Sbjct: 375 SGWLEDPRDDNEEKILNRVNRRTSIITGLDTTPRSAEALQIVNYGAAGHYEPHFDHATEA 434
Query: 144 VNIVRG---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
V+ + G+R+ATVL Y+SDV GG TVF +AE VK
Sbjct: 435 VSSILKLGIGNRIATVLYYMSDVEAGGATVFVDAE---------------------AIVK 473
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
P +GDA +++LH N D + H+ CP+I G KW KWIH
Sbjct: 474 PSKGDAAFWYNLHKNGKGDERTRHAACPIIVGSKWVCNKWIH 515
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 109/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVLAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 109/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L R+ V D +G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVLAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 518
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 109/216 (50%), Gaps = 34/216 (15%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD---NLSGESKLSDVR 86
+ P K++ + P ++ L+ E D L +A+ L+RS V N+ G+ R
Sbjct: 307 LAPIKMEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERSMVVKYRANVQGKH-----R 361
Query: 87 TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVN 145
S+GT++ + + + IE +IA L E E V+ Y G +Y+ H+D+F +D V
Sbjct: 362 ISAGTWVERKYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVE 421
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+RLATVL Y++DV +GG TVFP + G V+ +RG+
Sbjct: 422 ----DNRLATVLFYMNDVEQGGATVFP---------------------RLGQTVRAKRGN 456
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
AL ++++ N D +LH GCP++ G KW T+WI
Sbjct: 457 ALFWYNMQHNGTVDDRTLHGGCPILVGSKWIFTQWI 492
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 110/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ + + P A ++ +TD E + LA +L+R+ V ++++GE + + RTS
Sbjct: 320 LAPFKVEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGELETASYRTSK 379
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + ++ I +I T L +E E++QV Y G Y+PH+D+ ++VN +
Sbjct: 380 SAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQ 439
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+RLAT+L Y++ GG TVF + V P + D
Sbjct: 440 SLNTGNRLATLLFYMTQPESGGATVFTEVK---------------------TTVMPSKND 478
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 479 ALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIH 515
>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
Length = 487
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 112/217 (51%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P +Y + D E + L +A+ + +R+ V ++++G + ++ R S
Sbjct: 271 LAPLKLEEAFMDPYIVIYHDAMYDSEIEVLKRMARPRFRRATVQNSVTGALETANYRISK 330
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--- 146
++ + II + + A T L ++ E++QV+ Y G YEPH+D+ + +
Sbjct: 331 SAWLKTPEHEIIGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKLAFE 390
Query: 147 -VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+R+AT+L Y+SDV +GG TVF + RT A+ P++G
Sbjct: 391 GLNLGNRIATMLFYMSDVQQGGATVFTSL------RT---------------ALWPKKGT 429
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 430 AAFWMNLHRSGEGDARTRHAACPVLTGSKWVSNKWIH 466
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 111/218 (50%), Gaps = 29/218 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV++++ P +Y + E D + NL ++++ R+ V + S++S VRTS
Sbjct: 318 LAPLKVEELNHNPLLVLYHDVIYQSEIDVIRNLTENEISRATVIG--AKGSEVSKVRTSQ 375
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF------SDK 143
TFIPK + ++ I+ ++A + L + E Q Y G Y H D+F ++
Sbjct: 376 FTFIPKTRHKVLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFGQDAFDNEL 435
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
V+ G+R+ATVL YLSDVA+GG T FP+ ++ ++P++
Sbjct: 436 VSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQ---------------------LLQPKK 474
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
A + +LH + + D +LH CP+I G KW +WI
Sbjct: 475 YAAAFWHNLHASGVGDLRTLHGACPIIAGSKWVQNRWI 512
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 110/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ + + P A ++ +TD E + LA +L+R+ V ++++GE + + RTS
Sbjct: 320 LAPFKVEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGELETASYRTSK 379
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + ++ I +I T L +E E++QV Y G Y+PH+D+ ++VN +
Sbjct: 380 SAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQ 439
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+RLAT+L Y++ GG TVF + V P + D
Sbjct: 440 SLNTGNRLATLLFYMTQPESGGATVFTEVK---------------------TTVMPSKND 478
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 479 ALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIH 515
>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 193
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/214 (34%), Positives = 112/214 (52%), Gaps = 28/214 (13%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLA--KSQLKRSAVADNLSGESKLSDVRTSSGTF 92
VK +S PRAF E FLTD+E DH++ L K+ ++RS S +S+ RTSS T+
Sbjct: 1 VKALSCAPRAFQVENFLTDVEADHIVGLVQKKNDMQRS------STNGHISETRTSSTTW 54
Query: 93 IPKGKDAIIAGIEDKIATW-----TFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV 147
+ + D +I I ++A L + ED+Q++ Y GQ+Y H+D+ K +
Sbjct: 55 LARHSDPVIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHDFGYPKGD-P 113
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
R MYL+DV GG+T FP R R TN L+ V P++G A+
Sbjct: 114 GSPSRSINFCMYLNDVPAGGQTSFP------RWRNAETNGALN--------VVPKKGTAM 159
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+F+ ++ + D ++ H+ PVIEGEK+ + WI
Sbjct: 160 IFYMVNPDGNLDDLTHHAALPVIEGEKFFSNLWI 193
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 115/220 (52%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRT 87
++ P++ + +S +P +Y F+TD E + + +LA L+RS VA +GE + +D R
Sbjct: 306 MLMPARRELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVA---AGEKQATADYRI 362
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKEN--GEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ +I+ ++ +I+ T L ++ GE +QV+ Y G YEPH+D+ S
Sbjct: 363 SKSAWLKGSAQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSS 422
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
V ++ G+R+AT ++YLS V GG T F A +V
Sbjct: 423 PVFKLKTGNRVATFMIYLSPVEAGGSTAFIYA---------------------NFSVPVV 461
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++LH N D +LH+GCPV+ G+KW A KWIH
Sbjct: 462 EKAAIFWWNLHRNGEGDDDTLHAGCPVLIGDKWVANKWIH 501
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/216 (31%), Positives = 107/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
II P K + P Y L+D E + + LAK +L R+ V D +G +++ R S
Sbjct: 323 IIAPFKEEDEWDSPHIVRYYEVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV- 147
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ +I
Sbjct: 383 KSSWLEEEDDLVVARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFDFSRRPFDITL 442
Query: 148 -RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V PR+G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPRKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 520
>gi|451927223|gb|AGF85101.1| 4-hydroxylase [Moumouvirus goulette]
Length = 239
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 107/208 (51%), Gaps = 33/208 (15%)
Query: 45 FVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGI 104
F+ + F+ +C ++N +S+L S V +SG++K +R S ++ K D ++ +
Sbjct: 57 FIIKNFINKEKCGEIMNNTQSKLFDSEV---ISGKNKA--IRNSQQCWVSK-YDPMVKSM 110
Query: 105 EDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF---SDKVN--IVRGGHRLATVLMY 159
KI+ +P +N ED+QV+RY GQ Y H+D +DK N I RGG R TVL+Y
Sbjct: 111 FQKISQQFNIPIQNAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLIY 170
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIP- 218
L++ +GG T F N G+ VKP GDA++F+ L N
Sbjct: 171 LNNEFEGGHTFFKNL---------------------GLKVKPETGDAIVFYPLAKNTSKC 209
Query: 219 DPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P+SLH+G PV GEKW A W SF
Sbjct: 210 HPLSLHAGMPVTNGEKWIANLWFRERSF 237
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 104/219 (47%), Gaps = 26/219 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I P K+++ KP +Y + D E + + LA + KR+ V ++ +G+ + + R S
Sbjct: 334 FIQPLKMEEAFLKPLLVIYHDVIFDEEIETVKKLAHPRFKRTTVMNSATGKLETAKYRIS 393
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
F+ + + + ++ T L ED+QV Y G YEPH+DY I
Sbjct: 394 KAAFLKNKEHHHVLKMSRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETIGF 453
Query: 149 GG-----HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+R+AT L Y+SDV GG TVF PA N +A+ P++
Sbjct: 454 NKDSGWRNRIATWLFYMSDVEAGGATVF-----------PALN----------VALWPQK 492
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A +++L N + ++ H+ CPV+ G KW A KWIH
Sbjct: 493 GSAAFWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIH 531
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L+R+ +++ ++G + + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRIS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 520
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 106/221 (47%), Gaps = 25/221 (11%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S ++ P K + P Y L+D E + + LAK +L R+ V D +G +
Sbjct: 318 SPRLLLKPIKEEDEWDNPHIVRYLNILSDQEIEKIKELAKPRLARATVRDPKTGVLTTAP 377
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK- 143
R S ++ D +I + +I T L E E +QV Y G +YEPH+D FS +
Sbjct: 378 YRVSKSAWLEGEDDPVIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRP 436
Query: 144 --VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
N+ G+RLAT L Y+SDV GG TVFP+ G ++ P
Sbjct: 437 FDSNLKVDGNRLATFLNYMSDVEAGGATVFPDF---------------------GASIWP 475
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R+G A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 476 RKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIH 516
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 102/205 (49%), Gaps = 25/205 (12%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR +Y L+D E + LA + KR+ V ++ +G+ +++ R S ++ +
Sbjct: 334 PRIVLYHDVLSDREIKTIQQLAVPRFKRATVQNSETGKLEVAHYRISKSAWLEDVDHPYV 393
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKVNIVRGGHRLATVL 157
A + ++ T L E +QV+ Y G YEPH+D+ + + G+R+AT+L
Sbjct: 394 AKVSQRVEDITGLNMATAESLQVVNYGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATIL 453
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
Y+SDV++GG TVFP + +++ P++G A +++L N
Sbjct: 454 FYMSDVSQGGATVFPGIK---------------------VSLWPKKGTAAFWYNLRKNGE 492
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIH 242
D ++ H+ CPV+ G KW KWIH
Sbjct: 493 GDYLTRHAACPVLTGSKWVCNKWIH 517
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 109/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ + + P A +++ + D E + + LA +LKR+ V ++ +GE + + R S
Sbjct: 313 LAPIKVEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISK 372
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKVN 145
++ D +I + +I +T L + E++QV Y G Y+PH+D+ +
Sbjct: 373 SAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFK 432
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+R+ATVL Y+S +GG TVF + G AV P + D
Sbjct: 433 TLNTGNRIATVLFYMSQPERGGATVFNHL---------------------GTAVFPSKND 471
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 472 ALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIH 508
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L+R+ +++ ++G + + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRIS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 520
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 105/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K + P Y L+D E + + LAK +L R+ V D +G ++ R S
Sbjct: 324 LLKPVKEEDEWDSPHIVRYLDLLSDEEIEKIKELAKPRLARATVRDPKTGVLTTANYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +I + +I T L E E +QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLEGEEDPVIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ PR+G
Sbjct: 444 KRLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GAAIWPRKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
++ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 483 TSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIH 520
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 99/200 (49%), Gaps = 25/200 (12%)
Query: 46 VYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI-PKGKDAIIAGI 104
VYE L++ EC +LI + LK S V D ++G K+ VRTS I P D I +
Sbjct: 180 VYESILSEYECRYLITKFNALLKPSMVVDPVTGRGKIDSVRTSYVAVIEPAHCDWITRKL 239
Query: 105 EDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD---YFSDKVNIVRGGHRLATVLMYLS 161
+ I+ T ++NGE + +LRY GQ+Y+PHYD +D + G R+ T L+YL+
Sbjct: 240 DKTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDALMFKDGKQRIKTALVYLN 299
Query: 162 DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPV 221
+++GGET+FP K I + P+ G ++F + N
Sbjct: 300 TISEGGETLFP---------------------KLDIRIAPKSGTMVVFSNSDENGKLLLN 338
Query: 222 SLHSGCPVIEGEKWSATKWI 241
S H+G P + KW TKWI
Sbjct: 339 SYHAGAPTVSENKWLVTKWI 358
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L+R+ +++ ++G + + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRIS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 520
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/226 (31%), Positives = 110/226 (48%), Gaps = 30/226 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K +++ P +Y + E D + L +++LKR+ V + ES +S+VRTS
Sbjct: 290 IAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLKRATVTGH--NESVVSNVRTSQ 347
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK------ 143
TFIP +++ I+ ++A T L + ED Q Y G Y H D+F
Sbjct: 348 FTFIPVSAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGL 407
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
++ G+R+ATVL YLSDV++GG T FP RT +KP++
Sbjct: 408 ISSPEMGNRIATVLFYLSDVSQGGGTAFPQL------RT---------------LLKPKK 446
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI-HVDSFDK 248
A + +LH + + D + H CP+I G KW +WI VD D+
Sbjct: 447 YAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIREVDQSDR 492
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 100/210 (47%), Gaps = 29/210 (13%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS-SGTFIPKGK 97
W R F FLT EC H+I+ ++ L+ + V D SG VRTS G F P +
Sbjct: 120 GWDVRLF--RQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHPVRTSDGGIFGPARE 177
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I I +IA + GE + +LRY GQ+Y H+D N R T+L
Sbjct: 178 DLVIQAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCLPHVRN-----QRAWTML 232
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
+YL++ GGET+FP + G++VK R+GDALLF +
Sbjct: 233 IYLNEGYAGGETIFP---------------------RLGLSVKGRKGDALLFRNTDAQGQ 271
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
++H G PV+ G+KW T+WI D D
Sbjct: 272 AAEAAVHLGAPVMAGQKWLCTRWIRHDRHD 301
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 111/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P + + +PR + +++ E + + L+K +L+R+ +++ ++G + + R S
Sbjct: 331 ILGPVRQEDEWDRPRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLETAHYRIS 390
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ ++A I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 391 KSAWLSGYENPVVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 450
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDVA GG TVFP G +V P++G
Sbjct: 451 KELGTGNRIATWLFYMSDVAAGGATVFPEV---------------------GASVWPKKG 489
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 490 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIH 527
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 105/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++ E + + LAK +L+R+ V D +G+ + R S
Sbjct: 311 LIGPVKQEDEWDSPYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKLTTAQYRVS 370
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
++ + I+ I +I T L ED+QV Y G +YEPH+D+ +D
Sbjct: 371 KSAWLGSHEHPIVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAF 430
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L+Y+SDV GG TVF + G V P++G
Sbjct: 431 EELGTGNRIATWLLYMSDVQAGGNTVFTDI---------------------GAVVWPKKG 469
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++LH + D + H+ CPV+ G KW + KWIH
Sbjct: 470 TAVFWYNLHRSGEGDYRTRHAACPVLVGNKWVSNKWIH 507
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/218 (30%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V PR+G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPRKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVFNKWLH 520
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 107/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y L+D E + + LAK +L R+ V D +G +++ R S
Sbjct: 324 LIAPFKEEDEWDSPHIVRYYDVLSDEEIERIKELAKPKLARATVRDPKTGVLTVANYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ +G D +IA + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 384 KSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 443
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G + P++G +
Sbjct: 444 KTEGNRLATFLNYMSDVEAGGATVFPDF---------------------GATIWPKKGTS 482
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 483 VFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFH 518
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L D R
Sbjct: 333 LLQPIRKEVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVDYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A +V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH N D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 KNAALFWWNLHRNGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 103/218 (47%), Gaps = 32/218 (14%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K++ I P +Y ++ E L +AK QLKR+ V ++ +LS RT+
Sbjct: 278 PLKMELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSKTRTAKLA 337
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGH 151
+ + + + +I T E +QV+ Y G Y H+DYF N +G H
Sbjct: 338 WFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYF----NTTKGPH 393
Query: 152 -------RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
R+ATVL YL+DV +GG TVFP ++ AV P+RG
Sbjct: 394 ITQINGDRIATVLFYLNDVEQGGATVFPEIKK---------------------AVFPKRG 432
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+++++L + + +LH+GCPVI G KW KWI
Sbjct: 433 SAIMWYNLKDDGEGNRDTLHAGCPVIVGSKWVCNKWIR 470
>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
Length = 550
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 110/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P ++ + D E D + +A+ + +R+ V ++++G + ++ R S
Sbjct: 334 LGPLKLEEAHADPYIVIFHDAMYDGEIDLIKRMARPRFRRATVQNSVTGALETANYRISK 393
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + +I + + A T L ++ E++QV+ Y G YEPH+D+ ++
Sbjct: 394 SAWLKTPEHRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEQRAFE 453
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+ATVL Y+SDV +GG TVF + A+ P++G
Sbjct: 454 GLNLGNRIATVLFYMSDVEQGGATVFTSLH---------------------TALFPKKGT 492
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 493 AAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIH 529
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 105/221 (47%), Gaps = 32/221 (14%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA---DNLSGESKLSDV 85
+ +P V+ +S +P +Y LT+ E L LA LKR+ V D GE +
Sbjct: 339 LFSPLNVEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEE--TTY 396
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS--DK 143
R S ++ K + I I L E E +Q+ Y G YEPH D+ DK
Sbjct: 397 RISKTAWLDKEDHPAVKRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDK 456
Query: 144 VNIV----RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAV 199
+ R G+R+ATVL+YLS+V GG TVFP K G+ V
Sbjct: 457 EALSEYTSRIGNRIATVLIYLSNVEAGGATVFP---------------------KAGVRV 495
Query: 200 KPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKW 240
+PR+G A ++++H N + +S+H+ CPV+ G KW+A W
Sbjct: 496 EPRQGSAAFWYNMHRNGEGNKLSVHAACPVLIGSKWAANLW 536
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y L+D E + + LAK +L R+ V D +G +++ R S
Sbjct: 324 LIAPFKEEDEWDSPHIVRYYDVLSDEEIERIKELAKPKLARATVRDPKTGVLTVANYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ +G D +IA + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 384 KSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRKGEQDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G + P++G
Sbjct: 444 KHLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GATIWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
++ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 483 TSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFH 520
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 107/231 (46%), Gaps = 34/231 (14%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
+F++T + P K +QI KP +Y L+ E L+ A +K + V + +
Sbjct: 305 NFTTTPFLRLAPLKTEQIGLKPYVVLYHEVLSAREISMLMGKAAQNMKNTRVQSEKAVNT 364
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
RT+ G ++ K + + I +I T + ED QV+ Y G Y H+DYF
Sbjct: 365 NRE--RTAKGYWLKKESNEMTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYF 422
Query: 141 SDKVNIVRG---------GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
+ G G R+ATVL YL+DV +GG TVF N
Sbjct: 423 GFASSNYTGERSHHSIVLGDRIATVLFYLTDVEQGGATVFGNV----------------- 465
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G +V P+ G A+ +++L T+ DP++ H+ CPV+ G KW T+WIH
Sbjct: 466 ----GYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVVVGSKWVMTEWIH 512
>gi|224011205|ref|XP_002295377.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209583408|gb|ACI64094.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 207
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 77/218 (35%), Positives = 112/218 (51%), Gaps = 33/218 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRS---AVADNLSG--ESKLSDVRTSSGTFIPKG 96
P EGFL+D EC+ I L + +RS A NL G +SK S RTS+ T+ +G
Sbjct: 9 PWVVAIEGFLSDEECNRFIELGGDRYERSTEYASTMNLDGTFDSKESSGRTSTNTWCGEG 68
Query: 97 --KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
D II + +++ + T +P N ED+Q++RYE GQ+YE H+DY S + G R+
Sbjct: 69 CRDDPIIKKVIERMESLTGIPYANFEDLQLVRYEIGQRYEEHHDYSSSHEG-TQYGPRIL 127
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
TV YL+DV +GG T F D+L +P+RG AL++ S T
Sbjct: 128 TVFFYLNDVEEGGGTQF---------------DELD------FVTEPKRGMALIWPST-T 165
Query: 215 NAIPDPV---SLHSGCPVIEGEKWSATKWIHVDSFDKI 249
N PD + + H PV +G K+ A WIH+ + +
Sbjct: 166 NEAPDVMDDWTWHEALPVTKGIKYGANTWIHLRDYQNV 203
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 520
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 109/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ + + P A + +TD E + LA +L+R+ V ++++GE + + RTS
Sbjct: 320 LAPFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSK 379
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + I+ I +I T L +E E++QV Y G Y+PH+D+ ++VN +
Sbjct: 380 SAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQ 439
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+RLAT+L Y++ GG TVF + V P + D
Sbjct: 440 SLNTGNRLATLLFYMTQPESGGATVFTEVK---------------------TTVMPSKND 478
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 479 ALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIH 515
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 113/226 (50%), Gaps = 33/226 (14%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLA-KSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
++QI+ PR F LT EC+HL+ LA + L ++ + + + S RT+ G ++
Sbjct: 57 LEQINASPRVFRIRNLLTKQECEHLMLLAFRKGLSKTMIMPYGTHKLVESTTRTNDGAWL 116
Query: 94 PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHG-QKYEPHYDYFSDKVN----IVR 148
+D ++ +E+ + T + GE++QVL Y +G Q ++ HYDYF + +
Sbjct: 117 DFLQDDVVRRLEETLGKLTKTTPQQGENLQVLHYSNGAQFFQEHYDYFDPARDPPESFEQ 176
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
GG+R TV++YL +GGET FP G+ + + GDAL+
Sbjct: 177 GGNRYITVIVYLEAALEGGETHFPEL---------------------GLKLTAQPGDALM 215
Query: 209 FFSLH---TNAIPDPV---SLHSGCPVIEGEKWSATKWIHVDSFDK 248
F++L + PD V ++H+ P + GEKW A KWIH + K
Sbjct: 216 FYNLKEHCSGTDPDCVEKKTIHAALPPVRGEKWVAVKWIHEKPYQK 261
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 109/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ + + P A + +TD E + LA +L+R+ V ++++GE + + RTS
Sbjct: 321 LAPFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSK 380
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + I+ I +I T L +E E++QV Y G Y+PH+D+ ++VN +
Sbjct: 381 SAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQ 440
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+RLAT+L Y++ GG TVF + V P + D
Sbjct: 441 SLNTGNRLATLLFYMTQPESGGATVFTEVK---------------------TTVMPSKND 479
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 480 ALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIH 516
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 110/216 (50%), Gaps = 27/216 (12%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P KV+ + + P +++ ++D E + + LA +LKR+ V + +G+ + ++ R S
Sbjct: 315 PIKVEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDLEYANYRISKSA 374
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI----- 146
++ I I +I T L +E E++Q Y G Y+PH+D F+ K +I
Sbjct: 375 WLKGTDHPAIDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFD-FARKEDINAFKT 433
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+ G+R+AT+L+Y+SDV GG TVF + G AV P + DA
Sbjct: 434 LNTGNRIATILIYMSDVESGGATVFNHL---------------------GNAVFPSKYDA 472
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
L +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 473 LFWYNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIH 508
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 107/217 (49%), Gaps = 27/217 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P KV++++ P +Y + E D L L + + +R+ V N S +S RTS
Sbjct: 322 LLAPLKVEELNRNPLLVLYHDVIYQSEIDVLNKLNRKRYERAGVVIN--STSTVSKKRTS 379
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF--SDKVNI 146
FI + ++ I+ ++A T L + ED Q+ Y G Y H+D+F SD N
Sbjct: 380 QHIFIAATRHKVLRTIDQRVADMTNLNMQYAEDHQLADYGIGGHYSQHFDWFGNSDLANS 439
Query: 147 V--RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
G+R+ATVL YLSDVA+GG T FP ++ +KP++
Sbjct: 440 KCDEMGNRIATVLFYLSDVAQGGGTAFPILKQ---------------------LLKPKKY 478
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
A +++LH + D +LH GCP+I G KW +WI
Sbjct: 479 AAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWI 515
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 114/231 (49%), Gaps = 28/231 (12%)
Query: 17 SLLIRKSFSSTAIINPSKVKQ--ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
+L R S++ + + +KQ ++ P VY +D E + +I L + Q+ RS V D
Sbjct: 293 NLYCRYHMSTSPFLRLAPLKQEVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMVGD 352
Query: 75 NLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTF-LPKENGEDIQVLRYEHGQKY 133
+ + ++S RTS +++ ++A + + L + E +QV Y G Y
Sbjct: 353 --AAKKEVSKSRTSQNSWLTDYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGIGGHY 410
Query: 134 EPHYDYFSDKVNI--VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
PHYD+ ++ + G+R+AT++ YLSDV +GG TVFP+
Sbjct: 411 LPHYDWSREENPYPELNTGNRIATLMFYLSDVEEGGATVFPHL----------------- 453
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G+ V P++G A+ +++L + D +LH CPV+ G KW A KWIH
Sbjct: 454 ----GVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSKWVANKWIH 500
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 277 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVS 336
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 337 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 396
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 397 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 435
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 436 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 473
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 520
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 112/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P K + +S +P +Y F+TD E + + A+ L+RS VA SGE++ + + R
Sbjct: 306 LLLPVKREVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA---SGENQATVEYRI 362
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ + I+ ++ +I+ T L E +QV+ Y G YEPH+D+ S
Sbjct: 363 SKSAWLKGSESCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 422
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
V ++ G+R+AT ++YLS V GG T F A +V
Sbjct: 423 PVFKLKTGNRVATFMIYLSSVEAGGSTAFIYA---------------------NFSVPVL 461
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ A+ +++LH N D +LH+GCPV+ G+KW A KW+H
Sbjct: 462 KKAAIFWWNLHRNGRGDAETLHAGCPVLIGDKWVANKWVH 501
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 324 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 520
>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
Length = 487
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 110/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ P +Y + D E + + +A+ + +R+ V ++++G + ++ R S
Sbjct: 271 LAPLKLEEAYMDPYIVIYHDAMYDSEIEIIKRMARPRFRRATVQNSVTGALETANYRISK 330
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + +I + + A T L ++ E++QV+ Y G YEPH+D+ ++
Sbjct: 331 SAWLKTAEHRVIGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKRAFE 390
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+AT+L Y+SDV +GG TVF + A+ P++G
Sbjct: 391 GLNLGNRIATMLFYMSDVEQGGATVFTSLH---------------------AALWPKKGT 429
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 430 AAFWMNLHRSGEGDVRTRHAACPVLTGSKWVSNKWIH 466
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K + KPR + ++D E + + LAK +L R+ V D +G+ + R S
Sbjct: 304 ILGPVKQEDEWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVS 363
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 364 KSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 423
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 424 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 462
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW+H
Sbjct: 463 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLH 500
>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
Length = 210
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 100/210 (47%), Gaps = 30/210 (14%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+S P + + L EC H+I + ++LK + V+ N G LS RT + ++
Sbjct: 6 LSQDPLIYYVDNVLNKQECYHIIKITSNKLKPALVSGNSRGF--LSTGRTGTNCWLSHKN 63
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKVN--IVRGGH 151
D I I KI P EN E+ QVL Y QKYE HYD F S+K + +GG
Sbjct: 64 DEITFNIALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAFPIDNSEKAKRCLKKGGQ 123
Query: 152 RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFS 211
RL T L+YL++V KGGET F N I + P+ G L+F +
Sbjct: 124 RLLTALIYLNNVTKGGETEFKNL---------------------NIKITPKIGRILVFEN 162
Query: 212 LHTNAI-PDPVSLHSGCPVIEGEKWSATKW 240
N++ P SLHSG VIEGEK+ W
Sbjct: 163 TLQNSLNKHPDSLHSGKQVIEGEKYVINLW 192
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K + +PR Y ++D E + + +AK +L+R+ +++ ++G + + R S
Sbjct: 324 LLAPVKQEDEWDRPRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLETAPYRIS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + + I I +I T L + E++QV Y G +YEPH+D+ D
Sbjct: 384 KSAWLSGYEHSTIERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVF + G AV P++G
Sbjct: 444 KELGTGNRIATWLFYMSDVSAGGATVFTDV---------------------GAAVWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 483 TAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIH 520
>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 490
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 114/220 (51%), Gaps = 33/220 (15%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLA-KSQLKRSAVADNL----SGESKLSDVRTSSG-- 90
+S P ++ FLTD EC+ +I L K++ +RS + S +S +S RTS
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKGRTSENAW 343
Query: 91 -TFIPKGKDAIIAG-IEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
+F K ++ A I D+I+T T +P + ED Q+L+YE GQ Y H+DY + R
Sbjct: 344 CSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEK-RR 402
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G R+ T +YLSDV +GG+T FP K IAVKP++G A+L
Sbjct: 403 CGPRVLTFFLYLSDVEEGGDTNFP---------------------KLSIAVKPKKGSAVL 441
Query: 209 FFS-LHTN-AIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ S L +N ++ DP + H V+ G K+ A W+H+ +
Sbjct: 442 WPSVLDSNPSMKDPRTDHEAQEVVNGTKFGANAWLHLHDY 481
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 112/243 (46%), Gaps = 40/243 (16%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K +++ P +Y + E D + L +++L R+ V + ES +S+VRTS
Sbjct: 318 IAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATVTGH--NESLVSNVRTSQ 375
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK------ 143
TFIP +++ I+ ++A T L + ED Q Y G Y H D+F
Sbjct: 376 FTFIPASAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGL 435
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
V+ G+R+ATVL YLSDV++GG T FP RT +KP++
Sbjct: 436 VSSPEMGNRIATVLFYLSDVSQGGGTAFPQL------RT---------------LLKPKK 474
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCTDNNASC 263
A + +LH + + D + H CP+I G KW +WI FD+ + C
Sbjct: 475 YAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIR--EFDQ---------SDRRPC 523
Query: 264 ERW 266
E W
Sbjct: 524 ELW 526
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 105/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K + P Y L+D E + + LAK +L R+ V D +G ++ R S
Sbjct: 326 LLQPMKEEDEWDSPHIVRYLNALSDSEIEKIKELAKPRLARATVRDPKTGVLTTANYRVS 385
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ +D +I + +I T L + E +Q+ Y G +YEPH+D+ D
Sbjct: 386 KSAWLEGEEDPVIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAF 445
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 446 KTLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GAAIYPKKG 484
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 485 TAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWIH 522
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K+++ S P Y L+ + L +A ++RS V G+ S R S
Sbjct: 314 LLAPLKLEEHSLDPLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKSAFRVS 373
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++P ++ + + T L E +QV Y G YEPH+D+F D +
Sbjct: 374 KNAWLPYSTHPMMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPA 433
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLSDV +GG T FP AV+P+ G+ L
Sbjct: 434 AEGNRIATAIFYLSDVEQGGATAFPFL---------------------NFAVRPQLGNIL 472
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH ++ D + H+GCPV++G KW A WIH
Sbjct: 473 FWYNLHRSSDEDYRTKHAGCPVLKGSKWIANIWIH 507
>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
Length = 547
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 105/218 (48%), Gaps = 29/218 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K +++ P +Y + E D + L +++L R+ + + ES +S+VRTS
Sbjct: 321 IAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSH--NESVVSNVRTSQ 378
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK------ 143
TFIP +++ I+ ++A T L + ED Q Y G Y H D+F
Sbjct: 379 FTFIPVTAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGL 438
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
V+ G+R+ATVL YLSDVA+GG T FP RT +KP++
Sbjct: 439 VSSPEMGNRIATVLFYLSDVAQGGGTAFPQL------RT---------------LLKPKK 477
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
A + +LH + + D + H CP+I G KW +WI
Sbjct: 478 YAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWI 515
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 72/217 (33%), Positives = 104/217 (47%), Gaps = 27/217 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ PSKV S VYE L++ EC +LI + LK S V D ++G K+ VRTS
Sbjct: 165 VYEPSKVLDKSLPIE--VYESILSEYECRYLIAKFSALLKPSMVVDPVTGRGKIDSVRTS 222
Query: 89 SGTFI-PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD---YFSDKV 144
I P D I ++ I+ T ++NGE + +LRY GQ+Y+PHYD +D +
Sbjct: 223 YVAVIEPTHCDWITRKLDKIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLNEINDAL 282
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
G R+ T L+YL+ + +GGET+FP K I + P+ G
Sbjct: 283 MFKDGKQRIKTALVYLNTINEGGETLFP---------------------KLDIRIAPKSG 321
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
++F + N S H+G P + KW TKWI
Sbjct: 322 TMVVFSNSDENGKLLLNSYHAGAPTVSENKWLVTKWI 358
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + LAK +L R+ V D +G ++ R S
Sbjct: 325 VIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 385 KSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTL 444
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 445 KSEGNRLATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKGTA 483
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 484 VFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y L+D E + + LAK +L R+ V D +G +++ R S
Sbjct: 342 LIAPFKEEDEWDSPHIVRYYNVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVS 401
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 402 KSSWLEEEDDLVVAKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEEPDAF 461
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 462 KRLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKG 500
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 501 TAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 538
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E D + LAK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYNVMSDEEIDRIKELAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQYITGLTVQTAELLQVANYGMGGQYEPHFDFSRNHERDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAALWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F+ D+E + LA+ L+RS VA SGE +L D R
Sbjct: 333 LLQPFRKEVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVA---SGEKQLQVDYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDSVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++
Sbjct: 450 PLYRMKSGNRVATFMIYLSAVEAGGATAFIYA---------------------NFSMPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 528
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + LAK +L R+ V D +G ++ R S
Sbjct: 324 LIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 384 KSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTL 443
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 444 KSEGNRLATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKGTA 482
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 483 VFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 518
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 103/200 (51%), Gaps = 34/200 (17%)
Query: 51 LTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIAT 110
++D E + +LAK +L+R+ V + ++G + + R S ++ +I + +I+
Sbjct: 1 MSDKEMAMIKSLAKPRLRRATVQNPVTGVLEFAHYRVSKSAWLKDEDHPVIKRVCQRISD 60
Query: 111 WTFLPKENGEDIQVLRYEHGQKYEPHYDY--------FSDKVNIVRGGHRLATVLMYLSD 162
T L E E++Q+ Y G +YEPH+DY F D+V G+R+AT L Y+S+
Sbjct: 61 VTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEV-----GNRIATFLTYMSN 115
Query: 163 VAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVS 222
V +GG TVF + GIAV+P +G A+ +++L + D +
Sbjct: 116 VEQGGSTVFLHP---------------------GIAVRPIKGSAVFWYNLLPSGAGDERT 154
Query: 223 LHSGCPVIEGEKWSATKWIH 242
H+ CPV+ G KW + KWIH
Sbjct: 155 RHAACPVLTGVKWVSNKWIH 174
>gi|407682954|ref|YP_006798128.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'English Channel 673']
gi|407244565|gb|AFT73751.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'English Channel 673']
Length = 376
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 106/203 (52%), Gaps = 28/203 (13%)
Query: 49 GFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP-KGKDAIIAGIEDK 107
G L+D+ECD+++ KS L+ S V + L+G D+RTS I + D I +E K
Sbjct: 186 GVLSDIECDYMLLRYKSLLQPSMVLNPLNGNPMKDDIRTSEVAIITNQWVDWISREVEVK 245
Query: 108 IATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD----KVNIVR-GGHRLATVLMYLSD 162
I+ + ++GE + +LRY+ GQ+Y+PHYD F+D + +I+ GG R T+L YL+
Sbjct: 246 ISRMSDTKPQHGEPLNLLRYKDGQEYKPHYDGFTDTQLKQTSIIEEGGQRTHTILAYLNS 305
Query: 163 VAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVS 222
+++G T FP K GI + P +G + F ++ N + S
Sbjct: 306 LSEGA-THFP---------------------KLGITIYPEKGKLVSFLNVDKNLALEKQS 343
Query: 223 LHSGCPVIEGEKWSATKWIHVDS 245
H G PV EKW TKW+ ++S
Sbjct: 344 YHCGQPVSTNEKWMLTKWVRLNS 366
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + LAK +L R+ V D +G ++ R S
Sbjct: 327 LIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVS 386
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 387 KSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAF 446
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 447 KRLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKG 485
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 486 TAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 523
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 100/210 (47%), Gaps = 29/210 (13%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS-SGTFIPKGK 97
W R F FLT EC H+I+ ++ L+ + V D SG +RTS G F P +
Sbjct: 137 GWDVRLF--RQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHPIRTSDGGIFGPARE 194
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVL 157
D +I I +IA + GE + +LRY GQ+Y H+D N R T+L
Sbjct: 195 DLVIQAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCLPHVRN-----QRAWTML 249
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
+YL++ GGET+FP + G++VK R+G+ALLF +
Sbjct: 250 IYLNEGYAGGETIFP---------------------RLGLSVKGRKGNALLFRNTDAQGQ 288
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSFD 247
++H G PV+ G+KW T+WI D D
Sbjct: 289 AAEAAVHLGAPVMAGQKWLCTRWIRHDRHD 318
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 106/217 (48%), Gaps = 24/217 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P +++Q+ KP+ +V L+D E + + LA+ +L+ +A + +G + LS R S
Sbjct: 308 LLAPIRLEQVFDKPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRIS 367
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIV- 147
++ + +I ++ ++ T L E E +QV+ Y G YEPH+D +
Sbjct: 368 KNAWLYYWEHRLINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFAL 427
Query: 148 --RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G R+AT+L Y+SDV GG TVFP G V P +G
Sbjct: 428 DPNEGDRIATMLFYMSDVEAGGATVFPQV---------------------GARVVPEKGA 466
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++L + D ++ H+GCPV+ G KW + WIH
Sbjct: 467 GAFWYNLLKSGEGDMLTEHAGCPVLVGSKWVSNMWIH 503
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + LAK +L R+ V D +G ++ R S
Sbjct: 326 LIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVS 385
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 386 KSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAF 445
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 446 KRLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKG 484
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 485 TAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 522
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/248 (29%), Positives = 109/248 (43%), Gaps = 46/248 (18%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD---------NLSGESKLSDVRTSS 89
S +PR F FL+ E D + + + +A N G+ + RTS
Sbjct: 202 SSEPRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSE 261
Query: 90 GTFIPKGKDAI-IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
F K + + ++ + + IQ+LRY+ GQ Y H+DYF S
Sbjct: 262 NAFDITTKQSFDVKKRAFRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKDF 321
Query: 145 N---IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE---------- 191
N + G +R AT+ +YLSDV+ GG+TVFPN E+ ++P + L E
Sbjct: 322 NWDPLSGGSNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESPSASELKEF 381
Query: 192 ------------------CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGE 233
C +K AV PRRGDA+LF+S + + D SLH CP++ G
Sbjct: 382 VSNAGLMEGSWEDNLIHKCYEK-FAVPPRRGDAILFYSQRPDGLLDTNSLHGACPILNGT 440
Query: 234 KWSATKWI 241
KW A W+
Sbjct: 441 KWGANLWV 448
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 106/230 (46%), Gaps = 33/230 (14%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
+F++T + P K++QI P +Y L+ E LI A +K + V G
Sbjct: 305 NFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKE-QGVP 363
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
K + RT+ G + K + + GI +I T + E QV+ Y G Y H DYF
Sbjct: 364 KKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYF 423
Query: 141 ----SDKVNIVRG-----GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
S+ + G G R+ATVL YL+DV +GG TVF
Sbjct: 424 DFASSNHTDTRSGYSMDLGDRIATVLFYLTDVEQGGATVF-------------------- 463
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
A G +V P+ G A+ +++L TN DP + H+ CPVI G KW T+WI
Sbjct: 464 -ADVGYSVYPQAGTAIFWYNLDTNGKGDPRTRHAACPVIVGSKWVMTEWI 512
>gi|363539943|ref|YP_004894760.1| mg709 gene product [Megavirus chiliensis]
gi|448825700|ref|YP_007418631.1| putative prolyl 4-hydroxylase [Megavirus lba]
gi|350611108|gb|AEQ32552.1| putative prolyl 4-hydroxylase [Megavirus chiliensis]
gi|371944083|gb|AEX61911.1| putative prolyl4-hydroxylase [Megavirus courdo7]
gi|425701637|gb|AFX92799.1| putative prolyl 4-hydroxylase [Megavirus courdo11]
gi|444236885|gb|AGD92655.1| putative prolyl 4-hydroxylase [Megavirus lba]
Length = 240
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 105/210 (50%), Gaps = 33/210 (15%)
Query: 43 RAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIA 102
+ FV + F+ +C ++ +++L S V +SG++ S +R S +IPK D ++
Sbjct: 56 KPFVIKNFIEPSKCQEIMKNCRNKLFDSEV---ISGKN--SKIRNSQQCWIPKN-DPMVL 109
Query: 103 GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN-----IVRGGHRLATVL 157
+ + I+ +P EN ED+QV+RY GQ Y H+D D + I RGG R TVL
Sbjct: 110 NMFENISKQFGIPFENAEDLQVVRYLPGQYYNEHHDACCDDTDKCREFISRGGQRKLTVL 169
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
+YL++ +GG T F N E + KP GDAL+F+ L N
Sbjct: 170 IYLNNEFEGGCTYFKNLE---------------------LRAKPSTGDALVFYPLAKNVN 208
Query: 218 P-DPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P+SLH+G PV GEKW A W + F
Sbjct: 209 KCHPLSLHAGMPVTSGEKWIANIWFRENRF 238
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 109/214 (50%), Gaps = 24/214 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+Q++ P + + E + +I ++RS V S + S++RTS+
Sbjct: 327 LAPFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQ--SQNATTSEIRTSA 384
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
T++ ++ ++ I+ ++ T L E+ E +Q++ Y G +YEPH+D+ + +
Sbjct: 385 NTWLWYNENPWLSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEPHFDFVEEPQKVFGW 444
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+R+ T L Y++DVA GG T FP + +AV P +G L+
Sbjct: 445 KGNRMLTALFYINDVALGGATAFPFLQ---------------------LAVPPVKGSLLV 483
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPVI+G KW +W H
Sbjct: 484 WYNLHRSLHKDFRTKHAGCPVIKGSKWICNEWFH 517
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/224 (31%), Positives = 112/224 (50%), Gaps = 30/224 (13%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS- 83
S ++ P + + I +P +Y F++DLE + LA+ L+RS VA SGE +L
Sbjct: 268 SPYLLLQPVRKEVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVA---SGEKQLPV 324
Query: 84 DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF- 140
+ R S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+
Sbjct: 325 EYRISKSAWLKDTADPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHAT 384
Query: 141 --SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
S + ++ G+R+AT ++YLS V GG T F A +
Sbjct: 385 SPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFS 423
Query: 199 VKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
V + AL +++LH + D +LH+ CPV+ G+KW A KWIH
Sbjct: 424 VPVVKNAALFWWNLHRSGEGDSDTLHAACPVLVGDKWVANKWIH 467
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 112/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 361 LLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVA---SGEKQLQVEYRI 417
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKEN--GEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L ++ E +QV+ Y G YEPH+D+ S
Sbjct: 418 SKSAWLKDTADPVLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPSS 477
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A +V
Sbjct: 478 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFSVPVV 516
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 517 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 556
>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
Length = 521
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/228 (27%), Positives = 115/228 (50%), Gaps = 30/228 (13%)
Query: 21 RKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSG 78
R +F++T + P K+++I+ P +Y + D E + + L+ Q++ + +
Sbjct: 303 RYNFTTTPFLRLAPLKLEEINHDPYVVMYHNVIYDSEIEEMKRLS-PQMQNGYIHGYKAN 361
Query: 79 ESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD 138
++K++D+ + + + +I T + +QV + G +E HYD
Sbjct: 362 QTKVTDIAARVNWLVENT--PFLERMNQRITDMTGFDLKEFPSVQVANFGIGNNFEAHYD 419
Query: 139 Y-FSDKV---NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAK 194
Y F +V ++ G RLA+++ Y SDV GG TVFP+ +
Sbjct: 420 YIFGKRVRKEDVGDLGDRLASIIFYSSDVPLGGATVFPDIQ------------------- 460
Query: 195 KGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+AV+P++G++LL+++L + PDP SLHS CPV+ G +W+ TKW+H
Sbjct: 461 --VAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRWTLTKWLH 506
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 109/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y +++ + + + LAK +L+R+ +++ ++G + + R S
Sbjct: 382 VIGPVKQEDEWDSPHIVRYHNIVSEKDMEKVKELAKPRLRRATISNPVTGVLETAHYRIS 441
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + ++ I I T L + ED+QV Y G +YEPH+D+ D
Sbjct: 442 KSAWLGAYEHPVVDKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAF 501
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L+Y++DV GG TVF + G AVKP++G
Sbjct: 502 EELGTGNRIATWLLYMTDVQAGGATVFTDI---------------------GAAVKPKKG 540
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L+ + D + H+ CPV+ G KW + KWIH
Sbjct: 541 TAVFWYNLYPSGEGDYRTRHAACPVLLGNKWVSNKWIH 578
>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Ornithorhynchus anatinus]
Length = 888
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 107/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y L+D E + + LAK +L R+ V D +G +++ R S
Sbjct: 676 LIAPFKEEDEWDSPHIVRYYDVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVS 735
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 736 KSSWLEEEDDPVVAQVNRRMQYITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAF 795
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 796 KRLGTGNRVATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKG 834
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 835 TAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 872
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + LAK +L R+ V D +G ++ R S
Sbjct: 329 LIAPFKEEDEWDSPHIVRYYDVMSDEEIEKIKQLAKPRLARATVRDPKTGVLTVASYRVS 388
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 389 KSSWLEEDDDPVVAKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTL 448
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 449 KSEGNRLATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKGTA 487
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 488 VFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFH 523
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 102/215 (47%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K+++ S P + L+ + L +A +++RS V G+ K S R S
Sbjct: 318 ILAPLKLEEHSLDPYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKSAFRVS 377
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ +AG+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 378 KNAWLAYEAHPTMAGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPA 437
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLS+V +GG T FP + AVKP+ G+ L
Sbjct: 438 AEGNRIATAIFYLSEVEQGGATAFPFLD---------------------FAVKPQLGNVL 476
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 477 FWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 119/246 (48%), Gaps = 31/246 (12%)
Query: 2 SPTRLSLNFFFLLSFSLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHL 59
+P + F +L+ R +F++T + P K+++++ P +Y L D E + L
Sbjct: 294 TPYEIGCRGLFPKRTNLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVLYHEVLYDREIEEL 353
Query: 60 INLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENG 119
+K+ + + E+K+ ++ + + A I +I T
Sbjct: 354 KKQSKNMINGFSEPQQ---ENKIREIIARHAWWWEQTTTR--ARIYQRITDITGFQLFVQ 408
Query: 120 EDIQVLRYEHGQKYEPHYDYFSDKVNI--VRGGHRLATVLMYLSDVAKGGETVFPNAEEP 177
E++ V Y G + PHYDY + +I GG L T+L Y+SD+ +GG T+FP+
Sbjct: 409 EELNVANYGLGTIFGPHYDYTPENYDIGWFMGGP-LGTILFYVSDLQQGGATIFPSI--- 464
Query: 178 PRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSA 237
I V PR+G ALL+F+L+ + PDP +LHS CPVIEG++W+
Sbjct: 465 ------------------NITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWTL 506
Query: 238 TKWIHV 243
TKW+H+
Sbjct: 507 TKWVHL 512
>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
Length = 239
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 106/208 (50%), Gaps = 33/208 (15%)
Query: 45 FVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGI 104
F+ + F+ +C ++N +++L S V +SG++K +R S ++ K D ++ +
Sbjct: 57 FIIKNFINKEKCKEIMNNTQNKLFDSEV---ISGKNKA--IRNSQQCWVSK-YDPMVKSM 110
Query: 105 EDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF---SDKVN--IVRGGHRLATVLMY 159
KI+ +P EN ED+QV+RY GQ Y H+D +DK N I RGG R TVL+Y
Sbjct: 111 FQKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVY 170
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIP- 218
L++ +GG T F N + VKP GDA++F+ L N
Sbjct: 171 LNNEFEGGHTFFKNL---------------------NLKVKPETGDAIVFYPLAKNTSKC 209
Query: 219 DPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P+SLH+G PV GEKW A W +F
Sbjct: 210 HPLSLHAGMPVTSGEKWIANLWFRERAF 237
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 102/215 (47%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K+++ + P Y L+ + L +A +++RS V G++K S R S
Sbjct: 318 LLAPLKLEEHNLDPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKSAFRVS 377
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ + G+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 378 KNAWLAYESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPA 437
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLSDV +GG T FP + AVKP+ G+ L
Sbjct: 438 EEGNRIATAIFYLSDVEQGGATAFPFLD---------------------FAVKPQLGNVL 476
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 477 FWYNLHRSLDMDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
Length = 341
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 113/240 (47%), Gaps = 48/240 (20%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ +P +Y ++D E + + + A+ + +R+ V + +GE + ++ R S
Sbjct: 102 LAPLKLEEAYRQPDIVIYHDVMSDREIELIKHYARPRFRRATVQNYKTGELEFANYRISK 161
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIVR 148
++ + +I + ++ T L E++QV+ Y G YEPH+D+ ++ N +
Sbjct: 162 SAWLKDTEHEVIRTVNQRVEDMTGLTMATAEELQVVNYGIGGHYEPHFDFARREERNAFK 221
Query: 149 G---GHRLATVLMY-----------------------LSDVAKGGETVFPNAEEPPRRRT 182
G+R+ATVL Y +SDV +GG TVFP+
Sbjct: 222 SLGTGNRIATVLFYVSDLCLCHTSHTNADFRFLSVGQMSDVTQGGATVFPSL-------- 273
Query: 183 PATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+A++PR+G A + +LH + D + H+ CPV+ G KW + KWIH
Sbjct: 274 -------------NLALRPRKGTAAFWHNLHASGNGDYATRHAACPVLTGTKWVSNKWIH 320
>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
Length = 544
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 110/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + + +P +Y F+ DLE + A+ L+RS VA SGE +L + R
Sbjct: 333 LLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRSVVA---SGEKQLPVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A +V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 528
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 90/204 (44%), Gaps = 31/204 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR V + FL+ ECD L A+ + V D + R++ +P ++
Sbjct: 95 PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQLPAAGSELV 154
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLS 161
+E +I T P E +Q+ RY GQ Y PHYD+F + +GG RLAT+++YL
Sbjct: 155 RRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGGQRLATLILYLR 214
Query: 162 DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP- 220
GG T F N G+ + PR+G AL F PDP
Sbjct: 215 APEAGGATYFANL---------------------GMRIAPRKGSALFF------TYPDPG 247
Query: 221 ---VSLHSGCPVIEGEKWSATKWI 241
+LH G V+ GEKW AT+W
Sbjct: 248 NNSGTLHGGEAVLAGEKWIATQWF 271
>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
Length = 558
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 109/221 (49%), Gaps = 25/221 (11%)
Query: 26 STAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDV 85
S + P KV+ + P A +++ ++D E + LAK +L R+ V D+++G+ +
Sbjct: 311 SFLVYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATY 370
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----S 141
R S ++ + + ++ + +I T L E E++Q+ Y G Y+PH+D+ S
Sbjct: 371 RISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEES 430
Query: 142 DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
+ G+R+ATVL Y+S + GG TVF A+ + P
Sbjct: 431 KSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKS---------------------TILP 469
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ DAL +++L+ +P + H+ CPV+ G KW + KWIH
Sbjct: 470 TKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 510
>gi|363543389|ref|NP_001241704.1| prolyl 4-hydroxylase 6-2 precursor [Zea mays]
gi|347978822|gb|AEP37753.1| prolyl 4-hydroxylase 6-2 [Zea mays]
Length = 162
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 65/85 (76%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
+P+ V Q+S +PRAF+Y GFL+D ECDH+++LAK +++S VADN SG+S S RTSSG
Sbjct: 31 DPASVTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSG 90
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLP 115
TF+ K +D I++ IE ++A WTF P
Sbjct: 91 TFLAKREDEIVSAIEKRVAAWTFPP 115
>gi|406596009|ref|YP_006747139.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii ATCC
27126]
gi|406373330|gb|AFS36585.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii ATCC
27126]
Length = 376
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 106/203 (52%), Gaps = 28/203 (13%)
Query: 49 GFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP-KGKDAIIAGIEDK 107
G L+D+ECD+++ KS L+ S V + L+G D+RTS I + D I +E K
Sbjct: 186 GVLSDIECDYMLLRYKSLLQPSMVLNPLNGNPMKDDIRTSEVAIITNQWVDWISREVEVK 245
Query: 108 IATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD----KVNIVR-GGHRLATVLMYLSD 162
++ + ++GE + +LRY+ GQ+Y+PHYD F+D + +I+ GG R T+L YL+
Sbjct: 246 MSRMSDTKPQHGEPLNLLRYKDGQEYKPHYDGFTDTQLKQTSIIEEGGQRTHTILAYLNS 305
Query: 163 VAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVS 222
+++G T FP K GI + P +G + F ++ N + S
Sbjct: 306 LSEGA-THFP---------------------KLGITIYPEKGKLVSFLNVDKNLALEKQS 343
Query: 223 LHSGCPVIEGEKWSATKWIHVDS 245
H G PV EKW TKW+ ++S
Sbjct: 344 YHCGQPVSTNEKWMLTKWVRLNS 366
>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
Length = 542
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P+K + + +P +Y F++D E + + LA L+RS VA SGE + + R
Sbjct: 331 LLQPAKKETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVA---SGEKQQKVEYRI 387
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYFSDK-- 143
S ++ D ++ +E ++A T L E +QV+ Y G YEPH+D+ + +
Sbjct: 388 SKSAWLKDTADPVVQALELRMAAITGLDLRPPYAEYLQVVNYGLGGHYEPHFDHATSRKS 447
Query: 144 -VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+ATV++YLS V GG T F A +V
Sbjct: 448 PLYRMKSGNRIATVMIYLSAVEAGGSTAFIYAN---------------------FSVPVV 486
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++L N D +LH+GCPV+ G+KW A KWIH
Sbjct: 487 KNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIH 526
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/214 (30%), Positives = 101/214 (47%), Gaps = 22/214 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ S P Y L+ + L +A +++RS V G+ K S R S
Sbjct: 319 LAPLKLEEHSLDPYVATYHDMLSPRKISQLREMAVPRMRRSTVNPLPGGQHKKSAFRVSK 378
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-VR 148
++ + G+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 379 NAWLAYESHPTMVGMLRDLKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEE 438
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+R+AT + YLS+V +GG T FP + IAVKP+ G+ L
Sbjct: 439 EGNRIATAIFYLSEVEQGGATAFPFLD---------------------IAVKPQLGNVLF 477
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 478 WYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 108/217 (49%), Gaps = 26/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
INP + + S +P VY + D E + + +A +L R+ V ++ +G+ + + R S
Sbjct: 324 INPLREETASLEPWIAVYHQLMNDHEIERIKEMATPRLARATVHNSATGQLEHAKYRISK 383
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
++ +D +IA I ++ + T L E++QV+ Y G +YEPH+D FS +
Sbjct: 384 SGWLRDEEDPLIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFD-FSRRSEPTAF 442
Query: 149 ---GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G+R+ TV+ Y++DV GG TVF +A G+ V P +G
Sbjct: 443 EKWRGNRILTVIYYMTDVEAGGATVFLDA---------------------GVKVYPEKGS 481
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A ++ +L + D + H+ CPV+ G KW A KW H
Sbjct: 482 AAVWHNLLPSGEGDMRTRHAACPVLTGSKWVANKWFH 518
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K+++ S P + L+ L +A ++RS V G+ + S R S
Sbjct: 315 LLAPLKLEEHSLDPLVVTFHDMLSQHRIAELREMAVPHMQRSTVNPLPGGQRRKSAFRVS 374
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++P + + ++ T L E +QV Y G YEPH+D+F D +
Sbjct: 375 KNAWLPYSTHPTMGRMLRDVSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPA 434
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLSDV +GG T FP AV+P+ G+ L
Sbjct: 435 AEGNRIATAIFYLSDVEQGGATAFPFL---------------------NFAVRPQLGNIL 473
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH ++ D + H+GCPV++G KW A WIH
Sbjct: 474 FWYNLHRSSDMDFRTKHAGCPVLKGSKWIANIWIH 508
>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
Precursor
gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
Length = 559
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 108/218 (49%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P KV+ + P A +++ ++D E + LAK +L R+ V D+++G+ + R S
Sbjct: 315 VYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRIS 374
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
++ + + ++ + +I T L E E++Q+ Y G Y+PH+D+ S
Sbjct: 375 KSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSF 434
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+ATVL Y+S + GG TVF A+ + P +
Sbjct: 435 ESLGTGNRIATVLFYMSQPSHGGGTVFTEAKS---------------------TILPTKN 473
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
DAL +++L+ +P + H+ CPV+ G KW + KWIH
Sbjct: 474 DALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 110/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 333 LLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 RNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 156
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/137 (45%), Positives = 86/137 (62%), Gaps = 9/137 (6%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKD 98
SWK + +YE + EC+HLI L K L+RS ++D +G+ + + G F+ KD
Sbjct: 19 SWKIK--LYE---SKEECEHLIKLGKPYLERSRISDKRTGKGIENRFAYACGGFV---KD 70
Query: 99 AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLM 158
II IE +I +P ENGE +QV+ Y GQK+ PHYD S++ + GG R+AT LM
Sbjct: 71 KIIKNIEQRIPDIISIPVENGEGLQVIHYGVGQKFVPHYDSRSNE-SFWNGGPRVATFLM 129
Query: 159 YLSDVAKGGETVFPNAE 175
YLSDV +GGETVFP+A+
Sbjct: 130 YLSDVEEGGETVFPSAK 146
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 110/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 333 LLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 RNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
Length = 546
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 104/218 (47%), Gaps = 29/218 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K +++ P +Y + E D + L +++L R+ + + ES +S+VRTS
Sbjct: 321 IAPLKAEELFQDPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSH--NESVVSNVRTSQ 378
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK------ 143
TFIP +++ I+ ++A T L + ED Q Y G Y H D+F
Sbjct: 379 FTFIPVTAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGL 438
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
V+ G+R+A VL YLSDVA+GG T FP RT +KP++
Sbjct: 439 VSSPEMGNRIAAVLFYLSDVAQGGGTAFPQL------RT---------------LLKPKK 477
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
A + +LH + + D + H CP+I G KW +WI
Sbjct: 478 YAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWI 515
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 72/230 (31%), Positives = 107/230 (46%), Gaps = 34/230 (14%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
+F++T + P K +QI P +Y L+ E LI+ A +K + V + +
Sbjct: 306 NFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREISMLISKAAQNMKNTRV--HRETKP 363
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
K + RT+ G ++ K + + I +I T + ED QV+ Y G Y H DYF
Sbjct: 364 KTNRGRTAKGHWLKKESNELTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYF 423
Query: 141 SDKVNIVRG---------GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
+ G G R+ATVL YLSDV +GG TVF N
Sbjct: 424 DYASSNYTGPRSRQSKVLGDRIATVLFYLSDVEQGGATVFGNV----------------- 466
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
G +V P+ G A+ +++L T+ DP++ H+ CPVI G KW T+WI
Sbjct: 467 ----GYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVIVGSKWVMTEWI 512
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 112/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P++ + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 298 LLQPARKEVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVA---SGEKQLPVEYRI 354
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 355 SKSAWLKDTVDPMLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 414
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A +V
Sbjct: 415 PLYRMKSGNRVATFMIYLSAVEAGGATAFIYA---------------------NFSVPVV 453
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 454 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 493
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 102/216 (47%), Gaps = 24/216 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K + ++ PR ++ + E + + LA +L+R+ V + ++G +++ RTS
Sbjct: 367 IGPVKEETLNPDPRIVMWYDLIFPSEIEKIKELATPRLRRATVKNPVTGILEIAFYRTSK 426
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI--- 146
++P I I +I T L E ED+QV Y G Y PH+D+ +
Sbjct: 427 SAWLPHSMSEITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHYAPHFDFGRKREKDAFE 486
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
V+ G+R+AT++ YLSDV GG TVF + G V P++G A
Sbjct: 487 VKNGNRIATIIFYLSDVQAGGATVF---------------------NRIGTRVVPKKGAA 525
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+F+L N D + H+ CPV+ G KW W H
Sbjct: 526 GFWFNLLPNGEGDLRTRHAACPVLAGSKWVMNLWFH 561
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 111/219 (50%), Gaps = 28/219 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P + + +S +P +Y F++D E + + A+ L+RS VA + ++ R S
Sbjct: 329 LLRPVRREVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVATG--DKQATAEYRIS 386
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKEN--GEDIQVLRYEHGQKYEPHYDYF---SDK 143
++ + ++ ++ KI+ T L ++ GE +QV+ Y G YEPH+D+ S
Sbjct: 387 KSAWLKGSAHSTVSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSP 446
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
V ++ G+R+AT ++YLS V GG T F A +V +
Sbjct: 447 VFKLKTGNRVATFMIYLSSVEAGGSTAFIYA---------------------NFSVPVMK 485
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++LH N D +LH+GCPV+ G+KW A KWIH
Sbjct: 486 NAAIFWWNLHRNGEGDADTLHAGCPVLIGDKWVANKWIH 524
>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
Length = 190
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 100/206 (48%), Gaps = 30/206 (14%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
PR F+ L++ ECDH+I L +++S V G S RTS ++ + I+
Sbjct: 1 PRVFLVREMLSEFECDHIIELGTKVVRKSMVG---QGGGFTSKTRTSENGWLRRSASPIL 57
Query: 102 AGIEDKIATWTFLPKE------NGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLAT 155
I + + + N E++QV+RY+ Q+Y PH+D+ D R T
Sbjct: 58 ENIYKRFGDVLGIDHDLLRSGKNAEELQVVRYDRSQEYAPHHDFGDDGTP----QQRFLT 113
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTN 215
+L+Y+ +GG T FP A ND + G+ V P RGDA+LF+S+ +
Sbjct: 114 LLLYIQLPEEGGATSFPKA-----------NDGM------GVQVVPARGDAVLFYSMLPD 156
Query: 216 AIPDPVSLHSGCPVIEGEKWSATKWI 241
D ++LH+G PV +G+KW W+
Sbjct: 157 GNADDLALHAGMPVRKGQKWVCNLWV 182
>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
Length = 415
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 110/225 (48%), Gaps = 42/225 (18%)
Query: 23 SFSSTAI----INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA--DNL 76
S+++TA + P K + +S P +Y +T LE L NL+K +KR A+ +NL
Sbjct: 214 SYNTTAAPFLRLAPFKTELLSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNL 273
Query: 77 SGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPH 136
+ RTS+ ++ ++A++ +E ++ T EN E Q++ Y G Y+PH
Sbjct: 274 KVRPFIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPH 333
Query: 137 YDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
D+F LSDV +GG T+FP +
Sbjct: 334 TDHFETP---------------QLSDVPQGGATLFP---------------------RLN 357
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
I+V+PR+GDALL+++L+ + ++H+ CP+I+G KW+ KWI
Sbjct: 358 ISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWALVKWI 402
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 23/200 (11%)
Query: 43 RAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIA 102
R ++ F + EC HL + +L R+ + G + + R S+ ++ D ++
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKLSRAVAWTD--GAFRPVEFRISTAAWLQPDHDDVVT 362
Query: 103 GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSD 162
+ +IA T L E E +QV Y G YE HYD+ + + + G R+AT ++YL+
Sbjct: 363 NLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRIATFMIYLNQ 422
Query: 163 VAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVS 222
V +GG T FP + G AV+P GDA+ +++L + D +
Sbjct: 423 VEQGGYTAFP---------------------RLGAAVEPGHGDAVFWYNLLPDGESDNNT 461
Query: 223 LHSGCPVIEGEKWSATKWIH 242
LH CPV++G KW A KWIH
Sbjct: 462 LHGACPVLQGSKWVANKWIH 481
>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
Length = 558
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/238 (28%), Positives = 107/238 (44%), Gaps = 46/238 (19%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ + WKP+ + G ++D E + LA LKR+ V + +G+ + + R S
Sbjct: 324 LAPIKVEVMHWKPKIVYFRGVISDEEIAVIKQLASPLLKRATVHNADTGQLETASYRISK 383
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD----------- 138
++ + ++ I D+I T L E E +Q+ Y G Y+PH+D
Sbjct: 384 SAWLKDTEHEVVKRISDRIDMMTDLTMETAELLQIANYGIGGHYDPHFDMSTRGESDPYE 443
Query: 139 ------------YFSDKVNI--VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPA 184
Y +D + + G+R+ATVL Y+S GG TVF + +
Sbjct: 444 EGTGNRIATVLFYTNDPYSFESLNAGNRIATVLFYISQPEAGGGTVFTSHK--------- 494
Query: 185 TNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
I V+P + DA +F++ PD + H+ CPV+ G KW A KWIH
Sbjct: 495 ------------ITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLAGTKWVANKWIH 540
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 97/214 (45%), Gaps = 24/214 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K++ P V+ L+ E D L LA+ L+R+ V E RTS
Sbjct: 290 LLAPIKMEIRLLNPFIIVFHDVLSPREIDELQKLARPLLERTTVVKFKKYEK--DSRRTS 347
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK-VNIV 147
GT+I + + + IE +I L E QV+ Y G Y H D+ D +
Sbjct: 348 KGTWIERDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDTWADKK 407
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
R+ATVL YL+DV +GG TVF + AV P+RG AL
Sbjct: 408 EEDDRIATVLFYLTDVEQGGATVFTILNQ---------------------AVSPKRGTAL 446
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++LH N D +LH GCPV+ G KW T WI
Sbjct: 447 FWYNLHRNGTGDTRTLHGGCPVLVGSKWIMTLWI 480
>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
Length = 515
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/233 (28%), Positives = 123/233 (52%), Gaps = 37/233 (15%)
Query: 17 SLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
+L+ R +FS+ A + P K+++IS P ++ ++D E + + K ++K+
Sbjct: 296 NLVCRYNFSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEM----KGEIKQME--- 348
Query: 75 NLSGESKLSDVR-TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKY 133
+G + L + + S + + + I D+I+ T E IQ+ + G +
Sbjct: 349 --NGWTSLEEPKEIVSHIYWITKESSFSKRINDRISDMTGFKVEEFPAIQLANFGVGGYF 406
Query: 134 EPHYDYFSDKVNIVRG----GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDL 189
+PHYDY+++++ + G RLA++++Y +V++GG+TVFP+ +
Sbjct: 407 KPHYDYYTERLKELDANNTLGDRLASIIIYAGEVSQGGQTVFPDIK-------------- 452
Query: 190 SECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+AV+P++G AL +F+ ++ PDP SLHS CPVI G +W+ TKW+H
Sbjct: 453 -------VAVEPKKGKALFWFNDFDDSSPDPRSLHSVCPVIVGSRWTITKWLH 498
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 112/224 (50%), Gaps = 30/224 (13%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS- 83
S ++ P + + I +P +Y F++D+E + LA+ L+RS VA SGE +L
Sbjct: 263 SPYLLLQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVA---SGEKQLPV 319
Query: 84 DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYFS 141
+ R S ++ D ++ ++ +I T L + E +QV+ Y G YEPH+D+ +
Sbjct: 320 EYRISKSAWLKDTVDPLLVNLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHAT 379
Query: 142 DKVNIV---RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
+ + + G+R+AT ++YLS V GG T F A +
Sbjct: 380 SPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFS 418
Query: 199 VKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
V + AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 419 VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 462
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 337 LLQPVRKEVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVA---SGEKQLPVEYRI 393
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ + +
Sbjct: 394 SKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTS 453
Query: 146 IV---RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ + G+R+AT ++YLS V GG T F A +V
Sbjct: 454 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFSVPVV 492
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 493 KNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 532
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 324 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKQIAKPKLARATVRDPKTGVLTVASYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 384 KSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAF 443
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 444 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 482
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 483 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 520
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 113/241 (46%), Gaps = 49/241 (20%)
Query: 29 IINPSKVKQISW-KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA-------------- 73
I++P K +Q W +P Y ++D E + + LAK +L+R+ ++
Sbjct: 361 ILSPVK-QQDEWDRPYIVRYLDIISDKEIELVKQLAKPRLRRATISNPITGVLETASYRI 419
Query: 74 --------DNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVL 125
D +G+ + R S ++ + +I I +I T L + E++QV
Sbjct: 420 SKRRATVHDPQTGKLTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVA 479
Query: 126 RYEHGQKYEPHYDYFS----DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRR 181
Y G +YEPH+D+ D + G+R+AT L Y+SDVA GG TVFP+
Sbjct: 480 NYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPDV------- 532
Query: 182 TPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
G AV P++G A+ +++L T+ D + H+ CPV+ G KW + KWI
Sbjct: 533 --------------GAAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWI 578
Query: 242 H 242
H
Sbjct: 579 H 579
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 108/217 (49%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ + + P A ++ ++D E + +A +LKR+ V ++ +GE + + R S
Sbjct: 317 LAPFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELETAAYRISK 376
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK----VN 145
++ G +I I +I T L +E E++Q+ Y G Y+PH+D+ +
Sbjct: 377 SAWLKGGDHELIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAFE 436
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+RLATVL YL++ GG TVF RT AV P +
Sbjct: 437 SLGTGNRLATVLFYLTEPEIGGGTVFTEL------RT---------------AVMPSKNG 475
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L+ + D + H+ CPV+ G KW A KWIH
Sbjct: 476 ALFWYNLYRSGEGDLRTRHAACPVLVGIKWVANKWIH 512
>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
Length = 533
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 111/224 (49%), Gaps = 30/224 (13%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS- 83
S ++ P + + I +P +Y F+ DLE + LA+ L+RS VA SGE +L
Sbjct: 318 SPYLLLQPIRKEVIHLEPYVVLYHDFVNDLEAQKIRGLAEPWLQRSVVA---SGEKQLPV 374
Query: 84 DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYFS 141
+ R S ++ D ++ ++ +I T L + E +QV+ Y G YEPH+D+ +
Sbjct: 375 EYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHAT 434
Query: 142 DKVNIV---RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIA 198
+ + + G+R+AT ++YLS V GG T F A +
Sbjct: 435 SPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFS 473
Query: 199 VKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
V + AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 474 VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 517
>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
gallopavo]
Length = 539
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P+K + + +P +Y F++D E + + LA L+RS VA SGE + + R
Sbjct: 328 LLQPAKKETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVA---SGEKQQKVEYRI 384
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLP--KENGEDIQVLRYEHGQKYEPHYDYFSDK-- 143
S ++ D ++ +E ++A T L E +QV+ Y G YEPH+D+ + +
Sbjct: 385 SKSAWLKDTADPVVRALELRMAAITGLDLRPPYAEYLQVVNYGLGGHYEPHFDHATSRKS 444
Query: 144 -VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+ATV++YLS V GG T F A +V
Sbjct: 445 PLYRMKSGNRIATVMIYLSAVEAGGSTAFIYAN---------------------FSVPVV 483
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++L N D +LH+GCPV+ G+KW A KWIH
Sbjct: 484 KNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIH 523
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A I ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 104/230 (45%), Gaps = 33/230 (14%)
Query: 23 SFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGES 80
+F++T + P K++QI P +Y L+ E LI A +K + V G
Sbjct: 305 NFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKATQNMKNTRVHKE-QGVP 363
Query: 81 KLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
K + RT+ G + K + + GI +I T + E QV+ Y G Y H DYF
Sbjct: 364 KKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYF 423
Query: 141 ---SDKVNIVRG------GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
S R G R+ATVL YL+DV +GG TVF
Sbjct: 424 DFASSNHTDTRSSYSMDLGDRIATVLFYLTDVEQGGATVF-------------------- 463
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
A G +V P+ G A+ +++L TN DP + H+ CPVI G KW T+WI
Sbjct: 464 -ADVGYSVYPQAGTAIFWYNLDTNGKGDPRTKHAACPVIVGSKWVMTEWI 512
>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
Length = 544
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P++ + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 333 LLQPARKEVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R AT+++YLS V GG T F +V
Sbjct: 450 PLYKMKSGNRAATLMIYLSSVEAGGATAF---------------------IYGNFSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 KNAALFWWNLHRSGEGDDDTLHAGCPVLVGDKWVANKWIH 528
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 324 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKQIAKPKLARATVRDPKTGVLTVASYRVS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 384 KSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 443
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 444 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 482
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 483 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 518
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 99/213 (46%), Gaps = 29/213 (13%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIP 94
V S P +V FL+D EC+ + + K +++R+ V + ES+ RT+ ++
Sbjct: 10 VTLYSADPIVYVVNNFLSDDECEAFVEMGKGKMERAKVISD--DESEFHASRTNDFCWLE 67
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV-----NIVRG 149
+I + + + +P N E Q++ Y G +Y+PH+D F N G
Sbjct: 68 HSASDVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPG 127
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
G R+ T L YL+DV +GG T FP K ++VKP +GD ++F
Sbjct: 128 GQRMVTALAYLNDVEEGGATDFP---------------------KINVSVKPNKGDVVVF 166
Query: 210 FS-LHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ + +P +LH G PV+ GEKW+ W
Sbjct: 167 HNCIEGTTEINPQALHGGSPVVAGEKWAVNLWF 199
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 103/205 (50%), Gaps = 32/205 (15%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P + + FLT+ ECD LI+ AK + + V +GE +S RTSS ++ + +
Sbjct: 1 PPIYTVQNFLTEEECDALIDSAKDHMTPAPVVGPGNGEVSVS--RTSSTCYLARED---L 55
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR-----GGHRLATV 156
+ K+ T P E+ E QV RY G+ Y+PHYD F R GG R+ATV
Sbjct: 56 PSVCTKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATV 115
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
L+YL+DV +GGET F +K G+ +KPR+G+AL+FF +
Sbjct: 116 LVYLNDVERGGETSF---------------------SKLGVRIKPRKGNALIFFPATLDG 154
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWI 241
+ D LH+ P ++ KW + WI
Sbjct: 155 VLDQNYLHAAEPAVD-PKWVSQIWI 178
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A I ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 101/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K+++ S P Y L+ + L +A +KRS V +SK S R S
Sbjct: 319 VLAPLKLEEHSLDPLVVSYHDMLSPQQIIELRQMAVPHMKRSTVNPLPGRQSKKSAFRVS 378
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ ++ + ++ T L E +QV Y G YEPH+D+F D +
Sbjct: 379 KNAWLEYDTHPMMGRMLRDLSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPA 438
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLSDV +GG T FP AV+P+ G+ L
Sbjct: 439 EEGNRIATAIFYLSDVEQGGATAFPFL---------------------NFAVRPQLGNIL 477
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW A WIH
Sbjct: 478 FWYNLHRSLDMDYRTKHAGCPVLKGSKWIANIWIH 512
>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
Length = 484
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 119/234 (50%), Gaps = 36/234 (15%)
Query: 17 SLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
+L+ R + ++T + P K++++S P +Y ++D E + + L D
Sbjct: 267 NLVCRYNATTTPFLKLAPLKLEEVSLDPYIVLYHNVISDREIEEMKGLIDE-------MD 319
Query: 75 NLSGESKLSDVRTSSGTFIPKGKDAIIAG-IEDKIATWTFLPKENGEDIQVLRYEHGQKY 133
N G + L++ R + K++ + +I T + +Q+ + G ++
Sbjct: 320 N--GWTDLNESREIVSRLVWLTKESRFRKRLNLRIRDITGFNVDEIRGLQIANFGVGGQF 377
Query: 134 EPHYDYFSDKV---NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLS 190
+PHYDYF++++ N G R+A+++ Y+ DV GG+TVFP+ +
Sbjct: 378 KPHYDYFTERILRLNNTILGDRIASIIFYVGDVVHGGQTVFPDIQ--------------- 422
Query: 191 ECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVD 244
IAVKP++G +L +F+ +A PDP SLHS CPV+ G++W+ TKW+H +
Sbjct: 423 ------IAVKPQKGSSLFWFNTFDDATPDPRSLHSVCPVLIGDRWTITKWLHYE 470
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 72/231 (31%), Positives = 105/231 (45%), Gaps = 38/231 (16%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P K++ +S PR V+ FL EC+ ++++ +L R+ V E S RT+
Sbjct: 45 VLQPFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYLGGPPEGGFSLRRTN 104
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY--FSDKVNI 146
++ ++ + +IA T L + E QV Y G Y PH DY F +
Sbjct: 105 KVAWMSDDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQGD 164
Query: 147 V--RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+RLAT+L+YL+DVA GG T F N +AVKP G
Sbjct: 165 IYKSSGNRLATMLIYLADVAGGGATAFINMR---------------------LAVKPTLG 203
Query: 205 DALLFFSLHTNAIP-------------DPVSLHSGCPVIEGEKWSATKWIH 242
AL +++L P DP + H GCPV+ G KW TKWIH
Sbjct: 204 TALFWYNLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKWIVTKWIH 254
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 325 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 385 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAF 444
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 445 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 483
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 484 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 521
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 365 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 424
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 425 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 484
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 485 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 523
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 524 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 559
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 294 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 353
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 354 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAF 413
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 414 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 452
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 453 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 490
>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 617
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 110/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P+K +++ P +Y ++D E D + +A L R+ V + +G+ + ++ R S
Sbjct: 405 LLKPAKEEEVYLNPWIVIYHDVVSDKEIDTIKRIATPLLSRATVHNPRTGKLETAEYRVS 464
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
++ G D +I + ++I+ T L E++Q+ Y G +YEPH+D+ ++
Sbjct: 465 KSAWLKDGDDPVIHNVNNRISDITGLSMATAEELQIANYGLGGQYEPHFDFARREETEAF 524
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+++V GG TVF + G+ + P +G
Sbjct: 525 RDLGSGNRIATWLTYMTNVDAGGATVFTHI---------------------GVKLFPIKG 563
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L+ + + H+ CPV+ G+KW + KWIH
Sbjct: 564 AAAFWYNLYRSGDGIFDTRHAACPVLVGQKWVSNKWIH 601
>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 525
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 110/219 (50%), Gaps = 29/219 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD-NLSGESKLSDVRTS 88
I P KV+++ P + + + + + + ++K L R+ V N +G + D+RTS
Sbjct: 309 IKPVKVEELCNSPHIVQFYDVINNDDIETIKKMSKKHLSRALVTGPNNTG--IVEDIRTS 366
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD-----K 143
+ K + + +I+ T L +E ED+QV Y +Y+PH+DY D +
Sbjct: 367 KVAWFKKNDFTAVKKLYTRISEMTGLSEETFEDLQVANYGLAGEYQPHFDYTEDPSIYKR 426
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ G+R+AT+L+YL+DV +GG T F EP I KP +
Sbjct: 427 EDGAEVGNRIATMLLYLNDVKEGGRTAFI---EPK------------------IVAKPIK 465
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A+ +++L+ + + DP + H+ CPV+ G KW++ W+H
Sbjct: 466 GSAVFWYNLYPSGLGDPRTRHASCPVVIGNKWASNVWVH 504
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 325 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 385 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAF 444
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 445 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 483
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 484 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 521
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 383 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 442
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 443 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 502
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 503 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 541
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 542 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 577
>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
Length = 530
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/221 (29%), Positives = 104/221 (47%), Gaps = 34/221 (15%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+P K + +++ P +VY LTD + + +++ +L RS V ++ LS+ RTS
Sbjct: 320 ISPLKEEMLNFDPAIYVYHDVLTDSQNAIIKEVSRPKLHRSGVFSKTDADTGLSNFRTSQ 379
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY--------FS 141
+ +IA + K + + L E E +QVL Y G YEPH+D+ FS
Sbjct: 380 TAWHDDSTHPLIARLSQKASAISNLTLETVEHLQVLNYGIGGLYEPHWDFVQGEERNEFS 439
Query: 142 DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
+ +R+AT + YLS++ GG TV+P G AV P
Sbjct: 440 ES-----DRNRVATFICYLSELEAGGYTVYPTV---------------------GAAVVP 473
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R+ L+++L N D + H+ CP++ G KW A KW H
Sbjct: 474 RKNSCALWYNLMRNGTGDYRTYHAACPILYGYKWVANKWFH 514
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 325 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 385 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 444
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 445 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 483
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 484 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 559
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I P KV+ + P A +++ ++D E + LAK +L R+ V D+++G+ + R S
Sbjct: 315 IYAPIKVEIKRFNPLAVLFKDVISDEEVATIQELAKPKLARATVHDSVTGKLVTATYRIS 374
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
++ + ++ + +I T L E E++Q+ Y G Y+PH+D+ S
Sbjct: 375 KSAWLKAWEHEVVERVNKRIDLMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSF 434
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+ATVL Y+S + GG TVF + V P +
Sbjct: 435 ESLGTGNRIATVLFYMSQPSHGGGTVFTEVKS---------------------TVLPTKN 473
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
DAL +++L+ +P + H+ CPV+ G KW + KWIH
Sbjct: 474 DALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 365 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 424
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 425 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTF 484
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 485 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 523
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 524 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 561
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 109/220 (49%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 306 LLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 362
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D + + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 363 SKSAWLKDTVDPKLVALNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 422
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 423 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 461
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 462 RNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 501
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 294 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 353
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 354 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAF 413
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 414 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 452
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 453 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 490
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAALWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 112/221 (50%), Gaps = 32/221 (14%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++++S P VY L+D E + +A+ LK V + SK S VRT+
Sbjct: 314 LAPLRMEELSLDPYIVVYHNVLSDAEIAKVERVAEPLLKSIGVGE--MDNSKKSKVRTAL 371
Query: 90 GTFIPKGKDAIIAG------IEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK 143
G +IP K+ I+G I +I T L ++G+ +Q+++Y +G Y+ H+DY +D
Sbjct: 372 GAWIPD-KNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDS 430
Query: 144 VNIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ I + G R+ATVL YL+DV GG TVFP + + V
Sbjct: 431 LPITQALGDRMATVLFYLNDVKHGGSTVFPVLK---------------------LKVPSE 469
Query: 203 RGDALLFFSLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIH 242
RG L+++++H D +LH CPVI+G K + WIH
Sbjct: 470 RGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAKTVLSCWIH 510
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 105/216 (48%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y L+D E + + ++K +L R+ V D +G + R S
Sbjct: 324 LIAPFKEEDEWDSPHIVRYYDVLSDEEIEKIKEISKPKLSRATVRDPKTGHLIVVSYRIS 383
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D IIA + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 384 KSSWLKEDDDPIIAQVNRRMQYITGLSVKTAELLQVSNYGMGGQYEPHFDFSRRPFDSGL 443
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G +
Sbjct: 444 KTEGNRLATFLNYMSDVEAGGATVFPDF---------------------GAAIWPKKGTS 482
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 483 VFWYNLFRSGECDYRTRHAACPVLVGSKWVSNKWFH 518
>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
Length = 347
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 104/216 (48%), Gaps = 24/216 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I+P K + +++ P +VY LTD + + +++ +L RS V ++ LS+ RTS
Sbjct: 137 ISPLKEEMLNFDPAIYVYHDVLTDSQNAIIKEVSRPKLHRSGVFSKTDADTGLSNFRTSQ 196
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-DKVNIVR 148
+ +IA + K + + L E E +QVL Y G YEPH+D+ ++ N
Sbjct: 197 TAWHDDSTHPLIARLSQKASAISNLTLETVEHLQVLNYGIGGLYEPHWDFVQGEERNEFS 256
Query: 149 GG--HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+R+AT + YLS++ GG TV+P G AV PR+
Sbjct: 257 ESDRNRVATFICYLSELEAGGYTVYPTV---------------------GAAVVPRKNSC 295
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
L+++L N D + H+ CP++ G KW A KW H
Sbjct: 296 ALWYNLMRNGTGDYRTYHAACPILYGYKWVANKWFH 331
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 346 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 405
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 406 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 465
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 466 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 504
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 505 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 540
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 106/219 (48%), Gaps = 27/219 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I++P K + KP+ +++ +TD E + L LA +L R+ V +GE + R S
Sbjct: 313 ILSPIKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATVHGE-NGELLHATYRIS 371
Query: 89 SGTFIPKGKDAI--IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF---SDK 143
++ D + + I+ +I T L E +QV+ Y G +YEPHYD+ D
Sbjct: 372 KSGWLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDT 431
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ G+R++T+L+Y+SDV KGG TVFP G + P +
Sbjct: 432 FTSLGSGNRISTLLIYMSDVEKGGATVFPGV---------------------GARLVPIK 470
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A +++L + D + H+GCPV+ G KW KWIH
Sbjct: 471 RAAAYWWNLKRSGDGDYSTRHAGCPVLVGSKWVCNKWIH 509
>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
familiaris]
Length = 544
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 110/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F+ D+E + LA+ L+RS VA SGE +L + R
Sbjct: 333 LLQPVRKEVIHLEPYVVLYHDFVNDVEAQKIRGLAEPWLQRSVVA---SGEKQLPVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
S ++ D ++ ++ +I T L + E +QV+ Y G YEPH+D+ + +
Sbjct: 390 SKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTS 449
Query: 146 IV---RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ + G+R+AT ++YLS V GG T F A +V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 528
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 325 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 385 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 444
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 445 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 483
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 484 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|397644755|gb|EJK76534.1| hypothetical protein THAOC_01697 [Thalassiosira oceanica]
Length = 475
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 79/237 (33%), Positives = 110/237 (46%), Gaps = 37/237 (15%)
Query: 33 SKVKQISWKPRAFVYEGFLTDLECDHLINLA-KSQLKRSAVADNLSGESKLSDVRTSSGT 91
SK P +E FLT+ EC H+I K++ +RS + + VR++ T
Sbjct: 264 SKAVDKEQPPWVITFENFLTEDECTHMIEQGRKAEYERSEDVGEVQADGSYDSVRSTGRT 323
Query: 92 F------IPKG--KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK 143
G D I+ + D+IA T + + ED Q+L+YE GQ Y H+DY +
Sbjct: 324 SENAWCSFRDGCRNDTIVELVHDRIAKVTGIGANHSEDFQILKYEPGQFYRQHHDYIEHQ 383
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ R G R+ T +YLSDV +GG T FP K GIAVKP+
Sbjct: 384 RD-RRCGPRVLTFFLYLSDVEEGGATNFP---------------------KLGIAVKPKV 421
Query: 204 GDALLFFSLHTNAIP---DPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGGDCT 257
G ALL+ S+ N+ P D + H VI G K+ A WIH+ + EEG CT
Sbjct: 422 GRALLWPSV-LNSEPRNKDGRTDHEAQDVIAGVKYGANAWIHLHDYQAAQEEG--CT 475
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 346 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 405
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 406 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTF 465
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 466 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 504
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 505 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 542
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 278 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 337
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 338 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 397
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 398 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 436
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 437 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 472
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P++ + + +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 331 LLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 387
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 388 SKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 447
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F +V
Sbjct: 448 PLYRMKSGNRVATFMIYLSSVEAGGATAF---------------------IYGNFSVPVV 486
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 487 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 526
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 111/221 (50%), Gaps = 32/221 (14%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++++S P VY L+D E + +A+ LK V + SK S VRT+
Sbjct: 320 LAPLRMEELSLDPYIVVYHNVLSDAEIAKVERVAEPLLKSIGVGE--MDNSKKSKVRTAL 377
Query: 90 GTFIPKGKDAI-----IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
G +IP I I I +I T L + G+ +Q+++Y +G Y+ H+DY +D +
Sbjct: 378 GAWIPDENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSL 437
Query: 145 NIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
I + G R+ATVL YL+DV GG TVFP + + V R
Sbjct: 438 PITQALGDRMATVLFYLNDVKHGGSTVFPVLQ---------------------LKVPSER 476
Query: 204 GDALLFFSLH--TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G L+++++H T+ + D +LH CPVI+G K + WIH
Sbjct: 477 GKVLVWYNMHGETHDL-DSRTLHGSCPVIDGAKTVLSCWIH 516
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHERDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAALWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P++ + + +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 193 LLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 249
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 250 SKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 309
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F +V
Sbjct: 310 PLYRMKSGNRVATFMIYLSSVEAGGATAF---------------------IYGNFSVPVV 348
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 349 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 388
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 111/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P++ + + +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 331 LLQPARKEVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 387
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 388 SKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 447
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F +V
Sbjct: 448 PLYRMKSGNRVATFMIYLSSVEAGGATAF---------------------IYGNFSVPVV 486
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 487 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 526
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 294 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 353
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 354 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 413
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 414 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 452
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 453 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 488
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 100/214 (46%), Gaps = 22/214 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ S P Y L+ + L +A ++ RS V G++K S R S
Sbjct: 316 LAPLKLEEHSLDPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSK 375
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-VR 148
++ + G+ ++ T L E +QV Y G YEPH+D+F D +
Sbjct: 376 NAWLAYDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAE 435
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+R+AT + YLSDV +GG T FP AVKP+ G+ L
Sbjct: 436 EGNRMATAIFYLSDVEQGGATAFPFL---------------------NFAVKPQLGNVLF 474
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
++++H + D + H+GCPV++G KW WIH
Sbjct: 475 WYNVHRSLDVDYRTKHAGCPVLKGSKWIGNVWIH 508
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 95/209 (45%), Gaps = 25/209 (11%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG-TFIPKGKDAI 100
PR Y L+ EC L+ LA+ L+ S V D + + VRTS G T P +D
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATLDPIIEDFA 287
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF---SDKVNIVRGGHRLATVL 157
+ ++A LP + E + VL Y G++Y H DY + + G+R TV
Sbjct: 288 ARAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRTVC 347
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
+YL+DV GGET FP A G+ V+PR G + F +LH +
Sbjct: 348 VYLNDVGAGGETEFPVA---------------------GVRVRPRPGTLVCFDNLHADGR 386
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
PD SLH+G PV G KW T W +
Sbjct: 387 PDADSLHAGLPVTAGSKWLGTLWFRQQRY 415
>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 532
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 112/220 (50%), Gaps = 32/220 (14%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ + KP V L D + + +I AK +L++S + + RTSS
Sbjct: 314 LQPIKLEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTL--CAADKDGPPSRTSS 371
Query: 90 GTFIPKGKDAIIAG-----IEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
T++ +DA +A ++ + T ++ E Q+ Y G Y PH+DYF +
Sbjct: 372 NTWL-NDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHHDYFEEFQ 430
Query: 145 NIVRG---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
+G G+R+AT+++Y+SDV +GG TVFP+ G+ V P
Sbjct: 431 TPSKGNRFGNRVATLMIYMSDVEEGGATVFPSL---------------------GVRVSP 469
Query: 202 RRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
++GDA+ ++++ ++ + ++ H+GCPV+ G KW A KW
Sbjct: 470 KKGDAVFWWNIMSSWEGEMLTWHAGCPVLYGSKWIANKWF 509
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 355 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 414
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 415 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 474
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 475 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 513
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 514 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 549
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 355 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 414
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 415 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 474
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 475 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 513
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 514 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 549
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 95/209 (45%), Gaps = 25/209 (11%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG-TFIPKGKDAI 100
PR Y L+ EC L+ LA+ L+ S V D + + VRTS G T P +D
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATLDPIIEDFA 287
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF---SDKVNIVRGGHRLATVL 157
+ ++A LP + E + VL Y G++Y H DY + + G+R TV
Sbjct: 288 ARAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRTVC 347
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
+YL+DV GGET FP A G+ V+PR G + F +LH +
Sbjct: 348 VYLNDVGAGGETEFPVA---------------------GVRVRPRPGTLVCFDNLHADGR 386
Query: 218 PDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
PD SLH+G PV G KW T W +
Sbjct: 387 PDADSLHAGLPVTAGSKWLGTLWFRQQRY 415
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 102/213 (47%), Gaps = 24/213 (11%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K++ I P +Y ++ E L +AK +LKR+ V ++ ++ RT+
Sbjct: 315 PLKMELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVKTRTAKVA 374
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--IVR- 148
+ + + + +I T E +QV+ Y G Y H+DYF+ N I +
Sbjct: 375 WFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTTNPHISQI 434
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G R+ATVL YL+DV +GG TVFP ++ AV P+RG A++
Sbjct: 435 NGDRIATVLFYLNDVEQGGATVFPEIKK---------------------AVFPKRGSAIM 473
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++L + + +LH+ CPVI G KW KWI
Sbjct: 474 WYNLKDDGEGNRDTLHAACPVIVGSKWVCNKWI 506
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 100/214 (46%), Gaps = 22/214 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ S P Y L+ + L +A ++ RS V G++K S R S
Sbjct: 316 LAPLKLEEHSLDPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSK 375
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-VR 148
++ + G+ ++ T L E +QV Y G YEPH+D+F D +
Sbjct: 376 NAWLAYDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAE 435
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+R+AT + YLSDV +GG T FP AVKP+ G+ L
Sbjct: 436 EGNRMATAIFYLSDVEQGGATAFPFL---------------------NFAVKPQLGNVLF 474
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
++++H + D + H+GCPV++G KW WIH
Sbjct: 475 WYNVHRSLDVDYRTKHAGCPVLKGSKWIGNVWIH 508
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 328 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 387
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 388 KSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAF 447
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 448 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 486
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 487 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 524
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 109/220 (49%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 333 LLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D + + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 RNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
Length = 235
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 95/205 (46%), Gaps = 29/205 (14%)
Query: 47 YEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIED 106
YE +T EC LI+LA+ L+ S V N E K S VRTS + I
Sbjct: 52 YEKVVTQTECHQLIDLARHGLQPSKVIGN--SEQKTSAVRTSDTIGFQHHLTELTLQICK 109
Query: 107 KIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS-----DKVNIVRGGHRLATVLMYLS 161
+IA+ LP E +Q+ RY+ G K+ H+D F+ K+ + G R+ T L+YL+
Sbjct: 110 RIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMYLSENGQRIITALLYLN 169
Query: 162 DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIP-DP 220
+V+ GGET FP I V P G L+F + N+
Sbjct: 170 NVSAGGETSFPLL---------------------NIQVAPSEGTLLVFENCKKNSNERHA 208
Query: 221 VSLHSGCPVIEGEKWSATKWIHVDS 245
+S+H GC V EGEKW AT W H S
Sbjct: 209 LSIHEGCAVHEGEKWIATLWFHEKS 233
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KRLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 325 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 385 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 444
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 445 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 483
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 484 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 325 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 384
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 385 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTF 444
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 445 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 483
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 484 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 521
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTF 442
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 109/220 (49%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 317 LLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 373
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D + + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 374 SKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 433
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 434 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 472
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 473 RNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 512
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 355 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 414
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ D
Sbjct: 415 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTF 474
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 475 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 513
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 514 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 551
>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
Length = 570
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/222 (31%), Positives = 103/222 (46%), Gaps = 29/222 (13%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKD 98
S P V ++ +EC +LI LAK +KR+ V L K S+ RT S ++ +D
Sbjct: 18 SLDPLVGVRNNVISPVECAYLIELAKPHIKRAGVV--LDEGYKESEGRTGSNHWLKYDED 75
Query: 99 AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR-----GGHRL 153
++ + +I+ LP E E +Q++ Y Q+Y PH+D F+ + + GG RL
Sbjct: 76 DVVQSVGQRISDIVGLPLEYAESMQIIHYGPEQEYRPHFDAFNLSLPKGQRAAKWGGQRL 135
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFF-SL 212
T L+YL+ V GG T FP K GI V G ++F +
Sbjct: 136 VTALVYLNKVEAGGATQFP---------------------KLGITVPALPGRMVIFHNTT 174
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEGG 254
H + P P+SLH+G PV GEKW+ W + E GG
Sbjct: 175 HDISGPHPLSLHAGMPVEAGEKWAFNMWFRLQDTTTEFEFGG 216
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 277 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 336
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 337 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 396
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 397 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 435
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 436 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 471
>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
Length = 490
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/214 (31%), Positives = 101/214 (47%), Gaps = 36/214 (16%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K++++S P V+ + D E D ++N S +L+ + S+VRTS
Sbjct: 288 IAPLKMEELSLDPYMVVFHDVVYDTEIDGMLN-------SSNFVLSLTDSGQKSEVRTSK 340
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
++I K + +++ T E + ++ Y G Y HYD F + N R
Sbjct: 341 DSYIVDAK-----SLNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYD-FHEYTNTTRP 394
Query: 149 -GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G R+ATVL YL +V GG T+FP K IAV P++G A+
Sbjct: 395 KQGDRIATVLFYLGEVDSGGATIFP---------------------KINIAVTPKKGSAV 433
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++LH + + SLHS CPVI G K+ TKWI
Sbjct: 434 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWI 467
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 101/203 (49%), Gaps = 23/203 (11%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAII 101
P Y ++D E + + +AK +L R+ V D +G ++ R S +++ + D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 102 AGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--IVRGGHRLATVLMY 159
A + ++ T L + E +QV Y G +YEPH+D+ + + G+RLAT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPD 219
+SDV GG TVFP+ G A+ P++G A+ +++L + D
Sbjct: 456 MSDVEAGGATVFPDL---------------------GAAIWPKKGTAVFWYNLLRSGEGD 494
Query: 220 PVSLHSGCPVIEGEKWSATKWIH 242
+ H+ CPV+ G KW + KW H
Sbjct: 495 YRTRHAACPVLVGCKWVSNKWFH 517
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 100/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P K+++ S P + L+ + L +A ++ RS V G+ K S R S
Sbjct: 318 FLAPLKLEEHSLDPYVATFHDMLSPRKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVS 377
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ + G+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 378 KNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPA 437
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLS+V +GG T FP + IAVKP+ G+ L
Sbjct: 438 EEGNRIATAIFYLSEVEQGGATAFPFLD---------------------IAVKPQLGNVL 476
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 477 FWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 277 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 336
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 337 KSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 396
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 397 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 435
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 436 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 471
>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
Length = 511
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 108/211 (51%), Gaps = 25/211 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P+K + KPR + ++D E + + +LAK +L+R+ +++ ++G+ + R S
Sbjct: 322 ILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRIS 381
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ ++ +++ I +I T L E++QV Y G +YEPH+D+ D
Sbjct: 382 KSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAF 441
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV+ GG TVFP G +V P++G
Sbjct: 442 KELGTGNRIATWLFYMSDVSAGGATVFPEV---------------------GASVWPKKG 480
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKW 235
A+ +++L + D + H+ CPV+ G KW
Sbjct: 481 TAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 105/216 (48%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G + R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTTASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 383 KSSWLEETDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+RLAT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 443 KTEGNRLATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 481
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 517
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 99/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P K+++ S P + L + L +A ++ RS V G+ K S R S
Sbjct: 318 FLAPLKLEEHSLDPYVATFHDMLNPRKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVS 377
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ + G+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 378 KNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPA 437
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLS+V +GG T FP + IAVKP+ G+ L
Sbjct: 438 EEGNRIATAIFYLSEVEQGGATAFPFLD---------------------IAVKPQLGNVL 476
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 477 FWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 109/236 (46%), Gaps = 45/236 (19%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P KV+ + + P A +++ ++D E + LA +LKR+ V ++ +GE + + R S
Sbjct: 329 IAPIKVEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRATVQNSKTGELEHATYRISK 388
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
++ +I + +I +T L + E++QV Y G Y+PH+D F+ N G
Sbjct: 389 SAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGGHYDPHFD-FARIANYGLG 447
Query: 150 GH-----------------------RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATN 186
GH R+ATVL Y+S +GG TVF +
Sbjct: 448 GHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHL------------ 495
Query: 187 DDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G AV P + DAL +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 496 ---------GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIH 542
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/215 (34%), Positives = 106/215 (49%), Gaps = 25/215 (11%)
Query: 42 PRAFVYEGFLTDLECDHLINLAKS-QLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
PR F FL+ E D L+ + + + A N G + RTS F I
Sbjct: 2 PRVFYVHNFLSADEADELVAFSMAPSTGGTHKAWNQGGSNAKLTTRTSMNAF------DI 55
Query: 101 IAGIEDKIATWTF------LPKENGED-IQVLRYEHGQKYEPHYDYF-----SDKV-NIV 147
+ +I F KEN D IQ+LRYE GQ Y H+DYF +D + +
Sbjct: 56 TTKLSFRIKRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPS 115
Query: 148 RGG-HRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+GG +R AT+ +YLSDV GG+T+ E+ + D L + +AV PRRGDA
Sbjct: 116 KGGSNRFATIFLYLSDVEVGGQTL----EKDAGVDAGSWEDKLVDQCYSKLAVPPRRGDA 171
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+LF+S + + DP SLH CP+++G KW A W+
Sbjct: 172 ILFYSQYPDGHLDPNSLHGACPILKGTKWGANLWV 206
>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
melanoleuca]
Length = 539
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 110/220 (50%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 328 LLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVA---SGEKQLPVEYRI 384
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYFSDKVN 145
S ++ D ++ ++ +I T L + E +QV+ Y G YEPH+D+ + +
Sbjct: 385 SKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTS 444
Query: 146 ---IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
++ G+R+AT ++YLS V GG T F A +V
Sbjct: 445 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFSVPVV 483
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 484 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 523
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 100/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P K+++ S P + L+ + L +A ++ RS V G+ K S R S
Sbjct: 248 FLAPLKLEEHSLDPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVS 307
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ + G+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 308 KNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPA 367
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLS+V +GG T FP + IAVKP+ G+ L
Sbjct: 368 EEGNRIATAIFYLSEVEQGGATAFPFLD---------------------IAVKPQLGNVL 406
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 407 FWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIH 441
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 106/216 (49%), Gaps = 23/216 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 328 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 387
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--I 146
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ + +
Sbjct: 388 KSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGL 447
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
G+R+AT L Y+SDV GG TVFP+ G A+ P++G A
Sbjct: 448 KTEGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKGTA 486
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 487 VFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 522
>gi|378706224|gb|AFC35025.1| hypothetical protein OtV6_117c [Ostreococcus tauri virus RT-2011]
Length = 194
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 101/202 (50%), Gaps = 31/202 (15%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
KP+ V FLT+ E H+ A+S+L S +A N + + + D S T + +D +
Sbjct: 20 KPK--VIRNFLTEDEIAHIKKEAESKLTTSTIAANGTIDKNMRD----SDTAWLELEDPV 73
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ + + + T P N E +QVLRY+ G Y+PH D FSD V+G R+ T+++ L
Sbjct: 74 VNRVTQRCVSLTDRPLINCEKLQVLRYKEGGFYKPHQDTFSD----VKGNKRMYTIILAL 129
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+D +GGETVFPN RR+ K +GDAL F +L +
Sbjct: 130 NDDYEGGETVFPNL----RRK-----------------YKLNKGDALFFHTLDNYELMTS 168
Query: 221 VSLHSGCPVIEGEKWSATKWIH 242
+LH G PV GEKW W+H
Sbjct: 169 KALHGGAPVKSGEKWVCNLWVH 190
>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
Length = 515
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 119/233 (51%), Gaps = 37/233 (15%)
Query: 17 SLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
+L+ R + S+ A + P K++++S P ++ ++D E + + + + +
Sbjct: 296 NLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVLFHEMISDKEIEEM---------KGEITE 346
Query: 75 NLSGESKLSDVR-TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKY 133
+G + L D + S + + + + I +I+ T E IQ+ + G +
Sbjct: 347 MENGWTSLGDSKEIVSRVYWIRKESSFSKRINQRISDMTGFKLEEFPAIQLANFGVGGYF 406
Query: 134 EPHYDYFSDKVNIV----RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDL 189
+PHYDY++D++ V G R+ +++ Y +V++GG+TVFP+ +
Sbjct: 407 KPHYDYYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQGGQTVFPDLK-------------- 452
Query: 190 SECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+AV+P++G+AL +F+ ++ PDP +LHS CPVI G +W+ TKW+H
Sbjct: 453 -------VAVEPKKGNALFWFNAFDDSSPDPRTLHSVCPVIVGSRWTITKWLH 498
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 109/220 (49%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + A+ L+RS VA SGE +L + R
Sbjct: 324 LLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA---SGEKQLQVEYRI 380
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 381 SKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 440
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 441 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 479
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 480 KNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 519
>gi|194904100|ref|XP_001981000.1| GG23922 [Drosophila erecta]
gi|190652703|gb|EDV49958.1| GG23922 [Drosophila erecta]
Length = 490
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 101/214 (47%), Gaps = 36/214 (16%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K++++S P VY + D E D ++N + L +L+ + S+VR S
Sbjct: 288 IAPLKMEELSSDPYMVVYHDVIYDSEIDLMLNASNFSL-------SLTNSGQKSEVRASK 340
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
++I K + D++ T L E + ++ Y G Y HYDY + N+ R
Sbjct: 341 DSYIVDSK-----TLNDRVTDMTGLSMEMSDPFSMINYGIGGHYMLHYDY-HEYSNMTRE 394
Query: 150 --GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G R+ATVL YL +V GG T+FP + I V P++G A+
Sbjct: 395 KYGDRIATVLFYLGEVHSGGATIFP---------------------RINITVTPKKGSAV 433
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++LH + +LHS CPVI G K+ TKWI
Sbjct: 434 FWYNLHNSGAMHSETLHSACPVISGSKYVLTKWI 467
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 108/239 (45%), Gaps = 47/239 (19%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA---------------- 73
I P K + +PR Y +T+ E + + L+K +L+R+ ++
Sbjct: 289 IGPVKQEDEWDRPRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISK 348
Query: 74 ------DNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRY 127
D +G+ + R S ++ + ++ I +I T L + E++QV Y
Sbjct: 349 RRATVHDPQTGKLTTAQYRVSKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANY 408
Query: 128 EHGQKYEPHYDYFS----DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTP 183
G +YEPH+D+ D + G+R+AT L Y+SDVA GG TVFP
Sbjct: 409 GVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEV--------- 459
Query: 184 ATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G AVKP +G A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 460 ------------GAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIH 506
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 100/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P K+++ S P + L+ + L +A ++ RS V G+ K S R S
Sbjct: 318 FLAPLKLEEHSLDPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVS 377
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ + G+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 378 KNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPA 437
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLS+V +GG T FP + IAVKP+ G+ L
Sbjct: 438 EEGNRIATAIFYLSEVEQGGATAFPFLD---------------------IAVKPQLGNVL 476
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 477 FWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
Length = 595
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 101/216 (46%), Gaps = 24/216 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K + + PR ++ + E + LA +L+R+ V + ++G+ + + RTS
Sbjct: 378 IGPVKEEVLYPDPRIVMWYDVIHPSEVGRIQELALPRLRRATVKNPVTGKLENAYYRTSK 437
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN---I 146
++ G D + + +I T L E ED+QV Y G Y PH+D+ +
Sbjct: 438 SAWLQDGLDEVTHRLNQRIHALTGLAMETAEDLQVGNYGIGGYYAPHFDFGRKREKDAFE 497
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
V G+R+AT++ YL+DV GG TVF + G +VKP RG A
Sbjct: 498 VENGNRIATIIFYLTDVKAGGATVF---------------------NRFGASVKPVRGAA 536
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H CPV+ G KW W H
Sbjct: 537 GFWYNLHPSGEGDLRTRHVACPVLVGSKWVMNVWFH 572
>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
Length = 538
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 109/223 (48%), Gaps = 28/223 (12%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S + P + + IS +P ++ GF+T E ++ A L+RS VA ++ + ++
Sbjct: 323 SPALFLQPIRREIISLQPYVVLFHGFVTQAEAKNIRKYAMPGLRRSVVASGMNQAT--AE 380
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF-- 140
R S ++ + ++ ++ +I T L + E +QV+ Y G YEPH+D+
Sbjct: 381 YRISKSAWLKESAHEVVGKLDQRITLVTGLNVQPPYAEYLQVVNYGIGGHYEPHFDHATS 440
Query: 141 -SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAV 199
S + ++ G+R+AT+++YLS V GG T F A +V
Sbjct: 441 DSSPLYRLKTGNRVATIMIYLSPVQAGGSTAFIYA---------------------NFSV 479
Query: 200 KPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH N + +LH+GCPVI G KW A KW+H
Sbjct: 480 PVVQNAALFWWNLHKNGQGNVDTLHAGCPVIVGNKWVANKWVH 522
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 99/194 (51%), Gaps = 23/194 (11%)
Query: 51 LTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIAT 110
++D E + + +AK +L R+ V D +G ++ R S +++ + D ++A + ++
Sbjct: 1 MSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQH 60
Query: 111 WTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--IVRGGHRLATVLMYLSDVAKGGE 168
T L + E +QV Y G +YEPH+D+ + + G+RLAT L Y+SDV GG
Sbjct: 61 ITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGA 120
Query: 169 TVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCP 228
TVFP+ G A+ P++G A+ +++L + D + H+ CP
Sbjct: 121 TVFPDL---------------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACP 159
Query: 229 VIEGEKWSATKWIH 242
V+ G KW + KW H
Sbjct: 160 VLVGCKWVSNKWFH 173
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 109/220 (49%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + + +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 323 LLQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 379
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 380 SKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 439
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 440 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 478
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G KW A KWIH
Sbjct: 479 KNAALFWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIH 518
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 100/215 (46%), Gaps = 22/215 (10%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P K+++ S P + L+ + L +A ++ RS V G+ K S R S
Sbjct: 67 FLAPLKLEEHSLDPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVS 126
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-V 147
++ + G+ + T L E +QV Y G YEPH+D+F D +
Sbjct: 127 KNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPA 186
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G+R+AT + YLS+V +GG T FP + IAVKP+ G+ L
Sbjct: 187 EEGNRIATAIFYLSEVEQGGATAFPFLD---------------------IAVKPQLGNVL 225
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH + D + H+GCPV++G KW WIH
Sbjct: 226 FWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIH 260
>gi|195166681|ref|XP_002024163.1| GL22882 [Drosophila persimilis]
gi|194107518|gb|EDW29561.1| GL22882 [Drosophila persimilis]
Length = 534
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 116/232 (50%), Gaps = 29/232 (12%)
Query: 17 SLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
+L+ R +F++T + P K+++++ P +Y L+D E + + + Q+
Sbjct: 309 NLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIEEMKGRS-GQMSNGWADQ 367
Query: 75 NLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYE 134
+ +K+ D+ + + + AI + +I+ T ED+QV Y G ++
Sbjct: 368 KEANSTKIRDIVCRHTWW--REQSAIKERVNRRISDMTNFDFPPQEDLQVANYGLGTHFK 425
Query: 135 PHYDYFSDKV---NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSE 191
PHYDY SD +++ G RL +++ Y SDV +GG TVFP +
Sbjct: 426 PHYDYTSDGYETPDVLTLGDRLGSIIFYASDVPQGGATVFPRSR---------------- 469
Query: 192 CAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
+++ PR+G ++ +++L+ + D S HS CPVI G++W+ TKW+H+
Sbjct: 470 -----VSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRWTLTKWLHI 516
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 106/225 (47%), Gaps = 32/225 (14%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + PR Y L++ E + + LA+ +L+R+ V D +G+ + R S
Sbjct: 294 VIGPVKQEDEWDHPRIVRYHDVLSNREMEKVKELARPRLRRATVHDPRTGQLTTAPYRVS 353
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS----DKV 144
++ + I+ I +I T L ED+QV Y G +YEPH+D+ D
Sbjct: 354 KSAWLGAFEHPIVDQINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGQKDEPDAF 413
Query: 145 NIVRGGHRLATVLMY-------LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGI 197
+ G+R+AT L+Y +SDV GG TVF + G
Sbjct: 414 EELGTGNRIATWLLYVSAAVLRMSDVQAGGATVFTDI---------------------GA 452
Query: 198 AVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+V P++G A+ +++L + D + H+ CPV+ G KW + KWIH
Sbjct: 453 SVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKWVSNKWIH 497
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 107/226 (47%), Gaps = 33/226 (14%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
SS P ++++S P +Y ++ E + L LA QL+ + V S +++ +
Sbjct: 87 SSKTRFMPYAIEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLE 146
Query: 85 --VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
R + FI + A+ + I ++ T L E +QV+ Y +Y PHYD F
Sbjct: 147 GYTRIAKMAFILDEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTFPA 206
Query: 143 KV-NIVRGGH-RLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
K + H RLAT ++YLSDV +GG TVF N + V
Sbjct: 207 KSGDRSHPSHDRLATAILYLSDVERGGATVFTNIN---------------------VRVL 245
Query: 201 PRRGDALLFFSLHTNAIPD----PVSLHSGCPVIEGEKWSATKWIH 242
PR+G+ ++++ N +PD P +LH+GCPV+ G KW A KWI
Sbjct: 246 PRKGNVIIWY----NYLPDGNLHPGTLHAGCPVLVGSKWIANKWIQ 287
>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
Length = 559
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 105/218 (48%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+ P KV+ + P A +++ ++D E + LAK +L R+ V D+ +G+ + R S
Sbjct: 315 VYAPIKVEIKRFNPLAVLFKDVISDDEVATIQELAKPKLARATVHDSATGKLVTATYRIS 374
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----SDKV 144
++ + + ++ + +I T L E E++Q+ Y G Y+PH+D+ S
Sbjct: 375 KSAWLKEWEHEVVERVNKRIELMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSF 434
Query: 145 NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+ATVL Y+S + GG TVF + V P +
Sbjct: 435 ESLGTGNRIATVLFYMSQPSHGGGTVFTEVKS---------------------TVLPTKN 473
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
DAL +++L +P + H+ CPV+ G KW + KWIH
Sbjct: 474 DALFWYNLFKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 100/214 (46%), Gaps = 27/214 (12%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
+ + +S P + + EC +L ++ +L+ S + D +G + VRTS G +
Sbjct: 121 RTEPVSETPSIRMVRHLFSSAECAYLQQMSAPRLRPSTILDPQTGARRPDPVRTSVGAAL 180
Query: 94 -PKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHR 152
P +D ++ + +IA T + GE + +LRY Q+Y PH+D + N R
Sbjct: 181 SPVEEDLVVGMLNRRIAAATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLEN-----QR 235
Query: 153 LATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL 212
T+++YL+ +GGET FP + G ++ R+GDALLF +L
Sbjct: 236 SHTLIVYLTADYEGGETAFP---------------------ELGFRLRGRQGDALLFANL 274
Query: 213 HTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSF 246
+ PD H+G P G KW AT+WI +
Sbjct: 275 REDGRPDLRMRHAGLPATSGAKWIATRWIRTRPY 308
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 110/226 (48%), Gaps = 30/226 (13%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
S S ++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L
Sbjct: 297 SSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVA---SGEKQL 353
Query: 83 S-DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDY 139
+ R S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+
Sbjct: 354 PVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDH 413
Query: 140 F---SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
S + + G+R+AT ++YLS V GG T F
Sbjct: 414 ATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAF---------------------IYGN 452
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+V + AL +++LH + D +LH+ CPV+ G+KW A KWIH
Sbjct: 453 FSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIH 498
>gi|126736198|ref|ZP_01751941.1| response regulator receiver domain protein (CheY-like) [Roseobacter
sp. CCS2]
gi|126714364|gb|EBA11232.1| response regulator receiver domain protein (CheY-like) [Roseobacter
sp. CCS2]
Length = 217
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 101/202 (50%), Gaps = 28/202 (13%)
Query: 44 AFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAG 103
A + + F DL H+I+L + L R+ V D+ +G KL + RT+ I + D +A
Sbjct: 16 AVIDDVFDEDL-AQHVISLGQEALVRATVVDS-AGGGKLDESRTNDSGTIDQWSDPKLAS 73
Query: 104 IEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN----IVRGGHRLATVLMY 159
+ I+ LP EN E Q+LRYE QK++PH D F + V I RGG RL T + Y
Sbjct: 74 LVTTISDLVRLPPENSEPSQLLRYEGEQKFDPHTDAFDNTVGGRDFISRGGQRLFTTICY 133
Query: 160 LSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT-NAIP 218
L++V KGGET FP + I + P+ G L+F + A+
Sbjct: 134 LNNVGKGGETEFPALK---------------------IKIAPKLGRVLIFGNTRLGTAME 172
Query: 219 DPVSLHSGCPVIEGEKWSATKW 240
P S H G PV +GEK++ + W
Sbjct: 173 HPHSTHGGRPVKDGEKYALSIW 194
>gi|198466403|ref|XP_002135183.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
gi|198150584|gb|EDY73810.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 119/247 (48%), Gaps = 29/247 (11%)
Query: 2 SPTRLSLNFFFLLSFSLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHL 59
+P + F L+ R +F++T + P K+++++ P +Y L+D E + +
Sbjct: 294 TPYEIGCRGLFPKRTKLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIEEM 353
Query: 60 INLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENG 119
+ Q+ + +K+ D+ + + + AI + +I+ T
Sbjct: 354 KGRS-GQMSNGWADQKEANSTKIRDIVCRHTWW--REQSAIKERVNRRISDMTNFDFPPQ 410
Query: 120 EDIQVLRYEHGQKYEPHYDYFSDKV---NIVRGGHRLATVLMYLSDVAKGGETVFPNAEE 176
ED+QV Y G ++PHYDY SD +++ G RL +++ Y SDV +GG TVFP +
Sbjct: 411 EDLQVANYGLGTHFKPHYDYTSDGYETPDVLTLGDRLGSIIFYASDVPQGGATVFPRSR- 469
Query: 177 PPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWS 236
+++ PR+G ++ +++L+ + D S HS CPVI G++W+
Sbjct: 470 --------------------VSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRWT 509
Query: 237 ATKWIHV 243
TKW+H+
Sbjct: 510 LTKWLHI 516
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 109/218 (50%), Gaps = 25/218 (11%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-SDKVNIV 147
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+ +D+ +
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTF 442
Query: 148 R---GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRG 204
+ G+R+AT L Y+SDV GG TVFP+ G A+ P++G
Sbjct: 443 KHLGTGNRVATFLNYMSDVEAGGATVFPDL---------------------GAAIWPKKG 481
Query: 205 DALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 482 TAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 519
>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
Length = 544
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 109/220 (49%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L + R
Sbjct: 333 LLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA---SGEKQLQVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ + + + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVNPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
R AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 489 RNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 107/217 (49%), Gaps = 26/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ +S P +Y + D E + L + A ++RS V S E + RTS
Sbjct: 247 LAPFKVEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVY-TYSDEDSKNTGRTSM 305
Query: 90 GTFIPKGKDAIIAGIEDKIATWT---FLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN- 145
F + + + ++ T L + +++ VL Y +Y H DYF +
Sbjct: 306 SAFQTDHQYKAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSE 365
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
++ G R+ATVL YL+DV +GG+TVFP + GI P +G
Sbjct: 366 YIQRGDRIATVLFYLNDVEQGGKTVFP---------------------RLGIFRSPMKGS 404
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A++F++++++ DP + H GCPV+ G KW+ATKWI+
Sbjct: 405 AVVFYNMNSSLQGDPRTEHGGCPVLVGTKWAATKWIY 441
>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
Length = 475
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 99/212 (46%), Gaps = 32/212 (15%)
Query: 34 KVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI 93
K + + P +++ F++D E L ++A+ Q + SAV D+ GES R SS F+
Sbjct: 176 KTELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLDDTGGESFFDVSRLSSTAFV 235
Query: 94 PKGKDAIIAGIEDKIATWTFLPKE------NGEDIQVLRYEHGQKYEPHYDYFSDKVN-- 145
D ++A + +++ T L E E +QVLRY G Y PHYD + +
Sbjct: 236 NDSND-LVASLNRRVSKLTGLQTEVLDSFSESESLQVLRYGPGGLYTPHYDTLGSEADLP 294
Query: 146 --IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
I G R+AT ++YL GG TVFP +++ ++
Sbjct: 295 PYIQHTGDRIATFILYLDIATAGGATVFPLLP---------------------MSIPIQK 333
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKW 235
G A +F+LH + D +LH+ CPVI G KW
Sbjct: 334 GAAAFWFNLHPDGSLDRRTLHAACPVIRGTKW 365
>gi|170029530|ref|XP_001842645.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167863229|gb|EDS26612.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 522
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 106/219 (48%), Gaps = 36/219 (16%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+++S +P ++Y ++D E D LI L K++L R+ V +S VR S
Sbjct: 306 LAPLKVEEVSLEPPIYLYHKVISDEEIDKLIELGKARLNRATVG------QMVSQVRISQ 359
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGED-IQVLRYEHGQKYEPHYDYFSD-----K 143
++ + D ++ ++ + + G D +QV Y G PHYD S+ +
Sbjct: 360 NVWLSEEVDPLLGVLQRRTYDMSRGLSMQGFDMVQVNNYGIGGHNIPHYDCDSEYPPFPQ 419
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
N+ G+RLAT++ YLSDV GG TVFP + + V P +
Sbjct: 420 FNM---GNRLATLMYYLSDVEVGGGTVFP---------------------RLSLGVFPIK 455
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A+ + ++H N D LH+GCP + G KW A WIH
Sbjct: 456 GSAIFWHNVHHNGNVDERMLHAGCPTLIGSKWVANIWIH 494
>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 535
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/249 (32%), Positives = 116/249 (46%), Gaps = 41/249 (16%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLA------KSQLKRSAV-ADNLSGESKLSD 84
P ++ IS PR F F + E D LI ++L++S V A++ + K S
Sbjct: 178 PVIIESISESPRTFRLHNFFSGEEADKLIKRTLEIDDPSNKLQQSTVGANDNKNKKKKSK 237
Query: 85 VRTSSGTF--IPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-- 140
RTS F + + I + D ++ F + + +Q+LRY+ Q Y H DYF
Sbjct: 238 HRTSENAFDTVSEAAVDIRKRVFDVLSLGEF-QADMADGLQLLRYQQKQAYIAHEDYFPV 296
Query: 141 --SDKVNI---VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATN--------- 186
+ N G +R ATV +YLSDV +GG+TVFP AE P T +
Sbjct: 297 GAAKDFNFDPHKGGSNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNSAQDYE 356
Query: 187 --------------DDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEG 232
D + +C+ K +A P +G A+LF+S N DP SLH GCPV+EG
Sbjct: 357 AIGAELFEPGSWEMDMVRKCSTK-LASYPSKGGAVLFYSQKPNGELDPKSLHGGCPVLEG 415
Query: 233 EKWSATKWI 241
KW A W+
Sbjct: 416 TKWGANLWV 424
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 110/226 (48%), Gaps = 30/226 (13%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
S S ++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L
Sbjct: 327 SSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA---SGEKQL 383
Query: 83 S-DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDY 139
+ R S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+
Sbjct: 384 PVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDH 443
Query: 140 F---SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
S + + G+R+AT ++YLS V GG T F
Sbjct: 444 ATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAF---------------------IYGN 482
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+V + AL +++LH + D +LH+ CPV+ G+KW A KWIH
Sbjct: 483 FSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIH 528
>gi|323453493|gb|EGB09364.1| hypothetical protein AURANDRAFT_15704, partial [Aureococcus
anophagefferens]
Length = 148
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 82/150 (54%), Gaps = 22/150 (14%)
Query: 95 KGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLA 154
+ A++A IE+ T +PKEN E QVLRY HGQ+Y H+D S N + G R+
Sbjct: 19 RATRAVMARIEEV----TGVPKENYESFQVLRYTHGQQYRAHHD-MSRGDNALACGPRIY 73
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T MY SDV KGGET FP + P + K + + P+RG ALL+ S+ +
Sbjct: 74 TFFMYFSDVEKGGETEFPMVKRP---------------SGKTVKIAPKRGSALLWPSVTS 118
Query: 215 N--AIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ DP + H+ PV+EG K++A WIH
Sbjct: 119 DDPTAQDPRTRHAALPVVEGTKFAANAWIH 148
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 110/226 (48%), Gaps = 30/226 (13%)
Query: 23 SFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKL 82
S S ++ P + + I +P +Y F++D E + LA+ L+RS VA SGE +L
Sbjct: 261 SSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA---SGEKQL 317
Query: 83 S-DVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDY 139
+ R S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+
Sbjct: 318 PVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDH 377
Query: 140 F---SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKG 196
S + + G+R+AT ++YLS V GG T F
Sbjct: 378 ATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAF---------------------IYGN 416
Query: 197 IAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+V + AL +++LH + D +LH+ CPV+ G+KW A KWIH
Sbjct: 417 FSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIH 462
>gi|313844088|ref|YP_004061751.1| hypothetical protein OlV1_118c [Ostreococcus lucimarinus virus
OlV1]
gi|312599473|gb|ADQ91495.1| hypothetical protein OlV1_118c [Ostreococcus lucimarinus virus
OlV1]
Length = 195
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 96/197 (48%), Gaps = 29/197 (14%)
Query: 46 VYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIE 105
V FLT+ E H++ AK++L S +A+N + K+ D T+ F D ++ +
Sbjct: 24 VIPNFLTEDERKHIMEKAKTKLDVSTIAENRVVDKKVRDSETAWLDFT----DPVVMRVA 79
Query: 106 DKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAK 165
+ A+ T P N E +QVLRY+ G Y PH D FSD +G R+ TV++ L+D +
Sbjct: 80 RRCASLTDRPIMNCEHLQVLRYKPGGHYRPHQDTFSD----TKGNKRMYTVILALNDDYE 135
Query: 166 GGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHS 225
GET FPN ++ K + GDALLF +L + +LH
Sbjct: 136 EGETEFPNLKK---------------------KYKLKAGDALLFHTLDNYELMTSKALHG 174
Query: 226 GCPVIEGEKWSATKWIH 242
G PV GEKW W+H
Sbjct: 175 GKPVKSGEKWVCNLWVH 191
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 109/217 (50%), Gaps = 25/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P ++ +S P+ ++ L+++E + ++ LA+ +L+R+ V + +GE + D R S
Sbjct: 314 LKPVAMEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVDYRISQ 373
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN---- 145
++ I+ I ++ T L GE +QV Y G YEPH+D+ D N
Sbjct: 374 IAWLSDSDGDIVRRINRRVGFITGLNTNTGECLQVNNYGVGGHYEPHFDHSLDMENSPIA 433
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
+ G+R+AT + YLS+V GG TVF K G+ P +G
Sbjct: 434 SLGQGNRIATFMFYLSEVEAGGSTVF---------------------IKTGVKTNPFKGG 472
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A+ +++L + D SLH+GCPV+ G KW A KW+H
Sbjct: 473 AVFWYNLKKSGEGDWDSLHAGCPVLIGNKWVANKWLH 509
>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
[Drosophila melanogaster]
gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
Length = 481
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 104/220 (47%), Gaps = 31/220 (14%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
S+ I+ P K+++IS +P VY L D + LI LA+ LK + + D+ E++ S
Sbjct: 288 SAFLILAPLKMEEISLEPHIVVYHDILPDKDIQQLITLAEPLLKPTEMFDDNKNEAR-SS 346
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
RT G ++ + ++ T L G I +++Y G Y +YD+F +
Sbjct: 347 YRTPLGG-------PLLDSLTQRMRDITGLQIRQGNPINIIKYGFGAPYTNYYDFFKKRN 399
Query: 145 NIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ +G G R+AT + YL+D GG TVFP + + V R
Sbjct: 400 SESKGFGDRMATFMFYLNDAPYGGATVFP---------------------RLNVKVPAER 438
Query: 204 GDALLFFSLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIH 242
G L +++L+ + +P ++H+ CPV G KW T WIH
Sbjct: 439 GKVLFWYNLNGDTHDMEPTTMHAACPVFHGSKWVMTAWIH 478
>gi|388548946|gb|AFK66147.1| prolyl 4-hydroxylase alpha subunit [Ostreococcus lucimarinus virus
OlV3]
Length = 196
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 100/206 (48%), Gaps = 31/206 (15%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+PR V G +T E +H++ A S+L S VA+N + K+ D T+ D +
Sbjct: 21 EPR--VVRGLVTPKEREHIMKKASSKLDVSTVAENRIIDKKIRDSETAWLDM----DDPV 74
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ + +K + T P N E +QVLRY+ G Y+PH D FSD +G R+ TV++ L
Sbjct: 75 VKRVCEKCVSLTDRPLTNCEQLQVLRYKPGGHYKPHQDTFSD----TKGNKRMYTVILAL 130
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+D +GGET FPN ++ K + GDAL F +L +
Sbjct: 131 NDEYEGGETEFPNLKK---------------------KYKLKAGDALFFHTLDNYELLTS 169
Query: 221 VSLHSGCPVIEGEKWSATKWIHVDSF 246
+LH G PV GEKW W+H S+
Sbjct: 170 KALHGGRPVKSGEKWVCNLWVHKHSY 195
>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
Length = 533
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 64/221 (28%), Positives = 109/221 (49%), Gaps = 33/221 (14%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ + KP V L D + + +I AK +L++S + + RTSS
Sbjct: 314 LQPIKLEEYNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTL--CAADKDGPPPRTSS 371
Query: 90 GTFIPKGKDAIIAG-----IEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
T++ DA +A ++ + T K+ E Q+ Y G Y PH+DY + +
Sbjct: 372 NTWL-DDDDAPVAARVNQYLQSLLGLGTLYGKDEAEKYQLANYGIGGHYVPHHDYLEESL 430
Query: 145 NIVRG----GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
+ G R+AT+++Y+SDV +GG TVFP+ G+ V
Sbjct: 431 TSSKKHRLFGDRVATLMIYMSDVEEGGATVFPSL---------------------GVRVS 469
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
PR+GDA+ ++++ ++ D ++ H+GCPV+ G KW A KW
Sbjct: 470 PRKGDAVFWWNIKSSWEGDVLTWHAGCPVLYGSKWIANKWF 510
>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
Length = 472
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 101/214 (47%), Gaps = 36/214 (16%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K++++S P V+ + D E D ++N S +L+ + S+VRTS
Sbjct: 270 IAPLKMEELSLDPYMVVFHDVVYDTEIDGMLN-------SSNFGLSLTDSGQKSEVRTSK 322
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR- 148
++I + + +++ T E + ++ Y G Y HYD F + N R
Sbjct: 323 DSYIVDSE-----SLNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYD-FHEYTNTTRP 376
Query: 149 -GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G R+ATVL YL +V GG T+FP K IAV P++G A+
Sbjct: 377 KQGDRIATVLFYLGEVDSGGATIFP---------------------KINIAVTPKKGSAV 415
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+++LH + + SLHS CPVI G K+ TKWI
Sbjct: 416 FWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWI 449
>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
Length = 239
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 98/209 (46%), Gaps = 26/209 (12%)
Query: 38 ISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGK 97
+S P + + F + C+HLI A+ L + V G + + +R +S ++
Sbjct: 31 LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEV-QTRRGSAARTAIRRASSCWLAARG 89
Query: 98 DAIIAGIEDKIATWTFLPKENGEDIQVLRYE--HGQKYEPHYDYF-SDKVNIVRGGHRLA 154
D + +ED I P+E E V+RY G++Y H D F + + RGG RL
Sbjct: 90 DEALEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQRLT 149
Query: 155 TVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHT 214
T L+YLSDV GG TVFP G++V PRRG L+F ++
Sbjct: 150 TALLYLSDVGAGGATVFP---------------------ALGLSVAPRRGRLLVFANVAD 188
Query: 215 NAIPDPVSLHSGCPVI-EGEKWSATKWIH 242
+ D ++H+G P+ + EKW A KW+
Sbjct: 189 DTTVDARTVHAGEPIAGDTEKWIANKWVR 217
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/222 (28%), Positives = 101/222 (45%), Gaps = 25/222 (11%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P +++++ P ++ E L LA+ +L+RS V + E ++ R S GT
Sbjct: 23 PYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHISTNFRISQGT 82
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK----VNIV 147
F + I+ + + + L + E +QV Y G YEPH D FS+ +N
Sbjct: 83 FFEYHEHPIMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTY 142
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
+R+AT + YLS+V GG T FP + V+P RG L
Sbjct: 143 MSTNRVATGIYYLSNVEAGGGTAFPFLP---------------------LLVEPERGSLL 181
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI 249
+++LH + D + H+GCPV+ G KW A WI + + D I
Sbjct: 182 FWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQDHI 223
>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
Length = 554
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 104/220 (47%), Gaps = 28/220 (12%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++++S P VY L+D E + + + LKRS V D + S RT+
Sbjct: 338 LAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTAL 397
Query: 90 GTFIPKGK-----DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
G ++P A+I I +I T L + +D+Q+++Y +G Y+ H+DYF+
Sbjct: 398 GAWLPDDNMDVSGRAVIQRILRRIHELTGLIMNDRQDMQLIKYGYGGHYDIHFDYFNTSS 457
Query: 145 NIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
I + G R+ATVL YL+DV GG T F + + + V R
Sbjct: 458 PITKARGDRMATVLFYLNDVKHGGSTAFTDLQ---------------------LKVPSER 496
Query: 204 GDALLFFSLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIH 242
G L ++++ D +LH CPVI+G K + WIH
Sbjct: 497 GKVLFWYNMRGETHDLDSRTLHGACPVIDGTKSILSCWIH 536
>gi|311977988|ref|YP_003987108.1| putative prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|81999799|sp|Q5UP57.1|P4H_MIMIV RecName: Full=Putative prolyl 4-hydroxylase; Short=4-PH; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
gi|55417206|gb|AAV50856.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|308204490|gb|ADO18291.1| putative prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|339061535|gb|AEJ34839.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga mimivirus]
gi|351737756|gb|AEQ60791.1| Prolyl 4-hydroxylase [Acanthamoeba castellanii mamavirus]
gi|398257408|gb|EJN41016.1| prolyl 4-hydroxylase [Acanthamoeba polyphaga lentillevirus]
Length = 242
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 96/210 (45%), Gaps = 33/210 (15%)
Query: 43 RAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIA 102
+ FV + +C ++ A +L S V LSG K ++R S +I K + ++
Sbjct: 59 KPFVLNNLINPTKCQEIMQFANGKLFDSQV---LSGTDK--NIRNSQQMWISKN-NPMVK 112
Query: 103 GIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN-----IVRGGHRLATVL 157
I + I +P +N ED+QV+RY Q Y H+D D I RGG R+ TVL
Sbjct: 113 PIFENICRQFNVPFDNAEDLQVVRYLPNQYYNEHHDSCCDSSKQCSEFIERGGQRILTVL 172
Query: 158 MYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAI 217
+YL++ G T FPN + KP+ GDAL+F+ L N+
Sbjct: 173 IYLNNEFSDGHTYFPNLNQ---------------------KFKPKTGDALVFYPLANNSN 211
Query: 218 P-DPVSLHSGCPVIEGEKWSATKWIHVDSF 246
P SLH+G PV GEKW A W F
Sbjct: 212 KCHPYSLHAGMPVTSGEKWIANLWFRERKF 241
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 108/220 (49%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + + +P +Y F++D E + A+ L+RS VA SGE +L + R
Sbjct: 333 VLQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA---SGEKQLQVEYRI 389
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 390 SKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSS 449
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ ++ G+R+AT ++YLS V GG T F A ++V
Sbjct: 450 PLYRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NLSVPVV 488
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G KW A KWIH
Sbjct: 489 KNAALFWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIH 528
>gi|410447164|ref|ZP_11301266.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
gi|409980151|gb|EKO36903.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
Length = 214
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 103/214 (48%), Gaps = 34/214 (15%)
Query: 39 SWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKD 98
S P ++ + FL+DLECD IN A+ +L+ S V + E KL RTS +I +
Sbjct: 18 SVNPIVYLVKNFLSDLECDAFINEAEGRLQDSTVI-GANDEIKLG-ARTSQNCWIEHDAN 75
Query: 99 AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF-----SDKVNIVRGGHRL 153
++ + +++ +P N E Q+ YE ++Y+P +D F K N GG R+
Sbjct: 76 ELVHEVSKRLSILAQIPIRNAEQYQLACYEKDEEYKPRFDSFDFDTLEGKKNWEPGGQRM 135
Query: 154 ATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSL- 212
T+++YL+DV GG T FP K G + P++GD ++ +
Sbjct: 136 LTIIVYLNDVQSGGGTDFP---------------------KLGFTIPPKKGDVVVLNNTC 174
Query: 213 ---HTNAIPD--PVSLHSGCPVIEGEKWSATKWI 241
N P+ P SLH+G PV+ G+KW T W
Sbjct: 175 DDDSQNGHPNIHPNSLHAGMPVLSGKKWIVTLWF 208
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 101/223 (45%), Gaps = 25/223 (11%)
Query: 31 NPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSG 90
P +++++ P ++ E L LA+ +L+RS V + E ++ R S G
Sbjct: 320 QPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHISTNFRISQG 379
Query: 91 TFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDK----VNI 146
TF + I+ + + + L + E +QV Y G YEPH D FS+ +N
Sbjct: 380 TFFEYHEHPIMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINT 439
Query: 147 VRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDA 206
+R+AT + YLS+V GG T FP + V+P RG
Sbjct: 440 YMSTNRVATGIYYLSNVEAGGGTAFPFLP---------------------LLVEPERGSL 478
Query: 207 LLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI 249
L +++LH + D + H+GCPV+ G KW A WI + + D I
Sbjct: 479 LFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQDHI 521
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 107/217 (49%), Gaps = 26/217 (11%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV+ +S P +Y + D E + L + A ++RS V + K + RTS
Sbjct: 240 LAPFKVEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVYTYSDKDGKDTG-RTSM 298
Query: 90 GTFIPKGKDAIIAGIEDKIATWT---FLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN- 145
F + + + ++ T L + +++ VL Y +Y H DYF +
Sbjct: 299 SAFQTDHQYTAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSE 358
Query: 146 IVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
++ G R+ATVL YL+DV +GG+TVFP + GI P +G
Sbjct: 359 YIQRGDRIATVLFYLNDVEQGGKTVFP---------------------RLGIFRSPMKGS 397
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
A++F++L+++ DP + H GCPV+ G KW+ATKWI+
Sbjct: 398 AVVFYNLNSSLQGDPRTEHGGCPVLVGTKWAATKWIY 434
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 97/192 (50%), Gaps = 23/192 (11%)
Query: 53 DLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWT 112
D E + + +AK +L R+ V D +G ++ R S +++ + D ++A + ++ T
Sbjct: 1 DEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHIT 60
Query: 113 FLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVN--IVRGGHRLATVLMYLSDVAKGGETV 170
L + E +QV Y G +YEPH+D+ + + G+RLAT L Y+SDV GG TV
Sbjct: 61 GLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATV 120
Query: 171 FPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVI 230
FP+ G A+ P++G A+ +++L + D + H+ CPV+
Sbjct: 121 FPDL---------------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 159
Query: 231 EGEKWSATKWIH 242
G KW + KW H
Sbjct: 160 VGCKWVSNKWFH 171
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 76/244 (31%), Positives = 110/244 (45%), Gaps = 47/244 (19%)
Query: 35 VKQISWKPRAFVYEGFLTDLECDHLIN--LAKSQ----LKRSAVADNLSGESKLSDVRTS 88
++ +S PR F F+ E D +I L +Q LKRS+ + +S RTS
Sbjct: 207 LETLSHSPRVFSLYNFMDMEEADSIIEDALGMTQEAYRLKRSSTG---TKGKAISKTRTS 263
Query: 89 SGTFIPKGKDAI--------IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF 140
F+ A + GIE+ TW + +QVLRY Q Y H+DY
Sbjct: 264 DNAFVTHTNTAQALKRRIFQLLGIEEYHETW-------ADGLQVLRYNESQAYVAHFDYL 316
Query: 141 S-----DKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPAT---------N 186
D + G +R ATV++Y +DV +GGETVF +A P T N
Sbjct: 317 ESAEGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLEN 376
Query: 187 DDLSECA---------KKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSA 237
DL ++ + V P+RG A+LF++ H + D S H CPVI+G+KW+A
Sbjct: 377 LDLPRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAA 436
Query: 238 TKWI 241
W+
Sbjct: 437 NLWV 440
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 107/220 (48%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + + +P +Y F++D E + A L+RS VA SGE + + R
Sbjct: 310 LLQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA---SGEKQQQVEYRI 366
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D I+ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 367 SKSAWLKDTVDPILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSS 426
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ + G+R+AT ++YLS V GG T F A +V
Sbjct: 427 PLYRMNSGNRVATFMIYLSSVEAGGSTAFIYAN---------------------FSVPVV 465
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 466 KNAALFWWNLHRSGQGDGDTLHAGCPVLVGDKWVANKWIH 505
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 102/215 (47%), Gaps = 31/215 (14%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
I P K+++IS P VY + + E + L L S +L GES++S +RTS
Sbjct: 257 IAPLKMEEISTDPYMVVYHDVIYENEINWL-------LDNSDFRTSLVGESQISTLRTSQ 309
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF--SDKVNIV 147
++ IE +I T L + ED ++ Y G Y+ HYD++ S+ + +
Sbjct: 310 DMPFGANSGEVMRNIEKRIKDMTGLSMDLSEDFMLINYGIGGTYKMHYDFYVYSEPLRFL 369
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
RG R+ TVL YL DV G TVFP I++ P++G A+
Sbjct: 370 RG-ERIVTVLFYLGDVELSGSTVFPFL---------------------NISITPKKGSAV 407
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
++++LH + + H CPV+ G K+ TKWI+
Sbjct: 408 MWYNLHNSGDVHQKTQHCACPVVVGSKYVLTKWIN 442
>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
Length = 550
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 73/244 (29%), Positives = 115/244 (47%), Gaps = 44/244 (18%)
Query: 17 SLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHL---------INLAKS 65
+L+ R +F+++ + P K+++IS P Y L+D E + L IN +
Sbjct: 321 NLVCRYNFTTSPFLQLAPMKLEEISLDPYIVQYHDVLSDNEIEDLKREGIKGTMIN-GWT 379
Query: 66 QLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVL 125
LK S +N ES+ R + I I+ I +I T E + IQ+
Sbjct: 380 SLKSSNATEN---ESRTIVARVA----IMSPSLEIVQRINRRIIDMTGFNIEESKTIQLA 432
Query: 126 RYEHGQKYEPHYDYFSDKV----NIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRR 181
+ G + PHYDY D++ + + G R+A+V+ Y DV +GG T FP
Sbjct: 433 AFSVGGFFMPHYDYLYDRLLDTDVLKKLGDRVASVIFYAGDVTEGGATNFP--------- 483
Query: 182 TPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+ + V+P++G AL +++ + PDP SLHS CPV+ G +W+ TKWI
Sbjct: 484 ------------RNQLVVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGSRWTITKWI 531
Query: 242 HVDS 245
H DS
Sbjct: 532 HQDS 535
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 101/213 (47%), Gaps = 22/213 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P KV++I+ +P + L D E + L L + +L R+ V D + + +D R S
Sbjct: 311 LGPWKVEEIAKQPYVVRFFDILNDNEINSLERLGEEKLARATVFDPATHKLVNADYRVSK 370
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRG 149
++ + +I+ T L E E +Q+ Y G +YEPHYDY + +I
Sbjct: 371 SAWLKDEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYDYSRREWDIY-N 429
Query: 150 GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLF 209
R+AT L YL+ V +GG TVF + G+ ++ +G A+ +
Sbjct: 430 NRRIATWLSYLTTVEQGGGTVF---------------------TELGLHIRSIKGSAVFW 468
Query: 210 FSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
++L N D + H+ CPV+ G KW + KWIH
Sbjct: 469 YNLLPNGSGDERTRHAACPVLRGNKWVSNKWIH 501
>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'Black Sea 11']
gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Black Sea 11']
Length = 354
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 95/200 (47%), Gaps = 25/200 (12%)
Query: 46 VYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFI-PKGKDAIIAGI 104
+Y L++ EC +LI S L+ S V D L+G K+ +VRTS I P D I +
Sbjct: 158 LYVDVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNVRTSYVAIIAPSYCDWITRKL 217
Query: 105 EDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFS---DKVNIVRGGHRLATVLMYLS 161
+ I+ T P+ NGE + +LRY GQ+Y+PHYD + D G R+ T L+YL+
Sbjct: 218 DKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGSMYKDGKQRIKTALVYLN 277
Query: 162 DVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPV 221
V +GGET FP K I+V P G+ ++F + +
Sbjct: 278 TVRQGGETRFP---------------------KLDISVSPTLGNMVVFSNSDESGKLLLN 316
Query: 222 SLHSGCPVIEGEKWSATKWI 241
S H G P KW TKWI
Sbjct: 317 SYHLGAPTFSENKWLVTKWI 336
>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
Length = 508
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/223 (30%), Positives = 112/223 (50%), Gaps = 33/223 (14%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
I+ P K+++IS +P VY L D + LI LA+ +L+ + V + E++ SD R++
Sbjct: 292 ILAPFKMEEISLEPYIVVYHDILPDKDMQQLIALAEPRLRPTEVFEEDKSEARTSD-RSA 350
Query: 89 SGTFIPKGKDAIIAG------IEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSD 142
GTF+P KD +G + ++ T + + +++Y G +Y ++D+F+
Sbjct: 351 LGTFLP-FKDMNPSGGPLLDRLTQRMRDITGIQIRHENTFNIIKYGFGSQYATNFDFFNG 409
Query: 143 KVNIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKP 201
+ + G G R+ATVL YL+D GG TVFP + + V
Sbjct: 410 TNSEMEGYGDRMATVLFYLNDAPNGGATVFPRID---------------------VKVTA 448
Query: 202 RRGDALLFFSLH--TNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
RG L + +L+ T+ + +P +LH+ CPV +G KW WIH
Sbjct: 449 ERGKVLFWHNLNGETHDV-EPNTLHAACPVFQGSKWVMAAWIH 490
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 100/214 (46%), Gaps = 22/214 (10%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K+++ S P Y L+ + L +A ++RS V G++K S R S
Sbjct: 318 LAPLKMEEHSLDPFVVTYHDMLSPNKIAQLREMAVPHMRRSTVNPLPGGQNKKSSFRVSK 377
Query: 90 GTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNI-VR 148
++ + + ++ T L E +QV Y G YEPH+D+F + +
Sbjct: 378 NAWLAYETHPTMGKMLRDLSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAE 437
Query: 149 GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALL 208
G+R+AT + YLS+V +GG T FP AV+P+ G+ L
Sbjct: 438 EGNRIATAIYYLSEVEQGGATAFPFL---------------------NFAVRPQLGNVLF 476
Query: 209 FFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+++LH ++ D + H+GCPV++G KW WIH
Sbjct: 477 WYNLHRSSDMDYRTKHAGCPVLKGSKWIGNVWIH 510
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 84/161 (52%), Gaps = 28/161 (17%)
Query: 86 RTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYF----- 140
RTS + G+DA+ IE +IA P ++GE +QVLRY G +Y PHYDYF
Sbjct: 6 RTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAA 65
Query: 141 SDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVK 200
+ + GG R+A+++MYL+ +GG T FP+A L A KG AV
Sbjct: 66 GTPILLQAGGQRVASLVMYLNTPERGGATRFPDAH-------------LDVAAVKGNAV- 111
Query: 201 PRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
FFS + P SLH+G PV+ GEKW ATKW+
Sbjct: 112 --------FFS-YDRPHPMTRSLHAGAPVLTGEKWVATKWL 143
>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
Length = 584
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 106/219 (48%), Gaps = 28/219 (12%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
++ P + + I +P +Y F++D E + LA+ L+RS VA G+ + R S
Sbjct: 373 LLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRSVVASG--GKQLQVEYRIS 430
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SDK 143
++ D ++ + +IA T L E +QV+ Y G YEPH+D+ S
Sbjct: 431 KSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSP 490
Query: 144 VNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ ++ G+R+AT ++YLS V GG T F A +V +
Sbjct: 491 LFRMKSGNRVATFMIYLSSVEAGGATAFIYA---------------------NFSVPVVK 529
Query: 204 GDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 530 NAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 568
>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
Length = 536
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 106/220 (48%), Gaps = 28/220 (12%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++++S P +Y L+D E + L ++K L+R+ V G +++ R++
Sbjct: 311 LAPFRMEELSLDPYVVLYHNVLSDPEIEKLKPMSKPFLERAKVFRVEKGSDEIAPSRSAD 370
Query: 90 GTFIPKGKD-----AIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
G ++P ++ I +I T L +G +Q L+Y G + PHYDYF+ K
Sbjct: 371 GAWLPHQDTDPDDLEVLRRIGRRIKDLTGLNTRSGSQMQFLKYGFGGHFVPHYDYFNSKT 430
Query: 145 NIV-RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
+ + R G R+ATVL YL++V GG T FP K + V ++
Sbjct: 431 SYLERVGDRIATVLFYLNNVDHGGATAFP---------------------KLNLVVPTQK 469
Query: 204 GDALLFFSLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIH 242
G AL + +L + D + H CP+I G K T+WI+
Sbjct: 470 GSALFWHNLDRKSYDYDTCTFHGACPLISGTKLVMTRWIY 509
>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
Length = 515
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 119/233 (51%), Gaps = 37/233 (15%)
Query: 17 SLLIRKSFSSTAIIN--PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVAD 74
+L+ R + S+ A + P K+++IS P ++ ++D + + + + + +
Sbjct: 296 NLVCRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKDIEEM---------KGEITE 346
Query: 75 NLSGESKLSDVR-TSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKY 133
+G + L D + S + + + + I +I+ T E IQ+ + G +
Sbjct: 347 MENGWTSLGDPKEIVSRVYWIRKESSFSKRINQRISDMTGFKLEEFPAIQLANFGVGGYF 406
Query: 134 EPHYDYFSDKVNIV----RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDL 189
+PHYD+++D++ V G R+ +++ Y +V++GG+TVFP+ +
Sbjct: 407 KPHYDFYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQGGQTVFPDLK-------------- 452
Query: 190 SECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+AV+P++G+AL +F+ ++ PDP SLHS CPV+ G +W+ TKW+H
Sbjct: 453 -------VAVEPKKGNALFWFNAFDDSTPDPRSLHSVCPVLVGSRWTITKWLH 498
>gi|260665980|ref|YP_003212934.1| hypothetical protein H665_p111 [Ostreococcus tauri virus 1]
gi|260160998|emb|CAY39699.1| hypothetical protein OTV1_111 [Ostreococcus tauri virus 1]
Length = 185
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 99/202 (49%), Gaps = 31/202 (15%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+PR V + F+T+ E H+I A+ +L+ S VA+N + K+ D S T D +
Sbjct: 9 EPR--VIKEFITEEERKHIIRKAQKKLEVSTVAENRVVDKKVRD----SETAWLDDSDPV 62
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYL 160
+ + +K + T P N E IQVLRY+ G Y PH D FSD +G R+ TV++ L
Sbjct: 63 VKRVMEKCVSLTDRPLVNCEHIQVLRYKPGGHYSPHQDTFSD----TKGNKRMYTVILGL 118
Query: 161 SDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDP 220
+D +GGET FPN ++ K + GDAL F +L +
Sbjct: 119 NDDYEGGETEFPNLKK---------------------KYKLKAGDALFFHTLDNYELMTS 157
Query: 221 VSLHSGCPVIEGEKWSATKWIH 242
+LH G PV GEKW W+H
Sbjct: 158 KALHGGRPVESGEKWICNLWVH 179
>gi|314055201|ref|YP_004063539.1| prolyl 4-hydroxylase [Ostreococcus tauri virus 2]
gi|313575092|emb|CBI70105.1| prolyl 4-hydroxylase [Ostreococcus tauri virus 2]
gi|388548689|gb|AFK65891.1| prolyl 4-hydroxylase alpha subunit [Ostreococcus lucimarinus virus
OlV6]
Length = 199
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 95/197 (48%), Gaps = 29/197 (14%)
Query: 46 VYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAIIAGIE 105
V G +T E +H++ A S+L S VA+N + K+ D T+ D ++ +
Sbjct: 24 VVRGLVTPKEREHIMKKASSKLDVSTVAENRIIDKKIRDSETAWLDM----DDPVVKRVC 79
Query: 106 DKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAK 165
+K + T P N E +QVLRY+ G Y+PH D FSD +G R+ TV++ L+D +
Sbjct: 80 EKCVSLTDRPLTNCEQLQVLRYKPGGHYKPHQDTFSD----TKGNKRMYTVILALNDEYE 135
Query: 166 GGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHS 225
GGET FPN ++ K + GDAL F +L + +LH
Sbjct: 136 GGETEFPNLKK---------------------KYKLKAGDALFFHTLDNYELMTSKALHG 174
Query: 226 GCPVIEGEKWSATKWIH 242
G PV GEKW W+H
Sbjct: 175 GRPVKSGEKWVCNLWVH 191
>gi|195379216|ref|XP_002048376.1| GJ13933 [Drosophila virilis]
gi|194155534|gb|EDW70718.1| GJ13933 [Drosophila virilis]
Length = 521
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 107/216 (49%), Gaps = 28/216 (12%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGT 91
P K+++IS P +Y L+D E + + L+ ++ A N ++ D+ +G
Sbjct: 315 PLKLEEISHDPYIVMYHNVLSDSEIEEMKQLS-VLMENGLSATNKPNNTEPLDIVARAGW 373
Query: 92 FIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY-FSDKVN---IV 147
+ + I +I T + + + Y G ++PHYDY + +V+ +
Sbjct: 374 LVEA--TPFLERINRRITDMTGFDVLDMWAVLLANYGIGNYFKPHYDYMYGGRVSGEAVA 431
Query: 148 RGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDAL 207
G R+AT++ Y SDVA+GG T FP+ + +AV+P++G++L
Sbjct: 432 ELGERIATLIFYASDVAQGGATNFPDIQ---------------------VAVQPQKGNSL 470
Query: 208 LFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHV 243
++++ + PDP SLHS CP I G +W+ TKW+H+
Sbjct: 471 FWYNMFDDGTPDPRSLHSVCPTIVGSRWTLTKWLHM 506
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/220 (30%), Positives = 107/220 (48%), Gaps = 30/220 (13%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLS-DVRT 87
++ P + + + +P +Y F++D E + A L+RS VA SGE + + R
Sbjct: 348 LLQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA---SGEKQQQVEYRI 404
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKE--NGEDIQVLRYEHGQKYEPHYDYF---SD 142
S ++ D ++ ++ +IA T L + E +QV+ Y G YEPH+D+ S
Sbjct: 405 SKSAWLKDTVDPMLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSS 464
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+ + G+R+AT ++YLS V GG T F A +V
Sbjct: 465 PLYRMNSGNRVATFMIYLSSVEAGGSTAFIYA---------------------NFSVPVV 503
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
+ AL +++LH + D +LH+GCPV+ G+KW A KWIH
Sbjct: 504 KNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 543
>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
12442]
Length = 195
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 99/208 (47%), Gaps = 36/208 (17%)
Query: 41 KPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSSGTFIPKGKDAI 100
+P YE +T EC LI L+K ++ A A +GE R S T++P +
Sbjct: 11 EPFVAQYEQIITPAECQELIELSKKHIQ-PAQAYGHTGE------RKSDFTWLPHYSHGL 63
Query: 101 IAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYD----YFSDKVNIV-RGGHRLAT 155
++ + + IAT LP + E +Q RYE G K++ H D + D N V +GG RL T
Sbjct: 64 VSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQGGQRLYT 123
Query: 156 VLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLH-- 213
++YL+ V GGET FP+ + V P G L+F +
Sbjct: 124 AILYLNTVNAGGETFFPSL---------------------NLTVTPSEGKLLVFENCKRG 162
Query: 214 TNAIPDPVSLHSGCPVIEGEKWSATKWI 241
TN P P+SLH GC V EGEKW AT W
Sbjct: 163 TNE-PHPLSLHEGCAVHEGEKWIATLWF 189
>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 508
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 110/216 (50%), Gaps = 37/216 (17%)
Query: 42 PRAFVYEGFLTDLECDHLINLA-KSQLKRSAVADNLSGESKLSD-VRTSSGTFIPKGKDA 99
PR F + FL+D+E +HL+N+A K +LKRS + S E+ +D RTS+ +IP+ +D
Sbjct: 288 PRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTSTNDWIPRHQDL 347
Query: 100 I----------IAGIEDKIATW---TFLPKEN------GEDIQVLRYEHGQKYEPHYDY- 139
I + +++ + W + +P+ E +Q++ Y+ GQ+Y PH+D+
Sbjct: 348 ITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVGQQYTPHHDFT 407
Query: 140 FSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAV 199
VN+ R AT+L YL+D GGET FP ++ + V
Sbjct: 408 MPGLVNMQPS--RFATLLFYLNDDMDGGETAFPRWLH-------------ADEEGGSLKV 452
Query: 200 KPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKW 235
KP +G A+LF++L + D S H+ PV GEKW
Sbjct: 453 KPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKW 488
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 101/224 (45%), Gaps = 29/224 (12%)
Query: 32 PSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAV---ADNLSGESKLSDVRTS 88
P K++++ P + + + L A+ ++KRS V A N G+S + RTS
Sbjct: 316 PFKLEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGN--GDSTAAAFRTS 373
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVR 148
G ++A + + ++ L E ED+QV Y G YEPH+D F D
Sbjct: 374 QGASFNYSRNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQE 433
Query: 149 G---GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGD 205
G G+R+AT + YLSDV GG T FP + V P RG
Sbjct: 434 GDLHGNRIATAIYYLSDVEAGGGTAFPFLP---------------------LLVTPERGS 472
Query: 206 ALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKI 249
L +++LH + D + H+ CPV++G KW A WI + D +
Sbjct: 473 LLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDNV 516
>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-2 [Callithrix jacchus]
Length = 579
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 112/242 (46%), Gaps = 45/242 (18%)
Query: 25 SSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSD 84
+S +I P K + P Y ++D E + + +AK +L R+ V D +G ++
Sbjct: 343 ASQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVAS 402
Query: 85 VRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY----- 139
R S +++ + D ++A + ++ T L + E +QV Y G +YEPH+D+
Sbjct: 403 YRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPF 462
Query: 140 ----------------FSDKVNIVR---GGHRLATVLMYLSDVAKGGETVFPNAEEPPRR 180
++D+ + + G+R+AT L Y+SDV GG TVFP+
Sbjct: 463 DSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------ 516
Query: 181 RTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKW 240
G A+ P++G A+ +++L + D + H+ CPV+ G KW + KW
Sbjct: 517 ---------------GAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVGCKWVSNKW 561
Query: 241 IH 242
H
Sbjct: 562 FH 563
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 75/146 (51%), Gaps = 21/146 (14%)
Query: 97 KDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATV 156
+D ++ I ++ ++ L ED+QV+ Y G YEPHYD+ DK + G+R+AT
Sbjct: 10 EDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIATF 69
Query: 157 LMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRRGDALLFFSLHTNA 216
L YLSDV GG TVF + G V P++GDA +++L +
Sbjct: 70 LSYLSDVEAGGGTVF---------------------TRVGATVWPQKGDAAFWYNLKRSG 108
Query: 217 IPDPVSLHSGCPVIEGEKWSATKWIH 242
D + H+ CPV+ G KW A KWIH
Sbjct: 109 DGDSSTRHAACPVLVGSKWVANKWIH 134
>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
Length = 538
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/223 (29%), Positives = 103/223 (46%), Gaps = 35/223 (15%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESK-LSDVRT 87
I+ P+K + + P +Y +TD E D + ++K +L RS V G K + D RT
Sbjct: 326 ILVPAKEEVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRT 385
Query: 88 SSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY-------- 139
S +I + +I + ++ + T L + E QV+ Y G YEPH+D+
Sbjct: 386 SKSAWIEDEEHPMIRRVSERTSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIAT 445
Query: 140 FSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAV 199
F +V G+R+ TV+ Y++ GG TVFP+ G+ +
Sbjct: 446 FDPEV-----GNRIITVIFYVAAPEAGGATVFPDL---------------------GVKL 479
Query: 200 KPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
P +G ++++L N D + H+GCP I G KW A KW H
Sbjct: 480 WPEKGSCAVWWNLMRNGEGDYRTKHAGCPTITGSKWIANKWYH 522
>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
Length = 527
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 106/221 (47%), Gaps = 30/221 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++++S P VY L+D E + + + LKRS V D + S RT+
Sbjct: 311 LAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKGNKMSTSKRRTAL 370
Query: 90 GTFIPKGK-----DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
G ++P A+I I +I T L + +D+Q+++Y +G Y+ H+DYF+
Sbjct: 371 GAWLPDDNMDVSGRAVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTST 430
Query: 145 NIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
I + G R+ATVL YL+D+ GG T F + + + V R
Sbjct: 431 PITKARGDRMATVLFYLNDMKHGGSTAFTDLQ---------------------LKVPSER 469
Query: 204 GDALLFFSL--HTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G L ++++ T+ + D +LH CPVI G K + WIH
Sbjct: 470 GKVLFWYNMRGETHDV-DSRTLHGACPVINGTKTILSCWIH 509
>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
Length = 173
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 88/193 (45%), Gaps = 61/193 (31%)
Query: 83 SDVRTSSGTFIPKGKDA---------------------------IIAGIEDKIATWTFLP 115
SDVRTSSG F+ + IE +I+ ++ +P
Sbjct: 10 SDVRTSSGMFLSPDDSTYPIVRVFVVPPMEGFWNSCGLSNSLCLFLQAIEKRISVYSQVP 69
Query: 116 KENGEDIQVLRYEHGQKYEPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAE 175
ENGE IQ N+ RGG R+AT+L+YLSD +GGET FP A
Sbjct: 70 VENGELIQF--------------------NLKRGGQRVATMLIYLSDNVEGGETYFPMA- 108
Query: 176 EPPRRRTPATNDDLSECAKK---GIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEG 232
C K G++V P +G+A+LF+S+ + DP S+H GC V+ G
Sbjct: 109 ----------GSGFCRCGGKSVRGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAG 158
Query: 233 EKWSATKWIHVDS 245
EKWSATKW+ S
Sbjct: 159 EKWSATKWMRQRS 171
>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
Length = 555
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 110/238 (46%), Gaps = 45/238 (18%)
Query: 29 IINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTS 88
+I P K + P Y ++D E + +AK +L R+ V D +G ++ R S
Sbjct: 323 LIAPFKEEDEWDSPHIVRYYDVMSDEEIQRIKEIAKPKLARATVRDPKTGVLTVASYRVS 382
Query: 89 SGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDY--------- 139
+++ + D ++A + ++ T L + E +QV Y G +YEPH+D+
Sbjct: 383 KSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGL 442
Query: 140 ------------FSDKVNIVR---GGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPA 184
++D+ ++ + G+R+AT L Y+SDV GG TVFP+
Sbjct: 443 KTEGNRLATFLNYNDEQDVFKHLGTGNRVATFLNYMSDVEAGGATVFPDL---------- 492
Query: 185 TNDDLSECAKKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G A+ P++G A+ +++L + D + H+ CPV+ G KW + KW H
Sbjct: 493 -----------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 539
>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
Length = 516
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 103/220 (46%), Gaps = 28/220 (12%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++++S P VY L D E + + + LKRS V D + S RT+
Sbjct: 300 LAPLRMEELSLDPYIVVYHNVLCDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTAL 359
Query: 90 GTFIPKGK-----DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
G ++P A+I I +I T L + +D+Q+++Y +G Y+ H+DYF+
Sbjct: 360 GAWLPDDNMDVSGRAVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSS 419
Query: 145 NIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
I + G R+ATVL YL+DV GG T F + + + V R
Sbjct: 420 PITKARGDRMATVLFYLNDVKHGGSTAFTDLQ---------------------LKVPSER 458
Query: 204 GDALLFFSLHTNAIP-DPVSLHSGCPVIEGEKWSATKWIH 242
G L ++++ D +LH CPVI+G K + WIH
Sbjct: 459 GKVLFWYNMRGETHDLDSRTLHGACPVIDGTKTILSCWIH 498
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/219 (29%), Positives = 109/219 (49%), Gaps = 34/219 (15%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P K++ +S P V+ + D E HL N A+ L RS V + + ES +S VRT+
Sbjct: 279 LAPFKMELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVKKS-NNESVVSKVRTAK 337
Query: 90 GTFIPKGKDA-----IIAGIEDKIATWTFL--PKENGEDIQVLRYEHGQKYEPHYDYFSD 142
G F+ + + ++ ++ ++ + L +E ++Q L Y+ G Y H DYF+
Sbjct: 338 GAFMHADRLSPESAQVVQRLKQRMGDLSDLNIKREGYNEMQYLNYDFGDHYLLHMDYFNI 397
Query: 143 KVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPR 202
+N R+AT L+YL+DV +GG T+FP ++ AV P
Sbjct: 398 SMN-----DRIATFLIYLNDVTRGGGTIFPQVKQ---------------------AVHPE 431
Query: 203 RGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWI 241
+G +L++++++N + SLH CPV+ G K + WI
Sbjct: 432 KGKLILWYNMNSNLDYELASLHGACPVLIGRKIAIVYWI 470
>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
Length = 527
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 106/221 (47%), Gaps = 30/221 (13%)
Query: 30 INPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVADNLSGESKLSDVRTSS 89
+ P +++++S P VY L+D E + + + LKRS V D + S RT+
Sbjct: 311 LAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTAL 370
Query: 90 GTFIPKGK-----DAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKYEPHYDYFSDKV 144
G ++P A+I I +I T L + +D+Q+++Y +G Y+ H+DYF+
Sbjct: 371 GAWLPDDNMDVSGRAVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTST 430
Query: 145 NIVRG-GHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECAKKGIAVKPRR 203
I + G R+ATVL YL+D+ GG T F + + + V R
Sbjct: 431 PITKARGDRMATVLFYLNDMKHGGSTAFTDLQ---------------------LKVPSER 469
Query: 204 GDALLFFSL--HTNAIPDPVSLHSGCPVIEGEKWSATKWIH 242
G L ++++ T+ + D +LH CPVI G K + WIH
Sbjct: 470 GKVLFWYNMRGETHDL-DSRTLHGACPVINGTKTILSCWIH 509
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,019,036,358
Number of Sequences: 23463169
Number of extensions: 218079457
Number of successful extensions: 432364
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1474
Number of HSP's successfully gapped in prelim test: 690
Number of HSP's that attempted gapping in prelim test: 426452
Number of HSP's gapped (non-prelim): 2579
length of query: 296
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 155
effective length of database: 9,050,888,538
effective search space: 1402887723390
effective search space used: 1402887723390
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)